Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about Painter Paper #55

Open
FrancescoSaverioZuppichini opened this issue Sep 2, 2023 · 1 comment
Open

Questions about Painter Paper #55

FrancescoSaverioZuppichini opened this issue Sep 2, 2023 · 1 comment

Comments

@FrancescoSaverioZuppichini

Hi there 👋

We are a community of CV Engineers and we were reading Visual Prompting via Image Inpainting

We would like to ask a couple of questions:

Why did you create the dataset in that way? It is not similar to the final input and you could have created something way easier by taking normal CV segmentation datasets and composing the grid image.

This is the image in the paper for training, yet it is unclear how you do inference. Can you give me some pseudo code assuming we take as input $x$ (image) and $m$ the mask part that we will have to fill

image

How did you find the right z_i for each "patch" token coming from MAE?

Could you give us the intuition on why you are doing the training in this way and not directly predicting the patch tokens on the missing parts?

Thank you

Cheers,

Fra

@IcecreamArtist
Copy link

Seems like this paper is not from the authors of this code repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants