Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reconstruction and editing with gamma=0 and eta=0 #5

Open
sashapff opened this issue Jan 29, 2025 · 1 comment
Open

Reconstruction and editing with gamma=0 and eta=0 #5

sashapff opened this issue Jan 29, 2025 · 1 comment

Comments

@sashapff
Copy link

Can you please explain why, when using the Euler scheme without modifications with the controllers from the article (the case when gamma=0 and eta=0), the reconstruction and the edited image do not fall into the data distribution? Is it because such a large computational error accumulates or why?

I attach an example for prompts 'a cat' for the reconstruction (guidance_scale=1.) and 'a tiger' for the editing (guidance_scale=3.5).

Image

@LituRout
Copy link
Owner

Hi Alexandra, thanks for your interest in our paper. Are you using the prompt 'a cat' with guidance scale 1.0 while inverting the cat image with gamma=0? If yes, then this is wrong because you are removing the catness along with inverting the image. The catness comes from the text conditional score for nonzero guidance. Ideally, you would want to invert with zero guidance scale (no text conditioning), which would preserve catness in the structured noise y_1. Then, if you initialize the reverse flow with y_1, you should get more cat looking image in reconstruction. Please let me know if this resolves your concern. Happy to provide any further clarifications.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants