Reconstruction and editing with gamma=0 and eta=0 #5

sashapff · 2025-01-29T20:09:39Z

Could you please explain why, when using the plain Euler scheme without the controller modifications from the article (that is, gamma=0 and eta=0), the reconstruction and the edited image do not fall into the data distribution? Is it because a large numerical error accumulates, or is there another reason?

I attach an example using the prompts 'a cat' for the reconstruction (guidance_scale=1.0) and 'a tiger' for the editing (guidance_scale=3.5).

LituRout · 2025-01-30T03:01:32Z

Hi Alexandra, thanks for your interest in our paper. Are you using the prompt 'a cat' with guidance scale 1.0 while inverting the cat image with gamma=0? If so, that is the problem: you are removing the catness while inverting the image. The catness comes from the text-conditional score, which is active for any nonzero guidance. Ideally, you would invert with zero guidance scale (no text conditioning), which preserves the catness in the structured noise y_1. Then, if you initialize the reverse flow with y_1, you should get a more cat-like image in the reconstruction. Please let me know if this resolves your concern. Happy to provide any further clarification.
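
The order of operations suggested above can be sketched in a few lines of NumPy. This is only a toy illustration under stated assumptions, not the repository's code: `guided_velocity` is a hypothetical linear stand-in for the model's velocity field, the names `invert` and `reverse` are made up for this example, and the convention `v_uncond + guidance * (v_cond - v_uncond)` (so that guidance 0 means no text conditioning) is assumed. The sketch reconstructs with the same zero guidance used for inversion so that the round trip is a plain ODE inversion; this thread does not specify which guidance to use for the reverse pass in practice.

```python
import numpy as np


def guided_velocity(x, prompt_vec, guidance):
    """Toy stand-in for the model's velocity field (hypothetical, not the real network)."""
    v_uncond = -x                # assumed unconditional drift
    v_cond = prompt_vec - x      # assumed text-conditional drift
    # Assumed convention: guidance = 0 drops the text-dependent term entirely.
    return v_uncond + guidance * (v_cond - v_uncond)


def euler_flow(x, prompt_vec, guidance, t_start, t_end, num_steps):
    """Plain Euler integration from t_start to t_end (the gamma=0, eta=0 case)."""
    dt = (t_end - t_start) / num_steps   # negative when integrating backwards
    x = np.asarray(x, dtype=float).copy()
    for _ in range(num_steps):
        x = x + dt * guided_velocity(x, prompt_vec, guidance)
    return x


def invert(y0, num_steps=1000):
    """Image y_0 -> structured noise y_1, with zero guidance (no text conditioning)."""
    return euler_flow(y0, np.zeros_like(y0), 0.0, 0.0, 1.0, num_steps)


def reverse(y1, prompt_vec, guidance, num_steps=1000):
    """Structured noise y_1 -> image, integrating from t=1 back to t=0."""
    return euler_flow(y1, prompt_vec, guidance, 1.0, 0.0, num_steps)


# Toy "image" and toy prompt embeddings (all values are illustrative).
y0 = np.array([1.0, -2.0, 0.5])
cat = np.array([0.3, 0.1, -0.4])
tiger = np.array([0.8, -0.2, 0.6])

y1 = invert(y0)                                   # invert without text conditioning
reconstruction = reverse(y1, cat, guidance=0.0)   # same field as inversion
edited = reverse(y1, tiger, guidance=3.5)         # reverse flow started from y_1

print("reconstruction error:", np.linalg.norm(reconstruction - y0))
```

In this toy setting the reconstruction error comes only from Euler discretization and shrinks as `num_steps` grows, because the reverse pass integrates the same field that was used for inversion. It says nothing about how large that error is for the real model.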
