Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't seem to generalize well #6

Closed
andreaferretti opened this issue Jun 29, 2022 · 5 comments
Closed

Doesn't seem to generalize well #6

andreaferretti opened this issue Jun 29, 2022 · 5 comments

Comments

@andreaferretti
Copy link

I tried to test the performance of DDRM by adding a photo of myself to the ood_celeba directory and running

python main.py --ni --config celeba_hq.yml --doc celeba --timesteps 20 --eta 0.85 --etaB 1 --deg sr4 --sigma_0 0.05 -i celeba_hq_sr4_sigma_0.05

While the examples from the original CelebA set are reconstructed very accurately, results on this additional photo are less than optimal, let's say :-) Now I'm not a celebrity, but I was wondering whether you have an idea of what is happening here
me-face
0_-1
.

@andreaferretti
Copy link
Author

Same issue with Scarlett Johansson
scarlett

2_-1

@bahjat-kawar
Copy link
Owner

Hello,
CelebA is an aligned face dataset. This means that the facial features such as the eyes and the mouth are always aligned to specific positions in the image. The images you tested are not aligned to the same positions as in the CelebA dataset, which explains the artefacts you've shown here.
Replacing the CelebA model with a different diffusion model that trained on non-aligned faces should achieve much better results. You can possibly try the ImageNet models, but ImageNet models are notoriously bad at human faces, so it may not be ideal.
A different solution would be to align the face images you want to use in a similar way that the authors of the CelebA dataset did.
I hope this helps.

@andreaferretti
Copy link
Author

Thank you, it was very silly of me to miss this! :-)

@jiamings
Copy link
Collaborator

jiamings commented Jul 1, 2022

There is a FFHQ checkpoint from https://github.com/jychoi118/ilvr_adm/ that you might be interested in checking out as well. It is not too hard to add this into our code, as their model is also based on the OpenAI's ADM model. Afaik, the CelebA model provided by the DDPM authors has some color shift issues if you try to use it for unconditional generation.

@andreaferretti
Copy link
Author

Nice to know, thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants