Is there any possibility to extend the project as a Stable Diffusion extension? #26

Open
kaninbu opened this issue May 21, 2023 · 5 comments

Comments

@kaninbu

kaninbu commented May 21, 2023

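Just asking: do you have any plans to integrate this into Stable Diffusion?

It would be a big help to many people, especially for taking control when Stable Diffusion generates strange images and steering the output into what you actually want.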

@andrewbutts

Is it easy enough to optimize a Stable Diffusion latent image instead of a GAN latent code? It seems like it would be possible, but maybe a lot slower.

@PDillis
Contributor

PDillis commented Jun 26, 2023

No. That would be another paper in itself, as you'd have to focus entirely on the Stable Diffusion architecture (DragGAN uses only the inner representations of the StyleGAN2 Generator).

image

It is a good idea, but isn't as straightforward as in GANs, so perhaps you will need to focus on the neighborhood of the synthesized images during the diffusion process in the image space as in Align Your Latents.

@LianShuaiLong

These are two different kinds of generative model; they can't be integrated.

@PDillis
Contributor

PDillis commented Jun 27, 2023

There is now a project inspired by this, though I have no idea when they'll release code or how well it works (it's a work in progress after all): https://arxiv.org/abs/2306.14435

image

@zjysteven

zjysteven commented Jun 27, 2023

> There is now a project inspired by this, though I have no idea when they'll release code or how well it works (it's a work in progress after all): https://arxiv.org/abs/2306.14435
>
> image

Not trying to be negative in any sense, but it seems from this figure that DragDiffusion does not show a clear advantage over DragGAN (most of the illustrated movements/changes are kinda small). I wouldn't be surprised if this is indeed the case though, as I'd imagine dragging the latents of a diffusion model would be more difficult given its forward/diffusion process.
