Avenue for exploration / vqgan + crepe #8

Open
johndpope opened this issue Oct 31, 2021 · 1 comment

johndpope commented Oct 31, 2021

https://github.com/marl/crepe

This image shows audio pitch detection via crepe. Is it possible to use something like vqgan to glue this and unagan together?
You could then explore the latent space by drawing different squiggles, i.e. different pitch contours.

I have lots of vocal stems to help out testing.
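
To make the "squiggle" idea concrete, here is a minimal sketch of how a pitch contour could be rasterized into a small image that an image model might consume. It is only an illustration: `contour_to_image`, the bin count, and the frequency range are hypothetical choices, and the contour is synthetic. With real stems, the per-frame frequencies would come from something like `crepe.predict`, which returns time, frequency, confidence, and activation arrays.

```python
import math

def contour_to_image(freqs_hz, n_bins=64, fmin=65.0, fmax=1000.0):
    """Rasterize a pitch contour into a binary image.

    Rows are log-frequency bins (high pitch at the top), columns are frames.
    n_bins, fmin and fmax are illustrative assumptions, not crepe's settings.
    """
    image = [[0] * len(freqs_hz) for _ in range(n_bins)]
    log_span = math.log2(fmax / fmin)
    for col, f in enumerate(freqs_hz):
        if f <= 0:
            # Treat non-positive values as unvoiced: leave the column blank.
            continue
        f = min(max(f, fmin), fmax)            # clamp into the plotted range
        pos = math.log2(f / fmin) / log_span   # 0.0 (fmin) .. 1.0 (fmax)
        row = min(int(pos * n_bins), n_bins - 1)
        image[n_bins - 1 - row][col] = 1
    return image

# Synthetic contour: a one-octave glide from 220 Hz upward, with one unvoiced frame.
contour = [220.0 * 2 ** (i / 20) for i in range(21)]
contour[10] = 0.0
img = contour_to_image(contour)

# Print the squiggle as ASCII art.
for row in img:
    print("".join("#" if px else "." for px in row))
```

Drawing a different squiggle then just means editing the list of frequencies (or the image itself) before handing it to whatever model does the conditioning.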

johndpope (Author) commented

I found this: https://github.com/v-iashin/SpecVQGAN
It doesn't quite solve what I want above, but it potentially simplifies the problem.
Take a picture and spit out audio; change the squiggle and get a different rendition of the vocals.
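
As a rough picture of the "change the squiggle, get different audio" loop, the sketch below drives a plain sine oscillator from a pitch contour. This is a deliberately trivial stand-in, not SpecVQGAN or any learned model: `contour_to_sine` is a hypothetical helper, and the 10 ms frame length and 16 kHz sample rate are assumptions chosen for illustration.

```python
import math

def contour_to_sine(freqs_hz, frame_s=0.01, sr=16000):
    """Render a pitch contour as a sine tone, one frequency value per frame.

    A toy stand-in for a learned picture-to-audio model. Frames with a
    non-positive frequency are treated as unvoiced and rendered as silence.
    """
    n_per_frame = int(round(frame_s * sr))
    samples = []
    phase = 0.0
    for f in freqs_hz:
        for _ in range(n_per_frame):
            if f > 0:
                # Accumulate phase so the tone stays continuous across frames.
                phase += 2.0 * math.pi * f / sr
                samples.append(math.sin(phase))
            else:
                samples.append(0.0)
    return samples

# Two different "squiggles" give two different renditions.
rising = [220.0 + 5.0 * i for i in range(50)]
falling = list(reversed(rising))
falling[0] = 0.0  # mark the first frame as unvoiced

audio_rising = contour_to_sine(rising)
audio_falling = contour_to_sine(falling)
print(len(audio_rising), len(audio_falling))
```

A generative model would replace the oscillator, but the interface stays the same: contour in, waveform out.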
