Avenue for exploration / vqgan + crepe #8

johndpope · 2021-10-31T05:31:29Z

This image represents audio pitch detection via crepe. Is it possible to use something like vqgan to glue this and unagan together ?
you could basically explore the latent space by drawing different squiggles

I have lots of vocal stems to help out testing.

johndpope · 2021-10-31T06:14:16Z

I found this - https://github.com/v-iashin/SpecVQGAN
It's not quite solving what I want above - but potentially simplifies the problem.
Take a picture - and spit out audio - change the squiggle - get a different rendition of vocals.

johndpope mentioned this issue Nov 2, 2021

bending the re/de-constructed melspectrogram to create new sounds. v-iashin/SpecVQGAN#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avenue for exploration / vqgan + crepe #8

Avenue for exploration / vqgan + crepe #8

johndpope commented Oct 31, 2021 •

edited

Loading

johndpope commented Oct 31, 2021

Avenue for exploration / vqgan + crepe #8

Avenue for exploration / vqgan + crepe #8

Comments

johndpope commented Oct 31, 2021 • edited Loading

johndpope commented Oct 31, 2021

johndpope commented Oct 31, 2021 •

edited

Loading