skipped phonemes in generated audio #13

thivux · 2024-02-26T05:31:56Z

Hi, thank you for sharing your code.

I am trying to do voice conversion from English speech to a Vietnamese speaker. To do that, I followed these steps:

1. extract units for both the English and Vietnamese datasets
2. train kmeans on both types of units & extract discrete labels
3. train soft encoder
4. extract soft units
5. train acoustic model
6. train hifigan on the Vietnamese dataset
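
To make step 2 concrete, here is a minimal sketch of what "train kmeans on both types of units & extract discrete labels" means: pool the frame-level features from both languages, fit one codebook, then label each language's frames with the nearest centroid. This is a toy pure-Python illustration, not the repository's code. The names `fit_kmeans`, `extract_labels`, `english_frames` and `vietnamese_frames` are hypothetical, the 2-dimensional features are made up, and the farthest-point initialisation is a simplification chosen to keep the example deterministic. A real run would more likely use a library such as scikit-learn's `KMeans` on the actual extracted units.

```python
import math


def nearest(centroids, x):
    """Return the index of the centroid closest to x (squared Euclidean)."""
    best, best_d = 0, math.inf
    for i, c in enumerate(centroids):
        d = sum((a - b) ** 2 for a, b in zip(x, c))
        if d < best_d:
            best, best_d = i, d
    return best


def min_sq_dist(centroids, x):
    """Squared distance from x to its closest centroid."""
    return min(sum((a - b) ** 2 for a, b in zip(x, c)) for c in centroids)


def fit_kmeans(frames, k, iters=20):
    """Plain Lloyd's k-means with farthest-point initialisation (assumption)."""
    centroids = [list(frames[0])]
    while len(centroids) < k:
        # Add the frame that is farthest from every centroid chosen so far.
        centroids.append(list(max(frames, key=lambda f: min_sq_dist(centroids, f))))
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for f in frames:
            buckets[nearest(centroids, f)].append(f)
        for i, bucket in enumerate(buckets):
            if bucket:  # an empty cluster keeps its previous centroid
                centroids[i] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return centroids


def extract_labels(centroids, frames):
    """Map each frame to the index of its nearest centroid."""
    return [nearest(centroids, f) for f in frames]


# Made-up 2-D "units" standing in for real frame-level features.
english_frames = [[0.0, 0.1], [0.1, 0.0], [0.9, 1.0]]
vietnamese_frames = [[1.0, 0.9], [1.1, 1.0], [0.0, 0.0]]

# One codebook is fitted on the pooled data from both languages.
centroids = fit_kmeans(english_frames + vietnamese_frames, k=2)
en_labels = extract_labels(centroids, english_frames)
vi_labels = extract_labels(centroids, vietnamese_frames)
print(en_labels, vi_labels)
```

Because the codebook is shared, frames from either language that sit close together in feature space receive the same discrete label, which is what the later steps rely on.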

The output for Vietnamese speech (where the input audio is Vietnamese, from a different speaker) is okay, but the output for English input is not that good: phonemes are often skipped or mispronounced. Do you have any suggestions on how I can improve the results?
