Your dataset should be like:
└───wavs
    ├───dev
    │   ├───LJ001-0001.wav
    │   ├───...
    │   └───LJ050-0278.wav
    └───train
        ├───LJ002-0332.wav
        ├───...
        └───LJ047-0007.wav
Could you give me a link to your dataset (AccelSpeech-Full.zip), or be more specific about what the difference between dev and train is? Also, why does train start at 2 and 332 and end at 47 and 7?
Thanks
As I understand it, this is an example from the LJSpeech dataset. I'd also like to know: what is the minimum number of wav files needed for train and dev?
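For anyone trying to reproduce the layout, here is a minimal sketch of one way to build it. It assumes the dev/train split is just a random held-out partition of a flat folder of wav files; the thread does not confirm how the original split was made. The function name `split_wavs`, the `dev_fraction` default, and the `seed` parameter are all hypothetical choices for illustration, not part of the project.

```python
import random
import shutil
from pathlib import Path


def split_wavs(src_dir, out_dir, dev_fraction=0.1, seed=0):
    """Copy wav files from src_dir into out_dir/wavs/dev and out_dir/wavs/train.

    Assumption: a random held-out split. dev_fraction and seed are
    illustrative defaults, not values taken from the project.
    """
    files = sorted(Path(src_dir).glob("*.wav"))

    # Shuffle a copy with a fixed seed so the split is reproducible.
    shuffled = files[:]
    random.Random(seed).shuffle(shuffled)

    # Hold out at least one file for dev when any files exist.
    n_dev = max(1, int(len(shuffled) * dev_fraction)) if shuffled else 0
    dev = set(shuffled[:n_dev])

    for f in files:
        subset = "dev" if f in dev else "train"
        target = Path(out_dir) / "wavs" / subset
        target.mkdir(parents=True, exist_ok=True)
        shutil.copy2(f, target / f.name)

    dev_names = sorted(f.name for f in dev)
    train_names = sorted(f.name for f in files if f not in dev)
    return dev_names, train_names
```

If the split was made this way, then listing either folder alphabetically would start and end at whichever files happened to land in it, which could be why train begins at LJ002-0332 rather than LJ001-0001. That is a guess, not something the maintainers have stated here.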