French-Spoken-Digits-Dataset

French equivalent to MNIST Audio Dataset

Executing ./script.sh will enhance the dataset with pitch-shifted and noisy samples, also uniting resulting audio and adding the /voices audio used as an unknown class, and generate a .csv file for using in Tensorflow

You will also find utilitary scripts like slice.sh that will convert all .mp3 files in /raw to an equivalent .wav and then isolate unique digits and output each digits of the raw audio with the right naming convention. It is usefull when adding a new speaker to the dataset, after following the instructions of Guide_enregistrement.pdf.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
original		original
raw		raw
utilities		utilities
voice		voice
French_digits_perso.csv		French_digits_perso.csv
French_digits_perso.xlsx		French_digits_perso.xlsx
Guide_enregistrement.pdf		Guide_enregistrement.pdf
README.md		README.md
audio_splitter.py		audio_splitter.py
create_csv.py		create_csv.py
data_augmentation.py		data_augmentation.py
rename_edge.py		rename_edge.py
resampling.sh		resampling.sh
script.sh		script.sh
slice.sh		slice.sh
testsplit_0123.wav		testsplit_0123.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

French-Spoken-Digits-Dataset

About

Artifeel/FrenchAudioDataset

Folders and files

Latest commit

History

Repository files navigation

French-Spoken-Digits-Dataset

About

Resources

Stars

Watchers

Forks