acoustic-language-id

Spoken Language Detection using the Kaggle Spoken Language Identification data set.

This work is for the final project of the Machine Learning (CSCI E-89 (16392)) course at Harvard Extension School.

Download the “train” and “test” data sets from https://www.kaggle.com/toponowicz/spoken-language-identification. This will result in two files: “train.zip” and “test.zip”.
Unzip the two files into “train” and “test” folders.
In the “test” folder, you should see three sub-folders: “de”, “en”, and “es”. Move all the files under the sub-folders to the “test” folder, and delete the empty sub-folders. This step ensures that the train and test sets have the same directory structure.
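The flattening step above can also be scripted. The following is a minimal sketch, not part of the repository: the helper name `flatten_test_dir` is hypothetical, and it assumes the sub-folders contain only regular files whose names do not collide once they are moved into the parent folder.

```python
import shutil
from pathlib import Path


def flatten_test_dir(test_dir):
    """Move every file from the immediate sub-folders of test_dir into
    test_dir itself, then delete the sub-folders that are left empty.

    Hypothetical helper (not part of run-exp.py). Returns the number of
    files moved.
    """
    test_dir = Path(test_dir)
    moved = 0
    # Snapshot the sub-folders first so the directory is not modified
    # while it is being iterated.
    subdirs = sorted(p for p in test_dir.iterdir() if p.is_dir())
    for subdir in subdirs:
        for item in sorted(subdir.iterdir()):
            if not item.is_file():
                continue  # leave nested folders untouched
            target = test_dir / item.name
            if target.exists():
                # Assumption: file names are unique across sub-folders.
                raise FileExistsError(f"{target} already exists")
            shutil.move(str(item), str(target))
            moved += 1
        # Remove the sub-folder only if nothing is left inside it.
        if not any(subdir.iterdir()):
            subdir.rmdir()
    return moved
```

Calling `flatten_test_dir("test")` from the directory that contains the unzipped “test” folder should leave all audio files directly under “test”, matching the layout of “train”.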

python run-exp.py \
	--train-dir <train dir> \
	--test-dir <test dir> \
	--save-model-dir <directory to save model file>

By default, this trains the CNN model. To train the RNN model instead, change the code so that it builds the RNN model.

python run-exp.py \
	--train-dir <train dir> \
	--test-dir <test dir> \
	--load-model <model file>

This outputs the final accuracy, which is 94.07%.
