Compares the performance of STT (speech-to-text transcription) models on classroom audio data.
https://tcu.app.box.com/file/1339927312124?s=gdr527kvqai17wtnhqc03kr8yq2iwc2h
The audio is completely unprocessed.
AWS Transcribe 1
- https://aws.amazon.com/transcribe/
- Used the English general model
- NO CUSTOMIZATION
- NO special vocabulary
- Results in aws_transcribe_1.json (see the sketch below)
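A minimal sketch of how this run could be reproduced with boto3; the region, bucket, audio key, and job name below are placeholders, not the ones actually used:

```python
# Hypothetical reproduction of the AWS Transcribe 1 run: general English model,
# no custom language model, no custom vocabulary.
import time
import boto3

transcribe = boto3.client("transcribe", region_name="us-east-1")  # placeholder region

job_name = "classroom-audio-transcribe-1"          # placeholder job name
audio_uri = "s3://my-bucket/classroom_audio.wav"   # placeholder S3 location

transcribe.start_transcription_job(
    TranscriptionJobName=job_name,
    Media={"MediaFileUri": audio_uri},
    MediaFormat="wav",
    LanguageCode="en-US",
)

# Poll until the job finishes; the transcript JSON can then be downloaded
# from the returned URI and saved as aws_transcribe_1.json.
while True:
    job = transcribe.get_transcription_job(TranscriptionJobName=job_name)
    if job["TranscriptionJob"]["TranscriptionJobStatus"] in ("COMPLETED", "FAILED"):
        break
    time.sleep(30)

print(job["TranscriptionJob"].get("Transcript", {}).get("TranscriptFileUri"))
```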
Whisper 1
- https://github.com/openai/whisper/
- Used Large v2 model
- NO CUSTOMIZATION
- NO special vocabulary
- Results in whisper_1.json (see the sketch below)
- Took around 10 minutes to run on 1 hour of audio, on ml.cs.tcu.edu
- RESULTS: better than AWS Transcribe 1
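A minimal sketch of the Whisper 1 run, assuming the openai-whisper package; the audio path is a placeholder:

```python
# Hypothetical reproduction of the Whisper 1 run: Large v2, default decoding,
# no prompt and no special vocabulary.
import json
import whisper

model = whisper.load_model("large-v2")

# Plain transcription of the (placeholder) classroom recording.
result = model.transcribe("classroom_audio.wav", language="en")

with open("whisper_1.json", "w") as f:
    json.dump(result, f, indent=2)
```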
Whisper 2
- First used noise reduction on the audio
- Then used Large v2 model
- NO CUSTOMIZATION
- NO special vocabulary
- Results in whisper_2.json (see the sketch below)
- Took around 10 minutes to run on 1 hour of audio, on ml.cs.tcu.edu
- RESULTS: worse than Whisper 1
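A minimal sketch of the Whisper 2 pipeline. The notes do not name the noise-reduction tool, so the noisereduce package stands in for it here as an assumption; file paths are placeholders:

```python
# Hypothetical reproduction of the Whisper 2 run: denoise first, then transcribe
# with the same Large v2 model and no customization.
import json
import noisereduce as nr
import soundfile as sf
import whisper

# Load the original recording (assumed mono) and apply spectral-gating noise reduction.
audio, sample_rate = sf.read("classroom_audio.wav")
reduced = nr.reduce_noise(y=audio, sr=sample_rate)
sf.write("classroom_audio_denoised.wav", reduced, sample_rate)

model = whisper.load_model("large-v2")
result = model.transcribe("classroom_audio_denoised.wav", language="en")

with open("whisper_2.json", "w") as f:
    json.dump(result, f, indent=2)
```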
Files:
- Original audio file
- Reduced noise audio file
- AWS Transcribe 1
- Whisper 1
- Whisper 2
- AssemblyAI results
I used the WER (word error rate) metric to compare the results of the models; the lower the WER, the better the model.
Unfortunately, I do not have a ground-truth transcript for this audio.
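If a ground-truth transcript becomes available, the comparison could be scored with the jiwer package. A sketch, assuming placeholder file names and that each model's output has already been flattened to plain text:

```python
# Hypothetical WER comparison: requires a reference transcript, which does not
# exist yet for this audio. Lower WER is better.
import jiwer

with open("ground_truth.txt") as f:          # placeholder reference transcript
    reference = f.read()

hypotheses = {
    "aws_transcribe_1": open("aws_transcribe_1.txt").read(),
    "whisper_1": open("whisper_1.txt").read(),
    "whisper_2": open("whisper_2.txt").read(),
}

for name, hypothesis in hypotheses.items():
    print(name, jiwer.wer(reference, hypothesis))
```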