A zero-dependency C implementation of Moonshine, a fast and accurate automatic speech recognition (ASR) model.
All weights are exported as float32 from HuggingFace.
pip install -r scripts/requirements.txtDownloads weights from HuggingFace and converts them to .bin files.
# Export both tiny and base (default)
python scripts/export-weights.py
# Export only tiny
python scripts/export-weights.py --model tinyOutput goes to models/<model>/ (e.g. models/tiny/, models/base/), each containing:
encoder.bin— float32 encoder weightsdecoder.bin— float32 decoder weightstokenizer.bin— BPE tokenizer (downloaded and converted from HuggingFace)
make
./test_process models/tiny jfk.wav