NanoVox

One function call. Natural-sounding speech. Any CPU. No API keys.

from nanovox import speak
speak("Ship voice to production without a GPU.", output="demo.wav")

That's it. No cloud accounts, no GPU drivers, no 10 GB model downloads. Works on laptops, Raspberry Pis, CI servers, and Docker containers: anything that runs Python 3.8+.

Why NanoVox?

Most TTS solutions make you choose between quality (which usually needs a GPU or a cloud API) and convenience (which often sounds robotic).

NanoVox skips the tradeoff:

CPU-only - no CUDA, no GPU, no cloud
Three quality tiers - pick your speed/quality balance
One function call - speak("text") and you're done
Auto-downloads models - the first use of each model fetches its weights; after that it runs offline
MIT licensed - use it anywhere, modify freely
No API keys - fully local after first download

Built on Piper ONNX models, with a small Python wrapper that handles downloading, caching, and loading them.

Install

pip install nanovox

Requires Python 3.8+. The piper-tts dependency is installed automatically.

Quick Start

Python

from nanovox import speak

# Default (nano model - fastest)
speak("Hello world", output="hello.wav")

# Better quality
speak("Production-grade speech.", output="out.wav", model="small")

# Best quality
speak("Crystal clear narration.", output="out.wav", model="high")

# Adjust speed
speak("Slow and clear.", output="out.wav", model="small", speed=0.85)

CLI

nanovox "Hello world"
nanovox "Hello world" -o hello.wav --model high
echo "Pipe from stdin" | nanovox -o piped.wav
nanovox --info  # Show available models
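
For scripts that shell out to the CLI instead of importing the package, the invocation can be assembled as an argument list. This is a minimal sketch that assumes only the `-o` and `--model` flags shown above; `build_nanovox_cmd` and `run_nanovox` are hypothetical helpers, not part of NanoVox.

```python
import subprocess
from typing import List, Optional


def build_nanovox_cmd(text: str, output: Optional[str] = None,
                      model: Optional[str] = None) -> List[str]:
    """Build an argv list for the nanovox CLI (hypothetical helper)."""
    cmd = ["nanovox", text]
    if output is not None:
        cmd += ["-o", output]        # flag taken from the CLI examples above
    if model is not None:
        cmd += ["--model", model]    # flag taken from the CLI examples above
    return cmd


def run_nanovox(text: str, output: Optional[str] = None,
                model: Optional[str] = None) -> None:
    """Run the CLI; raises CalledProcessError on a non-zero exit code."""
    # Passing a list (no shell=True) avoids quoting problems in the text.
    subprocess.run(build_nanovox_cmd(text, output, model), check=True)
```

Building the list separately from running it keeps the command easy to log and to test without NanoVox installed.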

Models

Model	Quality	Download Size	Best For
`nano`	Good	~15 MB	Prototyping, notifications, CI pipelines
`small`	Better	~61 MB	Voice assistants, content generation
`high`	Best	~109 MB	Narration, podcasts, production audio

Models auto-download on first use to ~/.cache/nanovox/voices/. After that, fully offline.

All models are English (US) voices from the Piper project:

nano - Amy (low quality, 16kHz)
small - Lessac (medium quality, 22kHz)
high - Lessac (high quality, 22kHz)
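
On bandwidth-limited devices it can help to choose the tier from a download budget. The sketch below copies the approximate sizes from the table above; `pick_model` is a hypothetical helper, not a NanoVox function.

```python
# (name, approximate download size in MB), ordered from lowest to highest
# quality. Sizes are copied from the Models table and are approximate.
MODELS = [("nano", 15), ("small", 61), ("high", 109)]


def pick_model(max_download_mb: float) -> str:
    """Return the highest-quality model whose download fits the budget."""
    for name, size_mb in reversed(MODELS):
        if size_mb <= max_download_mb:
            return name
    raise ValueError(f"no model fits within {max_download_mb} MB")
```

The returned name can be passed straight to the `model` argument of `speak()`.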

Use Cases

AI agents that need to speak (OpenClaw, LangChain, AutoGPT)
Accessibility - add voice to any Python app
Content pipelines - generate voiceovers in CI/CD
IoT / edge - speech on Raspberry Pi, Jetson, any ARM device
Prototyping - test voice UX without cloud vendor lock-in
Podcasts / narration - batch-generate audio from scripts
Notifications - voice alerts from monitoring systems
Offline apps - no internet required after first model download
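
As an illustration of the batch narration use case, here is a minimal sketch that renders each non-empty line of a script to its own WAV file. `narrate_script` and the `line_001.wav` naming are hypothetical; the speak function is passed in as a parameter so the helper can be exercised without NanoVox installed.

```python
from pathlib import Path
from typing import Callable, Iterable, List


def narrate_script(lines: Iterable[str], out_dir: str,
                   speak_fn: Callable[..., str],
                   model: str = "nano") -> List[str]:
    """Render each non-empty line to a numbered WAV file in out_dir."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)

    # Skip blank lines so numbering follows the spoken lines only.
    texts = [line.strip() for line in lines if line.strip()]

    paths = []
    for index, text in enumerate(texts, start=1):
        wav_path = target / f"line_{index:03d}.wav"
        # speak() is documented to return the output file path.
        paths.append(speak_fn(text, output=str(wav_path), model=model))
    return paths
```

With NanoVox installed, pass the real function: `narrate_script(open("script.txt"), "audio", speak, model="high")`.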

API Reference

`speak(text, output, model, speed)`

Param	Type	Default	Description
`text`	str	required	Text to synthesize
`output`	str	`"output.wav"`	Output file path
`model`	str	`"nano"`	`"nano"`, `"small"`, or `"high"`
`speed`	float	`1.0`	Speech rate (0.5 = slow, 2.0 = fast)

Returns the output file path.
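
Because the return value is a file path, the result can be inspected with the standard library. The sketch below assumes the output is an uncompressed PCM WAV file that Python's `wave` module can read; `wav_duration_seconds` is a hypothetical helper, not part of NanoVox.

```python
import wave


def wav_duration_seconds(path: str) -> float:
    """Return the playback length of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        # Duration = number of audio frames / frames per second.
        return wav_file.getnframes() / wav_file.getframerate()
```

For example, `wav_duration_seconds(speak("Hello world", output="hello.wav"))` would report how long the generated clip is.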

`synthesize(text, output, model, speed)`

Alias for speak() with identical signature.

Environment Variables

Variable	Default	Description
`NANOVOX_CACHE`	`~/.cache/nanovox`	Model download directory
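
To locate downloaded voices from your own tooling (for example, to pre-seed a Docker image), the directory can be resolved the same way. This is a sketch under one assumption: that an overridden `NANOVOX_CACHE` keeps the same `voices/` subdirectory as the default path. `voices_dir` is a hypothetical helper.

```python
import os
from pathlib import Path


def voices_dir() -> Path:
    """Resolve the voices directory (assumed layout: <cache>/voices)."""
    # Fall back to the documented default when the variable is unset or empty.
    cache_root = os.environ.get("NANOVOX_CACHE") or "~/.cache/nanovox"
    return Path(cache_root).expanduser() / "voices"
```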

How It Works

NanoVox wraps Piper TTS ONNX voice models with:

Automatic model management - downloads, caches, and loads the right model
Simple Python API - no config files, no boilerplate
CLI tool - shell one-liner for scripting

ONNX Runtime runs inference on the CPU, so neither PyTorch nor TensorFlow is required. The models are the neural Piper voices listed under Models (Amy and Lessac).
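
The download-once behavior described above can be pictured as a check-then-fetch cache. This is an illustrative sketch of that general pattern, not NanoVox's actual implementation: the `<name>.onnx` file naming, the `.part` temporary suffix, and the `fetch` callback are all assumptions.

```python
from pathlib import Path
from typing import Callable


def ensure_model(name: str, cache_dir: Path,
                 fetch: Callable[[str, Path], None]) -> Path:
    """Return the cached model path, calling fetch only on a cache miss."""
    model_path = cache_dir / f"{name}.onnx"  # assumed file naming
    if not model_path.exists():
        cache_dir.mkdir(parents=True, exist_ok=True)
        # Download under a temporary name so an interrupted fetch is
        # never mistaken for a complete model on the next run.
        partial = model_path.with_name(model_path.name + ".part")
        fetch(name, partial)
        partial.replace(model_path)
    return model_path
```

The second call for the same model returns immediately without touching the network, which is what makes later runs work offline.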

Contributing

PRs welcome. Ideas:

More voices - add different speakers and accents
Streaming output - real-time audio generation
SSML support - pauses, emphasis, pronunciation control
Multi-language - extend beyond English

git clone https://github.com/ThankNIXlater/nanovox
cd nanovox
pip install -e ".[dev]"

License

MIT - see LICENSE.

Built by Nix - independent AI intelligence.
