voxtral
Here are 41 public repositories matching this topic...
Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
-
Updated
May 25, 2026 - Python
C++ ggml runtime hub for multilingual ASR models: Cohere Transcribe, Parakeet TDT, Voxtral, Canary 1B v2, etc, plus universal forced alignment via NeMo Forced Aligner-style CTC, and others. Fork of whisper.cpp.
-
Updated
May 26, 2026 - C++
Super STT enables effortless voice-to-text in any application, using the most advanced speech models.
-
Updated
May 26, 2026 - Rust
Offline Speech-to-Text (STT) service using Mistral's Voxtral model with Wyoming protocol compatibility for Home Assistant Assist integration.
-
Updated
Mar 15, 2026 - Python
Voxtral is a state-of-the-art model developed to handle both speech transcription and audio understanding with remarkable accuracy and efficiency. This demo interface lets you run the Voxtral model on powerful GPUs to evaluate its performance and see how it can be used for transcription and deeper analysis.
-
Updated
Jul 26, 2025 - Python
Effortless Push-to-Talk Transcription, Anywhere.
-
Updated
Apr 23, 2026 - Python
A Web UI for easy subtitle using various models including voxtral
-
Updated
Jul 22, 2025 - Python
speech to text gui for different (e.g. Whisper, Voxtral) models and backends, including whisper.cpp, crispasar, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization
-
Updated
May 18, 2026 - Python
Voxtral Codec : Combining Semantic VQ and Acoustic FSQ for Ultra-Low Bitrate Speech Generation (Voxtral TTS Backbone)
-
Updated
Mar 27, 2026 - Python
Professional local-first AI production pipeline for long-form narration. Clone voices and generate studio-grade audiobooks (M4B/MP3) using Coqui XTTS-v2 and support for Voxtral (cloud)
-
Updated
May 26, 2026 - Python
Experimentation with Voxtral-Mini-4B-Realtime-2602 and DeepL API for live translation
-
Updated
Mar 23, 2026 - Astro
github mirror for radioshaq - ham radio full time quarterback and part-time lobster
-
Updated
Mar 15, 2026 - Python
Enterprise-grade speech-to-text toolkit with pluggable backends (Whisper, Voxtral). Features speaker diarization, 80%+ test coverage, CI/CD quality gates, and fully offline operation.
-
Updated
Feb 18, 2026 - Python
Local implementation for voxtral
-
Updated
Dec 20, 2025 - C++
Real-time cloud-based speech-to-text for Windows. Powered by Mistral's realtime transcription API
-
Updated
Feb 9, 2026 - Python
🐝 Telegram AI agent designed for distracted users (like drivers). Powered by ElevenLabs, Voxtral, and a fine-tuned Ministral 3B using Nvidia Brev!
-
Updated
Mar 8, 2026 - Swift
🔊 Streamline audio processing with Voxtral.c, a pure C implementation for Mistral AI's Voxtral 4B model, featuring real-time transcription and low memory use.
-
Updated
May 26, 2026 - C
Desktop studio for local and remote Voxtral TTS with preset voices, Mistral API support, export controls, and a premium resizable UI.
-
Updated
Mar 29, 2026 - Python
Real-time phone scam detection powered by Mistral's Voxtral Mini - analyzes live audio and transcripts to identify fraud patterns
-
Updated
Mar 2, 2026 - Python
Improve this page
Add a description, image, and links to the voxtral topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the voxtral topic, visit your repo's landing page and select "manage topics."