-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Open
Description
I was planning to use csm-1b as the final layer of a real-time voice agent (VAD -> ASR -> LLM -> TTS) that I was working on. However, due to its super slow inference time, it seems like an unrealistic goal at this point.
This is the repo I'm working on: https://github.com/asiff00/On-Device-Speech-to-Speech-Conversational-AI
I currently use kokoro as the TTS engine, which what makes it possible to run in real-time.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels