A conversational AI system with voice + text input, powered by Ollama (local LLM), Whisper speech-to-text, and XTTS v2 text-to-speech.
- 🐍 Python 3.8+
- 🎵 FFmpeg (for audio processing)
- 🦙 Ollama (for local LLM serving)
- 🎤 A working microphone (for voice input)
- Clone the repo
git clone <repository-url>
cd Convo-Ai
- Create a virtual environment
# macOS/Linux
python3 -m venv venv
source venv/bin/activate
# Windows
python -m venv venv
.\venv\Scripts\activate
- Install dependencies
pip install -r requirements.txt
- Install FFmpeg
# macOS
brew install ffmpeg
# Ubuntu/Debian
sudo apt-get install ffmpeg
# Windows (Chocolatey)
choco install ffmpeg
- Install & start Ollama
# Download: https://ollama.ai/download
ollama serve
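To confirm the LLM backend is working before starting the app, you can hit Ollama's local REST API directly. This is just a smoke test, not part of the project; the model name `llama3` is an assumption — use any model you have pulled with `ollama pull`.

```python
# Quick smoke test for the local Ollama server (assumes the default port 11434).
# The model name is an example; use one you have pulled, e.g. `ollama pull llama3`.
import json
import urllib.request

OLLAMA = "http://localhost:11434"

# List the models Ollama has installed.
with urllib.request.urlopen(f"{OLLAMA}/api/tags") as resp:
    models = [m["name"] for m in json.load(resp)["models"]]
print("Installed models:", models)

# Ask for a short, non-streaming completion.
payload = json.dumps({
    "model": "llama3",
    "prompt": "Say hello in one sentence.",
    "stream": False,
}).encode()
req = urllib.request.Request(f"{OLLAMA}/api/generate", data=payload,
                             headers={"Content-Type": "application/json"})
with urllib.request.urlopen(req) as resp:
    print(json.load(resp)["response"])
```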
- Start the server
source venv/bin/activate # or .\venv\Scripts\activate on Windows
python server.py
- Run the client
source venv/bin/activate
python talk.py
- Choose input mode
  - 🎤 Voice → Press `r` to start/stop recording
  - ⌨ Text → Type and press Enter
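If you'd rather script the text mode than drive `talk.py` interactively, a minimal client could look like the sketch below. The WebSocket URL and the plain-text message format are assumptions (check `config.json` and `server.py` for the real values), and it assumes the `websockets` package is available.

```python
# Minimal text-mode client sketch. The endpoint URL and message format are
# assumptions; adjust them to match config.json and server.py.
import asyncio
import websockets

WS_URL = "ws://localhost:8000/ws"  # assumed default; see config.json

async def chat() -> None:
    async with websockets.connect(WS_URL) as ws:
        while True:
            text = input("you> ").strip()
            if not text:          # empty line exits the loop
                break
            await ws.send(text)   # send the user's message
            reply = await ws.recv()
            print("ai >", reply)

if __name__ == "__main__":
    asyncio.run(chat())
```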
- 🎤 Voice and text input modes
- 🤖 Natural language processing with Ollama
- 🔊 Text-to-speech output (XTTS v2)
- 📝 Conversation history & session logging
- 🎭 Mood analysis
- 🌐 Optional web interface (FastAPI + WebSocket)
- 🔒 Privacy-first: all processing runs locally
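The conversation history and session-logging feature listed above writes to the `logs/` directory. Purely as an illustration (the project's actual file names and fields may differ), a JSON-Lines layout for per-session logs could look like this:

```python
# Illustrative only: append conversation turns as JSON Lines under logs/.
# The real file names and fields used by Convo-Ai may differ.
import json
import time
from pathlib import Path
from typing import Optional

LOG_DIR = Path("logs")
LOG_DIR.mkdir(exist_ok=True)

def log_turn(session_id: str, role: str, text: str, mood: Optional[str] = None) -> None:
    """Append one conversation turn to the session's .jsonl log file."""
    entry = {"ts": time.time(), "role": role, "text": text, "mood": mood}
    with open(LOG_DIR / f"{session_id}.jsonl", "a", encoding="utf-8") as f:
        f.write(json.dumps(entry, ensure_ascii=False) + "\n")

log_turn("demo-session", "user", "Hello there!")
log_turn("demo-session", "assistant", "Hi! How can I help?", mood="friendly")
```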
Edit `config.json` to adjust:
- Voice model
- Speed & pitch
- WebSocket URL
- LLM settings
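As a sketch of how those settings might be consumed, the loader below reads `config.json` and falls back to defaults. Every key name here is an assumption — open the `config.json` shipped with the repo for the real schema.

```python
# Sketch of reading config.json. All key names below are assumptions; the real
# schema is defined by the config.json in the repository.
import json
from pathlib import Path

DEFAULTS = {
    "voice_model": "xtts_v2",
    "speech_speed": 1.0,
    "speech_pitch": 1.0,
    "websocket_url": "ws://localhost:8000/ws",
    "llm": {"model": "llama3", "temperature": 0.7},
}

def load_config(path: str = "config.json") -> dict:
    """Merge user settings from config.json over the defaults above."""
    cfg = dict(DEFAULTS)
    p = Path(path)
    if p.exists():
        cfg.update(json.loads(p.read_text(encoding="utf-8")))
    return cfg

print(load_config())
```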
- Backend → Python, FastAPI
- Speech-to-Text → OpenAI Whisper
- LLM → Ollama (local)
- TTS → XTTS v2
- Frontend → HTML + JavaScript
- Realtime → WebSocket
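To show how the pieces of this stack fit together, here is a heavily stripped-down sketch (not the project's `server.py`): a FastAPI WebSocket endpoint that forwards text to the local Ollama API and returns the reply. The real server additionally transcribes incoming audio with Whisper and synthesizes speech with XTTS v2; the endpoint path and model name are assumptions.

```python
# Stripped-down illustration of the stack — not the project's server.py.
# Text only: the real server also runs Whisper on incoming audio and XTTS v2 on replies.
import requests
from fastapi import FastAPI, WebSocket, WebSocketDisconnect

app = FastAPI()
OLLAMA_URL = "http://localhost:11434/api/generate"

def ask_llm(prompt: str) -> str:
    """Send one prompt to the local Ollama server and return its reply."""
    r = requests.post(OLLAMA_URL, json={"model": "llama3", "prompt": prompt, "stream": False})
    r.raise_for_status()
    return r.json()["response"]

@app.websocket("/ws")
async def chat(websocket: WebSocket) -> None:
    await websocket.accept()
    try:
        while True:
            user_text = await websocket.receive_text()  # Whisper output would arrive here
            reply = ask_llm(user_text)                  # synchronous call, fine for a sketch
            await websocket.send_text(reply)            # the real server also returns TTS audio
    except WebSocketDisconnect:
        pass
```

A sketch like this can be run with `uvicorn <module>:app`, assuming `uvicorn` is installed.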
convo-ai-isolated/
├── src/
│   ├── server.py
│   └── talk.py
├── templates/
│   └── index.html
├── static/
├── logs/
├── tts_cache/
├── config.json
├── requirements.txt
├── README.md
└── HOWTO.md
- ✅ Virtual environment activated?
- ✅ Dependencies installed?
- ✅ FFmpeg installed?
- ✅ Ollama running?
- ✅ Microphone permissions granted?
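A small script can check most of that list automatically (microphone permissions are OS-level and aren't covered here):

```python
# Quick environment check for the items above: Python version, virtualenv,
# FFmpeg on PATH, and whether the Ollama server answers on its default port.
import shutil
import sys
import urllib.request

print("Python:", sys.version.split()[0], "(need 3.8+)")
print("Virtualenv active:", sys.prefix != sys.base_prefix)
print("FFmpeg on PATH:", shutil.which("ffmpeg") is not None)

try:
    urllib.request.urlopen("http://localhost:11434/api/tags", timeout=2)
    print("Ollama reachable: True")
except OSError:
    print("Ollama reachable: False (is `ollama serve` running?)")
```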
- Fork the repository
- Create a feature branch (`git checkout -b feature/AmazingFeature`)
- Commit (`git commit -m 'Add some AmazingFeature'`)
- Push (`git push origin feature/AmazingFeature`)
- Open a Pull Request