WebSocket terminal server with voice input support and Claude AI intelligent interaction.
- WebSocket Terminal - Real-time terminal web interface based on Axum framework
- Voice Input - Speech-to-text support for executing commands via voice
- Claude AI Integration - AI-assisted terminal interaction using
echokit_terminal - Multiple ASR Support
- OpenAI Whisper API
- Alibaba Cloud Paraformer real-time speech recognition (todo)
Create a .env file and configure the Whisper API (Groq recommended):
ASR_API_KEY=your_api_key_here
ASR_URL=https://api.groq.com/openai/v1/audio/transcriptions
ASR_MODEL=whisper-large-v3
ASR_LANG=en
ASR_PROMPT=# Run directly with cargo
cargo run -- -- claude
# Or build and run
cargo build --release
./target/release/vibetty -- claudeFor more options, use --help:
cargo run -- --helpVisit: http://localhost:3000 after starting the service.
Change the current working directory via HTTP API.
Endpoint: POST /api/change-dir
Request:
curl -X POST http://localhost:3000/api/change-dir \
-H "Content-Type: application/json" \
-d '{"path": "/path/to/directory"}'Example:
# Change to absolute path
curl -X POST http://localhost:3000/api/change-dir \
-H "Content-Type: application/json" \
-d '{"path": "/home/user/documents"}'
# Change to relative path
curl -X POST http://localhost:3000/api/change-dir \
-H "Content-Type: application/json" \
-d '{"path": "../parent-folder"}'Note: This endpoint only accepts requests from localhost for security reasons.