Audio-Enhanced Conversational AI Chatbot
People consume a wealth of information through videos, webinars, and podcasts, but lack an efficient way to query specific information within them. The challenge lies in converting long-form audio or video content into an interactive experience in which users can ask questions and retrieve precise information without manually searching or listening through the entire file.
Develop a tool that transforms video or audio content into an interactive, conversational experience by transcribing audio, processing content for context, and providing accurate, real-time answers to user questions.
- Transcribing audio from YouTube videos.
- Processing and organizing this content for effective retrieval.
- Creating a summary of the entire transcript.
- Enabling users to query and receive accurate, context-based answers from the transcription using a language model.
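The "processing and organizing" and "query" steps above can be illustrated with a minimal, dependency-free sketch. It is not the project's actual implementation: it assumes the transcript is already available as plain text, and it swaps in simple keyword overlap for whatever retrieval method the app really uses (such tools often rely on embeddings and a vector store). The function names `chunk_transcript`, `tokenize`, and `top_chunks` are hypothetical.

```python
import re
from collections import Counter


def chunk_transcript(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split a transcript into overlapping windows of words."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a window has reached the end of the transcript.
        if start + chunk_size >= len(words):
            break
    return chunks


def tokenize(text: str) -> Counter:
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def top_chunks(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank chunks by keyword overlap with the question (a stand-in for embeddings)."""
    question_tokens = tokenize(question)
    scored = []
    for index, chunk in enumerate(chunks):
        chunk_tokens = tokenize(chunk)
        score = sum(min(question_tokens[w], chunk_tokens[w]) for w in question_tokens)
        # Negative index keeps earlier chunks first when scores tie.
        scored.append((score, -index, chunk))
    scored.sort(reverse=True)
    return [chunk for score, _, chunk in scored[:k] if score > 0]
```

In a full pipeline, the chunks returned by `top_chunks` would be passed to the language model as context alongside the user's question, so the model answers from the transcript instead of from memory.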
git clone <repository-url>
cd <project-folder>
conda create -p env python=3.10 -y
conda activate env/
pip install -r requirements.txt
Create a .env file and add the required key-value pairs:
ASSEMBLY_AI_KEY = your_api_key
OPENAI_API_KEY = your_secret_key
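A quick way to confirm that both keys are visible to Python before starting the app is sketched below. It only inspects environment variables. How app.py actually loads the .env file is not stated here (a loader such as python-dotenv is a common choice, but that is an assumption), and the helper name `missing_keys` is hypothetical.

```python
import os

# Key names taken from the .env example above.
REQUIRED_KEYS = ("ASSEMBLY_AI_KEY", "OPENAI_API_KEY")


def missing_keys(env=None) -> list[str]:
    """Return the required keys that are absent or blank in the given mapping."""
    env = os.environ if env is None else env
    return [key for key in REQUIRED_KEYS if not env.get(key, "").strip()]


if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        print("Missing keys:", ", ".join(missing))
    else:
        print("All required keys are set.")
```

Passing a plain dictionary to `missing_keys` makes the check easy to exercise without touching the real environment.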
streamlit run app.py
This project is licensed under the MIT License. See the LICENSE file for details.