This repository contains my master's thesis project from the National Polytechnic School of Oran, Algeria, focusing on exploring the fine-tuning process of Large Language Models (LLMs) for audio summarization.
The project aims to develop a web-based audio summarization platform by leveraging the power of fine-tuned LLMs, specifically the Whisper model. The platform provides an efficient way to generate summaries from audio content.
audio-summarization/
├── code/
│ ├── notebooks/ # Fine-tuning notebooks for Whisper model
│ └── app/ # Streamlit web application
├── dataset/ # Training dataset for model fine-tuning
└── audio_sunerisation.pdf # Thesis documentation
- Fine-tuning implementation of Whisper model
- Streamlit-based web interface
- Audio processing and summarization pipeline
- Custom dataset integration
- Clone the repository
- Install dependencies
- Run the Streamlit app:
cd code/app
streamlit run app.py
For detailed information about the project, methodology, and findings, please refer to the thesis document (audio_sunerisation.pdf
).
Oussama MAHDJOUR Master's Student - Information Systems National Polytechnic School of Oran, Algeria