AI Meeting Notes

A Streamlit-based application for recording, managing, transcribing, and converting meeting audio into structured AI-powered meeting notes using OpenAI's Whisper and GPT-5 APIs.

Features

🎙️ Audio Recording & Upload: Record from microphone (multi-channel) or upload audio files (WAV, MP3, M4A, FLAC, OGG, WebM)
📝 AI Transcription: Multiple OpenAI models (GPT-4o Mini, GPT-4o, Whisper-1) with automatic language detection
🤖 AI Meeting Notes: Transform transcriptions into structured meeting notes with GPT-5 models
📦 Smart Compression: Automatic compression for large files (>25MB) or long audio (>20min)
⚡ Long Audio Support: Automatic chunking and parallel processing for files over 20 minutes
🔒 Secure Storage: All recordings, transcriptions, and notes are stored locally on your machine

Prerequisites

Python 3.12+
FFmpeg: brew install ffmpeg (macOS) or sudo apt-get install ffmpeg (Ubuntu)
OpenAI API Key: Get from OpenAI Platform
uv Package Manager: Install uv

Installation

# Clone repository
git clone https://github.com/DevSlem/ai-meeting-notes.git
cd ai-meeting-notes

# Install dependencies and start the app
uv run streamlit run main.py

The app opens at http://localhost:8501.

Note

Docker is not recommended for this application because it requires direct microphone access, which is difficult to configure in containers.

Quick Start

Set Up API Key: Click ⚙️ API Key Settings → Enter OpenAI API key → Save
Record/Upload: Navigate to "Record & Upload" tab
Manage: Go to "Recordings" tab to view all files
Transcribe: Click 🎙️ button next to any recording
Generate Notes: Click 📝 button to create AI meeting notes
View Full Page: Click 📖 View Full Page for distraction-free reading

Usage

Recording Audio

Select microphone and sample rate (16kHz recommended)
Click 🔴 Start Recording → Speak → ⏹️ Stop Recording
File saved automatically to recordings/ directory

Note

We provide multi-channel recording support. If you want to record both your microphone and system audio (e.g., Zoom calls), use a virtual audio device like BlackHole (macOS) or VB-Audio Virtual Cable (Windows).

Transcribing Audio

Click 🎙️ button in Recordings tab
Select model (GPT-4o Mini recommended for most cases)
Configure advanced options if needed:
- Compression: Auto-enabled for large/long files
- Chunk Overlap: 30s default (adjustable 15-120s)
- Language: Auto-detect or specify (en, ko, ja, etc.)
Click Start Transcription
View results in scrollable text area
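
The chunk overlap option can be pictured with a small sketch. This is not the app's actual code (src/audio_processor.py is not reproduced here); the function name `chunk_windows` and the 20-minute chunk length are assumptions chosen for illustration. Only the 30s default overlap comes from the options above.

```python
def chunk_windows(duration_s, chunk_s=1200.0, overlap_s=30.0):
    """Return (start, end) windows in seconds that cover the whole recording.

    Every window after the first starts overlap_s seconds before the
    previous window ended, so speech cut at a boundary appears in both.
    chunk_s is a hypothetical chunk length, not a documented value.
    """
    if chunk_s <= overlap_s:
        raise ValueError("chunk_s must be larger than overlap_s")
    windows = []
    start = 0.0
    while start < duration_s:
        end = min(start + chunk_s, duration_s)
        windows.append((start, end))
        if end >= duration_s:
            break
        # Step back by the overlap so adjacent chunks share some audio.
        start = end - overlap_s
    return windows
```

A larger overlap gives the merge step more shared text to align on, at the price of transcribing a little more audio per chunk.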

Generating AI Meeting Notes

Transcribe audio first (required)
Click 📝 button next to the recording
Select model:
- GPT-5: Best quality, complex meetings ($1.25/$10 per 1M tokens)
- GPT-5 Mini: Balanced (recommended) ($0.25/$2 per 1M tokens)
- GPT-5 Nano: Fast & affordable ($0.05/$0.40 per 1M tokens)
Choose output language (auto-detect recommended)
Configure advanced options:
- Prompt Template: Select or create custom prompts
- Reasoning Effort: minimal/low/medium/high
- Max Output Tokens: 1000-8000
Review cost estimate
Click 📝 Generate Meeting Notes

Viewing Meeting Notes

In Details: Toggle between 📝 AI Meeting Notes and 📄 Transcription
Full Page: Click 📖 View Full Page for better reading experience
- Shows metadata (model, generation time, tokens used)
- Distraction-free markdown rendering
- 🔄 Regenerate option
- ← Back to return to recordings

Managing Custom Prompts

Click 📝 Prompt Settings in sidebar
Create New Prompt:
- Click ➕ Create New Prompt
- Enter name (e.g., "technical-meeting")
- Write prompt content (use {LANGUAGE_INSTRUCTION} placeholder)
- Click 💾 Create Prompt
Edit Prompt:
- Select prompt from dropdown
- Click ✏️ Edit This Prompt
- Modify content
- Click 💾 Save Changes
Delete Prompt:
- Select prompt (except default)
- Click 🗑️ Delete This Prompt
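
How the {LANGUAGE_INSTRUCTION} placeholder might be filled in can be sketched as follows. The function name `render_prompt` and the wording of both instructions are assumptions; the app's real substitution text may differ.

```python
from typing import Optional


def render_prompt(template: str, output_language: Optional[str] = None) -> str:
    """Fill the {LANGUAGE_INSTRUCTION} placeholder in a prompt template.

    The instruction sentences below are hypothetical examples.
    """
    if output_language:
        instruction = f"Write the meeting notes in {output_language}."
    else:
        instruction = "Write the meeting notes in the same language as the transcription."
    # str.replace is used instead of str.format so that other braces in a
    # custom prompt (for example JSON snippets) are left untouched.
    return template.replace("{LANGUAGE_INSTRUCTION}", instruction)
```

A prompt that omits the placeholder passes through unchanged, so it would carry no language instruction at all under this sketch.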

Custom Naming

Click ✏️ button next to recording
Enter meaningful name (e.g., "Weekly Team Meeting")
Original filename preserved, display name stored in <filename>.json

Compression Methods

| Method | Ratio | Speed | Use Case |
|---|---|---|---|
| Recommended | 75-85% | Medium | Meetings with silence removal |
| Fast (MP3) | 60-70% | Fast | Quick compression |
| Balanced (Opus) | 65-75% | Medium | Efficient for speech |
| Custom | Varies | Varies | Specify your own FFmpeg options |

Compression auto-enabled when file >25MB or duration >20min.
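
The auto-enable rule amounts to a simple threshold check. The sketch below is an illustration, not the app's code: the name `should_compress` is hypothetical, and treating 25MB as 25 × 1024 × 1024 bytes is an assumption.

```python
MAX_SIZE_BYTES = 25 * 1024 * 1024  # 25MB (binary megabytes assumed)
MAX_DURATION_S = 20 * 60           # 20 minutes


def should_compress(size_bytes: int, duration_s: float) -> bool:
    """Return True when either documented threshold is exceeded."""
    return size_bytes > MAX_SIZE_BYTES or duration_s > MAX_DURATION_S
```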

Warning

The ratios above are approximate; actual results depend on the audio content.

API Models & Pricing

Transcription Models

| Model | Price/hour | Best For |
|---|---|---|
| `gpt-4o-mini-transcribe` | $0.18 | Most meetings (recommended) |
| `gpt-4o-transcribe` | $0.36 | Complex audio, heavy accents |
| `whisper-1` | $0.36 | When timestamps needed |
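
For a rough budget, the hourly prices can be turned into a per-recording estimate. This sketch assumes cost scales linearly with audio duration; the function name is hypothetical and actual billing may round differently.

```python
# USD per hour of audio, taken from the transcription pricing table.
PRICE_PER_HOUR = {
    "gpt-4o-mini-transcribe": 0.18,
    "gpt-4o-transcribe": 0.36,
    "whisper-1": 0.36,
}


def estimate_transcription_cost(model: str, duration_s: float) -> float:
    """Estimate transcription cost in USD, assuming pro-rata billing."""
    return PRICE_PER_HOUR[model] * duration_s / 3600
```

Under that assumption, a 90-minute meeting with `gpt-4o-mini-transcribe` comes to about $0.27.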

AI Meeting Notes Models (GPT-5 Series)

| Model | Price (Input/Output per 1M tokens) | Best For | Speed |
|---|---|---|---|
| `gpt-5` | $1.25 / $10 | Complex meetings, high quality | Slower |
| `gpt-5-mini` | $0.25 / $2 | Most meetings (recommended) | Balanced |
| `gpt-5-nano` | $0.05 / $0.40 | Simple summaries, quick notes | Fastest |
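
A cost estimate for meeting notes follows directly from the per-token prices. The sketch below is illustrative only: `estimate_notes_cost` is a hypothetical name, and how the app computes its own estimate is not documented here.

```python
# USD per 1M tokens as (input, output), taken from the GPT-5 pricing table.
PRICES_PER_1M = {
    "gpt-5": (1.25, 10.00),
    "gpt-5-mini": (0.25, 2.00),
    "gpt-5-nano": (0.05, 0.40),
}


def estimate_notes_cost(model: str, prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one meeting-notes generation."""
    input_price, output_price = PRICES_PER_1M[model]
    return (prompt_tokens * input_price + completion_tokens * output_price) / 1_000_000
```

With 1500 prompt tokens and 800 completion tokens on `gpt-5-mini`, this works out to roughly $0.002.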

Reasoning Effort Levels:

minimal: Fastest, fewest reasoning tokens
low: Quick processing (default)
medium: Balanced quality/speed
high: Best quality, more expensive

File Structure

ai-meeting-notes/
├── recordings/
│   ├── recording_20251013_145550.wav      # Audio file
│   ├── recording_20251013_145550.json     # Metadata (name, transcription, meeting notes)
│   └── recording_20251013_145550.txt      # Legacy transcription (migrated to JSON)
├── prompts/
│   └── meeting-notes/
│       ├── default.txt                    # Default prompt template
│       ├── technical-meeting.txt          # Custom prompt example
│       └── standup.txt                    # Custom prompt example
├── .config/
│   └── api_key.txt                        # OpenAI API key (auto-created)
├── src/
│   ├── audio.py                           # Audio recording
│   ├── transcription.py                   # Whisper transcription
│   ├── meeting_notes.py                   # GPT-5 meeting notes generation
│   ├── file_manager.py                    # File & metadata management
│   ├── audio_processor.py                 # Compression & chunking
│   ├── config.py                          # API key management
│   └── streamlit_ui.py                    # UI implementation
└── main.py                                # Application entry point
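
The recording filenames in the tree follow a timestamp pattern. A minimal sketch of producing such a name, assuming the pattern is `recording_YYYYMMDD_HHMMSS` as the example files suggest (the helper name is hypothetical):

```python
from datetime import datetime


def recording_filename(now: datetime, extension: str = "wav") -> str:
    """Build a name like recording_20251013_145550.wav from a timestamp."""
    return f"recording_{now.strftime('%Y%m%d_%H%M%S')}.{extension}"
```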

Metadata Structure

Each audio file has a companion .json file storing:

{
  "display_name": "Weekly Team Meeting",
  "transcription": "Meeting transcription text...",
  "transcribed_at": "2025-10-15T14:30:00",
  "meeting_notes": "# Meeting Summary\n...",
  "meeting_notes_model": "gpt-5-mini",
  "meeting_notes_generated_at": "2025-10-15T14:35:00",
  "meeting_notes_usage": {
    "prompt_tokens": 1500,
    "completion_tokens": 800,
    "total_tokens": 2300,
    "reasoning_tokens": 50
  }
}
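
Reading this metadata from outside the app could look like the sketch below. It assumes the companion file sits next to the audio file with the same stem, as in the file structure above; the helper name and the fallback to the file stem are assumptions, not the behavior of src/file_manager.py.

```python
import json
from pathlib import Path


def load_display_name(audio_path: Path) -> str:
    """Return the display name for a recording, or its file stem if unset."""
    meta_path = audio_path.with_suffix(".json")
    if meta_path.exists():
        meta = json.loads(meta_path.read_text(encoding="utf-8"))
        # Fall back to the stem when display_name is missing or empty.
        return meta.get("display_name") or audio_path.stem
    return audio_path.stem
```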

Troubleshooting

FFmpeg not found: Install with brew install ffmpeg (macOS) or sudo apt-get install ffmpeg (Ubuntu)
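
To confirm whether FFmpeg is visible to Python, a quick check with the standard library is enough. This is a diagnostic sketch, not part of the app; the function name is hypothetical.

```python
import shutil


def ffmpeg_path():
    """Return the full path to the ffmpeg executable, or None if it is not on PATH."""
    return shutil.which("ffmpeg")


if __name__ == "__main__":
    path = ffmpeg_path()
    print(f"FFmpeg found at {path}" if path else "FFmpeg not found on PATH")
```

If this reports that FFmpeg is missing right after installation, restarting the terminal so that PATH is reloaded is a common fix.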

API key issues: Click ⚙️ API Key Settings and verify your key at OpenAI Platform

Dialog not closing: Use Close/Done buttons instead of X button

Long files: App automatically chunks files >23min with intelligent merging

Empty meeting notes: Check that a transcription exists, then try regenerating with a different model

Streamlit media errors in logs: Harmless internal caching issue, does not affect functionality

Tips

General

Let app auto-decide compression (enabled only when beneficial)
Rename recordings immediately for easy identification
Monitor API usage at OpenAI Usage Dashboard

Transcription

Use GPT-4o Mini for most meetings
Specify language code for better accuracy if auto-detect fails
For long meetings, increase chunk overlap if transcription seems disjointed

AI Meeting Notes

Start with GPT-5 Mini (best balance of quality/cost)
Use GPT-5 Nano for quick summaries or budget constraints
Use GPT-5 only for complex technical meetings
Low reasoning effort is sufficient for most meetings
Create custom prompts for recurring meeting types (standups, sales calls, etc.)
Use auto-detect language unless you need specific output language
Use Full Page View for comfortable reading and reviewing
Regenerate if first result isn't satisfactory (try different model or reasoning effort)

Custom Prompts

Include {LANGUAGE_INSTRUCTION} placeholder for language flexibility
Test prompts with different meeting types before committing
Name prompts descriptively (e.g., "sales-call", "technical-review")
Keep default prompt as fallback reference

Architecture

The application follows a modular architecture:

Frontend (Streamlit): Multi-page interface with dialogs
Audio Layer: Recording and compression
Transcription Layer: OpenAI Whisper integration
AI Notes Layer: GPT-5 meeting notes generation
Storage Layer: Local file system with JSON metadata

Built with Streamlit, OpenAI Whisper API, OpenAI GPT-5 API, and FFmpeg

License: MIT
