MetaScribe is a modular, accessibility-first tool for building metadata from audio files using subtitle input. It was originally developed to generate metadata.csv files for associating transcriptions of .wav files in voice cloning AI workflows. However, its structured design makes it suitable for most simple .wav transcription tasks where audio needs to be paired with text.
- 🎧 Audio playback and file tracking
- 📄 Subtitle-assisted text entry from
.srtfiles - 💾 Metadata export to
metadata.csv - 🔁 Session resume with undo-safe editing
- 🧠 Dyslexia-friendly Verdana font and fatigue-free layout
- 🔄 Batch renaming of
.wavfiles with undo support - 📋 Live preview of metadata and session state
- 🧹 Automatic cleanup of session data when complete
MetaScribe loads a folder of .wav files and a transcript.srt file. For each audio file, it presents a user-friendly interface to:
- Play the audio
- View the corresponding subtitle line
- Add or refine the text
- Accept the entry to bind it to the filename
- Save progress and resume later if needed
The resulting metadata is saved in metadata.csv using the format:
filename|text
Place your files in the following structure:
MetaScribe/
├── metascribe.py # Main application script
├── transcript.srt # Subtitle input file
├── metadata.csv # Output metadata file
├── session.json # Session persistence
├── wavs/ # Folder for audio files
- Python 3.8 or higher
- Tkinter (included with standard Python)
- Windows OS (uses
winsoundfor audio playback)
-
Clone the repository:
-
Navigate to the project folder:
cd MetaScribe
-
Ensure your
.wavfiles are placed in thewavs/directory. -
Include a
transcript.srtfile in the root directory. -
Run the application:
python metascribe.py
- The interface is optimized for 1500x1000 resolution and launches centered on screen.
- All fonts use Verdana for dyslexia-friendly readability.
- Metadata is saved in
metadata.csvusing the format:filename|text. - Session progress is saved in
session.jsonand automatically resumed on next launch. - The “Audio Renamer” tool allows batch renaming of
.wavfiles with undo support. - The “Accept” button finalizes the current transcription and moves to the next file.
- The “Skip” button preserves manual edits and advances without finalizing.
Specify your license here (e.g., MIT, GPL-3.0). Include a LICENSE file in the repository.
James Starling — Freelance broadcast and systems integration engineer, accessibility advocate, and creator of Echoes & Encounters.
MetaScribe is built for clarity, precision, and accessibility. It streamlines the process of aligning audio files with subtitle-based metadata for voice model training and other audio-text workflows.