Gentle

Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

Getting Started

There are three ways to install Gentle.

  1. Download the pre-built Mac application. This package includes a GUI that starts the server and opens a browser. It works only on macOS.

  2. Use the Docker image: run docker run -P lowerquality/gentle. This works on all platforms supported by Docker.

  3. Download the source code and run ./install.sh. Then run python3 serve.py to start the server. This works on Mac and Linux.

Using Gentle

By default, the aligner listens at http://localhost:8765. That page has a graphical interface for transcribing audio, viewing results, and downloading data.

There is also a REST API so you can use Gentle from your own programs. Here's an example of calling the API with curl:

curl -F "audio=@audio.mp3" -F "transcript=@words.txt" "http://localhost:8765/transcriptions?async=false"
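The same request can be made from Python. The following is a minimal sketch using only the standard library, not part of Gentle itself. It posts the same two form fields (audio and transcript) to the same URL as the curl command. The helper names encode_multipart and align are hypothetical, and the sketch assumes the server is running on the default port and that the response body is JSON.

```python
import json
import urllib.request
import uuid


def encode_multipart(files):
    """Encode {field_name: (filename, data_bytes)} as multipart/form-data.

    Returns (body_bytes, content_type_header). Hypothetical helper.
    """
    boundary = uuid.uuid4().hex
    parts = []
    for field, (filename, data) in files.items():
        header = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{field}"; '
            f'filename="{filename}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        )
        parts.append(header.encode("utf-8") + data + b"\r\n")
    # Closing boundary marks the end of the form.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def align(audio_path, transcript_path, base_url="http://localhost:8765"):
    """POST audio and transcript to a running Gentle server.

    Assumption: with async=false the server replies with a JSON body.
    """
    with open(audio_path, "rb") as audio, open(transcript_path, "rb") as text:
        body, content_type = encode_multipart({
            "audio": ("audio.mp3", audio.read()),
            "transcript": ("words.txt", text.read()),
        })
    request = urllib.request.Request(
        f"{base_url}/transcriptions?async=false",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, align("audio.mp3", "words.txt") would return the parsed result. Alignment of long recordings can take a while, so a caller may want to pass a generous timeout to urlopen.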

If you've downloaded the source code you can also run the aligner as a command line program:

git clone https://github.com/lowerquality/gentle.git
cd gentle
./install.sh
python3 align.py audio.mp3 words.txt

By default, the aligner writes the resulting JSON to stdout. See python3 align.py --help for options.
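Because the command line program prints JSON to stdout, it can also be driven from another script. The following is a rough sketch, assuming it runs from the cloned gentle directory after ./install.sh has completed. The function names align_command and run_align are hypothetical, and the structure of the returned JSON is not described here, so the sketch only parses it.

```python
import json
import subprocess
import sys


def align_command(audio_path, transcript_path, script="align.py"):
    """Build the argument list for: python3 align.py AUDIO TRANSCRIPT."""
    return [sys.executable, script, audio_path, transcript_path]


def run_align(audio_path, transcript_path, repo_dir="."):
    """Run align.py in repo_dir and parse the JSON it prints to stdout.

    Raises subprocess.CalledProcessError if the aligner exits non-zero.
    """
    result = subprocess.run(
        align_command(audio_path, transcript_path),
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return json.loads(result.stdout)
```

Calling run_align("audio.mp3", "words.txt", repo_dir="gentle") would return the parsed alignment as Python data.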
