More models for offline Speech Recognition #501

sansyrox · 2019-05-19T08:31:08Z

Describe the bug

Currently, offline Speech Recognition only recognizes US English and more languages need to be supported.

Expected behavior

Configure more models from here https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/

Additional context

Reference for installation can be taken from here : https://github.com/Uberi/speech_recognition/blob/master/reference/pocketsphinx.rst#installing-other-languages

hongquan · 2019-05-19T10:29:13Z

Good!

norbusan · 2019-05-19T10:31:00Z

I think this is a very good idea, but we need setup interface for new languages, and isn't it some other service that by default is used for speech recognition?

sansyrox · 2019-05-22T19:10:20Z

@norbusan , we use google stt when we are online and we use PocketSphinx when we are offline

geekypathak21 · 2019-09-20T19:22:20Z

Hey @stealthanthrax I want to solve this issue. While using SUSI-AI offline I got following error

Error: missing PocketSphinx language data directory: "/usr/local/lib/python3.6/dist-packages/speech_recognition/pocketsphinx-data/en_US"
Internet Connection not available

and when I checked pocketsphinx-data it contains folder with name en-us not en_US.
I want to know is there any mistake done by me while setting up project or a bug.

sansyrox · 2019-09-20T19:37:10Z

Hi @himanshupathak21061998 , I think this is a new bug which might have crept in while developing newer features.
I'll open an issue and you can work on fixing this first?

sansyrox · 2019-09-20T19:38:26Z

Because , I don't think that there is a different way to install SUSI on your system instead.

norbusan · 2019-09-21T03:00:39Z

@stealthanthrax @himanshupathak21061998 interesting. The en_US comes from proper language support, so we use locale names. Back then I changed the invocation of the sphinx recognizer to

recognizer.recognize_sphinx(audio, language=susi_config["language"])

where susi_config["language"] contains a locale style string (ll_LL). We need to convert this to one of the supported languages of pocket sphinx on the fly.

Best would be further to have the code check which languages are installed, and fall back to english if the requested language is not available.

It looks like an easy few line Python hack in susi_linux/main/states/recognizing_state.py, anyone wanting to take that? Otherwise I do it later on.

sansyrox added difficulty: easy/medium speech-recognition labels May 19, 2019

geekypathak21 linked a pull request Sep 25, 2019 that will close this issue

Adding Italian model in pocketsphinx for offline recognition fossasia/susi_installer#101

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More models for offline Speech Recognition #501

More models for offline Speech Recognition #501

sansyrox commented May 19, 2019

hongquan commented May 19, 2019

norbusan commented May 19, 2019

sansyrox commented May 22, 2019

geekypathak21 commented Sep 20, 2019 •

edited

Loading

sansyrox commented Sep 20, 2019

sansyrox commented Sep 20, 2019

norbusan commented Sep 21, 2019

More models for offline Speech Recognition #501

More models for offline Speech Recognition #501

Comments

sansyrox commented May 19, 2019

Describe the bug

Expected behavior

Additional context

hongquan commented May 19, 2019

norbusan commented May 19, 2019

sansyrox commented May 22, 2019

geekypathak21 commented Sep 20, 2019 • edited Loading

sansyrox commented Sep 20, 2019

sansyrox commented Sep 20, 2019

norbusan commented Sep 21, 2019

geekypathak21 commented Sep 20, 2019 •

edited

Loading