Skip to content

Tesseract Traineddata for Sanskrit in Devanagari script and transliteration

License

Notifications You must be signed in to change notification settings

anotatta/tesstrain-Sanskrit-IAST

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 

Repository files navigation

tesstrain-Sanskrit-IAST

Tesseract Traineddata for Sanskrit IAST (written with diacritics)

Training

  • MODEL_NAME DevaLayer-201017 was trained by replacing top layer of START_MODEL=Devanagari (Oct 2020).
  • MODEL_NAME PuranaFinetune-210224 was trained by finetuning START_MODEL=DevaLayer-201017 (Feb 2021) using a few scanned images of Puranic Encyclopeadia for 3600 iterations, mainly to fix letter e being recognized as c.

Only Fast Integer Models are provided.

Testing

The test directory has a few sample images and the recognized text using these models.

About

Tesseract Traineddata for Sanskrit in Devanagari script and transliteration

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Shell 100.0%