Skip to content

Commit d055249

Browse files
committed
update README.md
1 parent 2b0d214 commit d055249

File tree

1 file changed

+9
-6
lines changed

1 file changed

+9
-6
lines changed

README.md

+9-6
Original file line numberDiff line numberDiff line change
@@ -2,16 +2,19 @@ Pytesser
22
========
33

44
Python wrapper for the tesseract OCR engine. The module is based on OpenCV.
5-
Article : http://robindavid.comli.com/pytesser-python-wrapper-for-the-tesseract-ocr-engine/
5+
66

77
Informations
88
------------
99

10-
There is already multiples module called pytesser, but this one is slightly different on the following point:
10+
There is already multiples tesseract python modules, but none of them satisfied me. This one is different on the following point:
1111

12-
* It implement all the features of tesseract engine it includes the choise of the language and the page segmentation mode.
13-
* All the module is contained in one file (the others modules I have tried are quite messy.
14-
* It support OpenCV, so you can directly provide an IplImage to the module.
12+
* All the classes are put in the same file and all inessential class are removed
13+
* Use OpenCV instead of PIL (to really an advantage because PIL as far more widespread, but better fit my needs ;))
14+
* Use subprocess.communicate instead of subprocess.wait to avoid any output in the shell or in the programs that use the module.
15+
* Management of the differents languages via the option '-l' because the original pytesser use the default language which is english. By this way the detection of french for instance is totally inacurrate.
16+
* Management of of the pagesegmode, which allow to modify the behavior of tesseract if we want for instance to detect only one character, a word or a line.
17+
* The code is far more straightforward (my opinion)
1518

1619
How to use it ?
1720
---------------
@@ -28,4 +31,4 @@ There is to ways to use it. Either you give it a filename, either directly an Ip
2831
Or you can directly give it an IplImage like this:
2932

3033
image = cv.LoadImage("myimage.jpg")
31-
txt = pytesser.iplimage_to_string(image)
34+
txt = pytesser.iplimage_to_string(image)

0 commit comments

Comments
 (0)