Missing text with OCR although detected #335

EHadoux · 2025-03-11T21:38:16Z

Hey, I love the model, it's a game changer.

I'm trying to extract this document. All is well except on page 6. As you can see in the screenshots (from surya_gui, but the result is the same in direct Python), the bounding boxes are correct with "Run Text Detection", but the right-hand side of the first line is missing with "Run OCR".

This is the code I'm using; it returns the same result when run directly in Python:

recognition_predictor(images, [["en"]] * len(images), detection_predictor, highres_images=images_high)
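One way to pin down exactly which detected lines get dropped is to compare the boxes from the detection step with the boxes attached to the recognized lines. The sketch below is only a plain-Python helper, not part of surya: the names `overlap_ratio` and `unmatched_boxes` are hypothetical, and it assumes each box has already been pulled out of the results as `[x0, y0, x1, y1]` in the same pixel coordinates.

```python
from typing import List, Sequence

# Assumed layout (not confirmed by the library): [x0, y0, x1, y1] in pixels.
Box = Sequence[float]


def overlap_ratio(a: Box, b: Box) -> float:
    """Return the fraction of box a's area that is covered by box b."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    area = (a[2] - a[0]) * (a[3] - a[1])
    if width <= 0 or height <= 0 or area <= 0:
        return 0.0
    return (width * height) / area


def unmatched_boxes(
    detected: List[Box], recognized: List[Box], min_overlap: float = 0.5
) -> List[Box]:
    """Return detected boxes that no recognized box covers by at least min_overlap."""
    return [
        d
        for d in detected
        if all(overlap_ratio(d, r) < min_overlap for r in recognized)
    ]


# Made-up boxes for illustration: the second detected box has no OCR counterpart.
detected = [[0, 0, 100, 10], [120, 0, 200, 10], [0, 20, 200, 30]]
recognized = [[0, 0, 100, 10], [0, 20, 200, 30]]
print(unmatched_boxes(detected, recognized))  # [[120, 0, 200, 10]]
```

If the list is empty but text is still missing, the line was probably recognized as one box with truncated text rather than dropped, which would point at the recognition step instead of the box matching.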

Any idea? Thanks!

12102682_2022-07-06.pdf

kaiwang13 · 2025-03-19T09:00:37Z

Is there any solution for this case? I have run into the same problem.
