How to use Orca2 language model. #18

Open
zydjohnHotmail opened this issue Dec 13, 2024 · 0 comments


Hello:
I am new to the Orca2 language model and have no idea how to use it, but I have a real-world example: I want to know how to use this model to correctly split the words in OCR text returned by the Florence2 language model.
See the attached screenshot for the OCR test image.
The OCR output returned by the Florence2 language model looks like this:
This is a lot of 12 point text to test theocr code and see if it works on all ctypesof file format.The quick brown dog jumped over thelazy fox. The quick brown dog jumpedover the lazy fox.Thequick brown dogjumped over the lazyfox. Thequickbrown dog jumpedoverthe lazy fox
Basically, the OCR results are quite good, but not exact. I want to use the Orca2 language model to correctly split the following strings:
theocr; jumpedover; dogjumped; lazyfox; Thequickbrown; jumpedoverthe. The original image clearly has spaces at these positions, but the OCR results have no spaces there at all.
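To make the expected result concrete, here is a minimal sketch of the kind of splitting I mean. It does not use Orca2 at all: it is a plain dictionary-based dynamic-programming segmentation, shown only to illustrate the desired input and output. `VOCAB`, `split_token`, and `fix_spacing` are hypothetical names made up for this illustration, and the vocabulary is just the words from the test sentence.

```python
import re

# Hypothetical mini vocabulary (an assumption for illustration):
# only the words that appear in the OCR test sentence.
VOCAB = {
    "this", "is", "a", "lot", "of", "point", "text", "to", "test", "the",
    "ocr", "code", "and", "see", "if", "it", "works", "on", "all", "types",
    "file", "format", "quick", "brown", "dog", "jumped", "over", "lazy", "fox",
}


def split_token(token, vocab=VOCAB):
    """Split one run of letters into the fewest vocabulary words.

    Returns [token] unchanged when no complete split exists.
    """
    lower = token.lower()
    n = len(lower)
    # best[i] holds the cut positions of the shortest split of lower[:i],
    # or None if lower[:i] cannot be covered by vocabulary words.
    best = [None] * (n + 1)
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            if best[start] is not None and lower[start:end] in vocab:
                candidate = best[start] + [end]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate
    if best[n] is None:
        return [token]
    # Cut the original token so that its capitalization is preserved.
    pieces, prev = [], 0
    for cut in best[n]:
        pieces.append(token[prev:cut])
        prev = cut
    return pieces


def fix_spacing(text, vocab=VOCAB):
    """Re-insert spaces inside every run of ASCII letters in the text."""
    return re.sub(
        r"[A-Za-z]+",
        lambda m: " ".join(split_token(m.group(), vocab)),
        text,
    )


if __name__ == "__main__":
    print(fix_spacing("Thequickbrown dogjumped over the lazyfox."))
```

A fixed word list like this only works for words it already contains (for example, it leaves "ctypesof" untouched), which is why I would like to use a language model instead.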
Please show me some code for using this language model to split those words correctly.
If this GitHub repository cannot be used with the Orca2 language model, please simply ignore my issue.
testocr
