How to Batch Image OCR with Generative AI API Calls

Env config

install uv and run the following command to install the dependencies:

uv sync

Run the demo

Prepare your own OpenAI API key, write it in .env. The run the following command in terminal:

input_dir: the directory of your input images, e.g., sample_images
collection_series: the collection series you want to use legacy (Generative AI Challenge) or sydow
save-json: whether to save the OCR results in JSON format
save-excel: whether to save the OCR results in Excel format

uv run python -m main --input-dir sample_images --collection-series legacy --save-json --save-excel

Ajust it to your project

You need to create a structured output Pydantic model in main.py and adjust batch_ocr function accoding, then change your promptsin ocr_prompt_default.yml to your needs.
You can read more details in FungariumOCR.pdf.
Enjoy :)

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
asserts		asserts
sample_images		sample_images
sample_images_sydow		sample_images_sydow
.env		.env
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
FungariumOCR.pdf		FungariumOCR.pdf
LICENSE		LICENSE
README.md		README.md
main.py		main.py
ocr_prompt_default.yml		ocr_prompt_default.yml
ocr_prompt_default_sydow.yml		ocr_prompt_default_sydow.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to Batch Image OCR with Generative AI API Calls

Env config

Run the demo

Ajust it to your project

About

Uh oh!

Releases

Packages

Languages

License

Alias-z/FungariumOCR

Folders and files

Latest commit

History

Repository files navigation

How to Batch Image OCR with Generative AI API Calls

Env config

Run the demo

Ajust it to your project

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages