Releases: OCR-D/ocrd_tesserocr
v0.20.1
v0.20.0
Changed:
- adapt to (and require) ocrd>=3.0, which allows running:
  - with pages in parallel (OCRD_MAX_PARALLEL_PAGES) in tandem with METS Server
  - with page timeout (OCRD_PROCESSING_PAGE_TIMEOUT)
  - with page failure fallback copycat (OCRD_MISSING_OUTPUT=COPY), new default is SKIP instead of ABORT (now via --debug)
  - with page completion re-runs (OCRD_EXISTING_OUTPUT=SKIP), which is the new default instead of ABORT (now via --overwrite)
- switched to pyproject.toml build, tracking version via ocrd-tool.json
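The environment variables named above could be combined for a single run roughly as follows. This is a minimal sketch, not the project's documented workflow: the file group names, the values 4 and 600, and the helper build_invocation are hypothetical, the timeout is assumed to be given in seconds, and the METS Server setup needed for page-parallel mode is left out. Nothing is executed; the code only assembles the environment and the command line.

```python
import os
import shlex


def build_invocation(input_grp, output_grp, parallel_pages=4, page_timeout=600):
    """Assemble (env, argv) for a processor run; hypothetical helper, runs nothing."""
    env = dict(os.environ)
    env.update({
        # Variable names are taken from the release notes; the values are illustrative.
        "OCRD_MAX_PARALLEL_PAGES": str(parallel_pages),
        "OCRD_PROCESSING_PAGE_TIMEOUT": str(page_timeout),  # assumed to be seconds
        "OCRD_MISSING_OUTPUT": "COPY",   # copy the input page if processing fails
        "OCRD_EXISTING_OUTPUT": "SKIP",  # skip pages that already have output
    })
    # File group names are hypothetical placeholders.
    argv = ["ocrd-tesserocr-recognize", "-I", input_grp, "-O", output_grp]
    return env, argv


env, argv = build_invocation("OCR-D-SEG", "OCR-D-OCR")
print(shlex.join(argv))
# To actually run it: subprocess.run(argv, env=env, check=True)
```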
Added:
- more test coverage (esp. modes w/o METS Server, METS caching, instance-caching, page-parallel)
- Docker image includes preconfigured ocrd-all-tool.json for these processors
Fixed:
- no more logging side effects between tests
v0.19.1
v0.19.0
v0.18.0
v0.17.0
Fixed:
- segment/recognize: fix shrink_polygons
- segment/recognize: fix reinit scope (for xpath_model and auto_model)
- CI: test multiple Python versions independent of ocrd/core image
- CI: speed up build for EOL Python 3.6
- CI: chmod o+w tessdata directory of PPA/OS Tesseract
- deps-ubuntu: allow installation of PPA Tesseract to fail (for newer OS)
Changed:
- adapted to Shapely v2
- *: inherit from recognize (but override logger)
- segment*: delegate constructor instead of wrapping instance
- requires ocrd==2.48
v0.16.0
v0.15.0
Added:
- binarize: dpi numerical parameter to specify pixel density, #186
- binarize: tiseg boolean parameter to specify whether to call tessapi.AnalyseLayout for text-image separation, #186
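As an illustration of the two binarize parameters, the sketch below serializes them to the JSON form that OCR-D processors accept via -p. The value 300 and the file group names are hypothetical, and the command is only assembled, not executed.

```python
import json

# dpi is numerical and tiseg is boolean, per the notes above; 300 is an illustrative value.
params = {"dpi": 300, "tiseg": True}
param_json = json.dumps(params)

# File group names are hypothetical placeholders.
argv = ["ocrd-tesserocr-binarize", "-I", "OCR-D-IMG", "-O", "OCR-D-BIN", "-p", param_json]
print(param_json)
```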
Changed:
- recognize: improved polygon handling, #186
- resources: proper support for moduledir, companion to OCR-D/core#904, #187