
Releases: docling-project/docling

v2.28.4

29 Mar 11:56

Fix

v2.28.3

28 Mar 18:30

Fix

v2.28.2

26 Mar 16:52

Fix

  • Improve HTML layer detection, various MD fixes (#1241) (9210812)
  • html: Fix HTML parsed heading level (#1244) (85c4df8)

v2.28.1

25 Mar 18:20

Fix

  • converter: Cache same pipeline class with different options (#1152) (825b226)
  • debug: Missing translation of bbox to to_bounding_box (#1220) (6df8827)
  • docx: Identifying numbered headers (#1231) (f739d0e)

Documentation

  • examples: Batch conversion doc raises_on_error (#1147) (0974ba4)

v2.28.0

19 Mar 15:18

Feature

  • SmolDocling: Support MLX acceleration in VLM pipeline (#1199) (1c26769)
  • Add PPTX notes slides (#474) (b454aa1)
  • Updated vlm pipeline (with latest changes from docling-core) (#1158) (2f72167)

Fix

  • Determine correct page size in DoclingParseV4Backend (#1196) (f5adfb9)
  • msword: Fixing function return in equations handling (#1194) (0b707d0)

Documentation

v2.27.0

18 Mar 13:37

Feature

  • Add factory for ocr engines via plugins (#1010) (6eaae3c)
  • Add DoclingParseV4 backend, using high-level docling-parse API (#905) (3960b19)
  • actor: Docling Actor on Apify infrastructure (#875) (772487f)
  • Equations to latex in MSWord backend (with inline groups) (#1114) (6eb718f)

Fix

Documentation

v2.26.0

11 Mar 11:12

Feature

  • Use new TableFormer model weights and default to accurate model version (#1100) (eb97357)

Fix

Documentation

  • Add description of DOCLING_ARTIFACTS_PATH env var (#1124) (e1c49ad)

Performance

  • New revision code formula model and document picture classifier (#1140) (5e30381)

v2.25.2

05 Mar 14:51

Fix

  • Proper handling of orphan IDs in layout postprocessing (#1118) (c56ab3a)

Documentation

v2.25.1

03 Mar 00:56

Fix

  • Enable locks for threadsafe pdfium (#1052) (8dc0562)
  • html: Use 'start' attribute when parsing ordered lists from HTML docs (#1062) (de7b963)
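Note on the ordered-list fix (#1062): an HTML `<ol start="N">` begins numbering at N instead of 1. The sketch below shows that behavior with Python's standard `html.parser`; it is an illustration only, not docling's HTML backend, and `OrderedListParser` and `parse_ordered_list` are hypothetical names. It assumes flat (non-nested) lists.

```python
from html.parser import HTMLParser


class OrderedListParser(HTMLParser):
    """Collects (number, text) pairs from flat <ol> lists, honoring 'start'."""

    def __init__(self) -> None:
        super().__init__()
        self.items = []        # collected (number, text) pairs
        self._counter = None   # next item number, or None when outside <ol>
        self._in_li = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "ol":
            start = dict(attrs).get("start")
            try:
                # Use the 'start' attribute when present, else begin at 1.
                self._counter = int(start) if start is not None else 1
            except ValueError:
                # Fall back to 1 on a non-integer 'start' value.
                self._counter = 1
        elif tag == "li" and self._counter is not None:
            self._in_li = True
            self._buf = []

    def handle_data(self, data):
        if self._in_li:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "li" and self._in_li:
            self.items.append((self._counter, "".join(self._buf).strip()))
            self._counter += 1
            self._in_li = False
        elif tag == "ol":
            self._counter = None


def parse_ordered_list(html: str):
    parser = OrderedListParser()
    parser.feed(html)
    parser.close()
    return parser.items
```

For example, `parse_ordered_list('<ol start="3"><li>c</li><li>d</li></ol>')` numbers the items 3 and 4, while a list without `start` begins at 1.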

Documentation

  • Improve docs on token limit warning triggered by HybridChunker (#1077) (db3ceef)

v2.25.0

26 Feb 14:16

Feature

  • [Experimental] Introduce VLM pipeline using HF AutoModelForVision2Seq, featuring SmolDocling model (#1054) (3c9fe76)
  • cli: Add option for downloading all models, refine help messages (#1061) (ab683e4)

Fix

Documentation

  • Extend chunking docs, add FAQ on token limit (#1053) (c84b973)