Skip to content

Releases: docling-project/docling

v2.92.0

29 Apr 07:38

Choose a tag to compare

Feature

  • Extend the kserve-triton OCR model to have multi-lingual support (#3368) (8b67fae)
  • docx: Add checkbox parsing support (#3349) (c455a65)
  • Introduce modular docling-slim package (#3285) (ed32c5e)
  • Add ResponseFormat.DOCLANG and parsing branch in VLM pipeline (#3350) (0f6f8d0)

Fix

  • pptx: Skip malformed picture shapes instead of aborting conversion (#3372) (7294248)
  • docx: OMML conversion failures for unsupported limit functions (#3359) (3df80e7)
  • Make VLLM model_impl configurable (#3358) (a6a37ca)

v2.91.0

23 Apr 09:29

Choose a tag to compare

Feature

  • docx: Extract VML images with v:imagedata elements (#3343) (2ddaa3b)

Fix

  • Strengthen input validation for METS‑GBS processing (#3336) (c1dbac2)
  • EasyOCR model downloading (#3339) (5e161ac)
  • vlm: Remove bogus preamble from VLM chat template (#3351) (c190ba2)
  • html: Refine image URL and size handling (#3348) (cd0cb69)
  • Fixes to html_backend (#3342) (9813190)
  • pptx: Assign pptx notes to ContentLayer.NOTES (#3341) (3a3c8f6)
  • Prevent path traversal in LaTeX macro handlers (#3330) (65ef180)
  • service: Add explicit usage exceeded exception handling (#3325) (075fa69)

Documentation

  • uspto: Improve documentation of USPTO XML parser security config (#3338) (09de7f9)

v2.90.0

17 Apr 11:56

Choose a tag to compare

Feature

  • Implement GraniteVisionTableStructureModel for VLM-based table extraction (#3323) (1569e42)

Fix

  • latex: Fully unwrap deeply nested formatting macros (#3249) (101233e)
  • docx: Handle inline formulas in list items (#3304) (c761512)
  • format: Add MD fallback for .txt files in _guess_from_content (#3311) (3bab6b4)
  • Strip soft hyphen when joining merged text elements (#3232) (8274892)
  • pptx: Handle NotImplementedError from shape.shape_type (#3309) (043ed2d)

Documentation

  • Fix nanonets_ocr2 runtime support matrix (#3317) (8ec14f2)

v2.89.0

16 Apr 08:08

Choose a tag to compare

Feature

  • Explicit TikZ environment handling in LaTeX backend (#3187) (a15c16e)

Fix

  • ocr: Align RapidOCR english assets with 3.8 mobile models (#3291) (251c8b2)
  • docx: Isolate list state in table cells (#3294) (740c386)
  • pipeline: Prevent cache miss due to pipeline options mutation during chart extraction (#3300) (5b84911)

Documentation

  • Add indexed picture placeholder example to serialization notebook (#3293) (cd2e5b6)

Performance

  • markdown: Avoid eager string formatting in Markdown backend debug logs (#3301) (a64c378)

v2.88.0

13 Apr 14:05

Choose a tag to compare

Feature

  • service: Establish client SDK for docling serve (#3264) (42157a3)

Fix

  • ocr: Support rapidocr 3.8 mobile model naming (#3277) (6b257ec)

Documentation

  • Add agent skill bundle for coding assistants (SKILL.md, pipelines, convert/evaluate) (#3174) (c23622f)

v2.87.0

13 Apr 07:37

Choose a tag to compare

Feature

Fix

  • Transformers v5 compatibility for AUTOMODEL_CAUSALLM VLMs (#3276) (d431224)
  • vlm: Add explicit MLX support for OCR presets (#3272) (27d3cf4)
  • markdown: Normalize repeated leading dash markers (#3286) (a6aeddf)
  • docx: Preserve inline SDT references (#3280) (6cb1bc0)
  • pptx: Respect page_range during conversion (#3282) (e4fd937)
  • vlm: Support tool-calling API responses (#3271) (9c3ab93)
  • pdf: Extend ligature map with Dutch IJ and PUA glyph U+F0A0 (#3254) (ab5254d)

Documentation

  • Add AG2 multi-agent document analysis example (#3261) (1fed840)

v2.86.0

10 Apr 14:15

Choose a tag to compare

Feature

  • Support for GraniteVision v4 (#3217) (fd83420)
  • Add signature/stamp html block to DC document (#3251) (9b4b67b)
  • vlm: Add PARTIAL_SUCCESS status for VLM pipeline pages (#3215) (6699642)

Fix

  • latex: Discard arguments of filtered spacing commands (#3245) (6180925)

Documentation

v2.85.0

07 Apr 14:22

Choose a tag to compare

Feature

Fix

v2.84.0

01 Apr 18:35

Choose a tag to compare

Feature

v2.83.0

31 Mar 09:32

Choose a tag to compare

Feature

Fix

  • pdf: Propagate hyperlinks to DoclingDocument text items (#3131) (524edcc)
  • xlsx: Guard last-row bounds in Excel table scan (#3197) (85ac377)
  • Parse LaTeX macros in multicolumn/multirow table cells (#3204) (89c68f8)
  • Handle empty CSV file without crashing (#3196) (f283484)

Documentation

  • Add line-based chunker documentation and examples (#3210) (3a64f41)