Skip to content
@docling-project

Docling Project

Welcome to the Docling Project

This is the GitHub organization Docling open-source project.

Docling

Docling is our main open-source package. It is a powerful library which simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

We support an amazing community which helps us driving forward the adoption of Docling. Give it a try and join the community!



The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as REST API and distribute large jobs.
  • docling-ibm-models - The AI models powering Docling.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned Loading

  1. docling docling Public

    Get your documents ready for gen AI

    Python 26.7k 1.6k

  2. docling-serve docling-serve Public

    Running Docling as an API service

    Python 241 46

  3. docling-core docling-core Public

    A python library to define and validate data types in Docling.

    Python 108 41

  4. community community Public

    2

Repositories

Showing 10 of 15 repositories
  • docling-eval Public

    Evaluation framework for document processing models and services.

    docling-project/docling-eval’s past year of commit activity
    Python 12 MIT 4 2 7 Updated Apr 9, 2025
  • docling-core Public

    A python library to define and validate data types in Docling.

    docling-project/docling-core’s past year of commit activity
    Python 108 MIT 41 20 4 Updated Apr 9, 2025
  • docling-project/docling-jobkit’s past year of commit activity
    Python 4 MIT 2 1 2 Updated Apr 9, 2025
  • docling-parse Public

    Simple package to extract text with coordinates from programmatic PDFs

    docling-project/docling-parse’s past year of commit activity
    C++ 99 MIT 19 11 2 Updated Apr 9, 2025
  • docling Public

    Get your documents ready for gen AI

    docling-project/docling’s past year of commit activity
    Python 26,663 MIT 1,602 253 (9 issues need help) 16 Updated Apr 9, 2025
  • docling-ts Public

    Use Docling output in TypeScript and JavaScript

    docling-project/docling-ts’s past year of commit activity
    TypeScript 5 MIT 2 1 1 Updated Apr 8, 2025
  • docling-sdg Public

    A set of tools to create synthetically-generated data from documents

    docling-project/docling-sdg’s past year of commit activity
    Python 6 MIT 3 4 3 Updated Apr 4, 2025
  • docling-serve Public

    Running Docling as an API service

    docling-project/docling-serve’s past year of commit activity
    Python 241 MIT 46 28 7 Updated Apr 4, 2025
  • docling-mcp Public

    Making docling agentic through MCP

    docling-project/docling-mcp’s past year of commit activity
    Python 16 MIT 4 2 2 Updated Apr 3, 2025
  • docling4j Public

    Docling4j brings the functionalities of Docling in document understanding to Java® projects

    docling-project/docling4j’s past year of commit activity
    Java 2 MIT 0 0 0 Updated Mar 31, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.