Discussion: grpc-service thread

# gRPC Architecture Discussion Draft

Opening up the thread to discuss 2 PRs that are joined together:

https://github.com/docling-project/docling-serve/pull/504

https://github.com/docling-project/docling-core/pull/546

## What This Is

This is a 1:1 gRPC server for Docling Serve that follows the Pydantic model while using gRPC and protobuf conventions. The semantic source of truth is still the Pydantic domain model, and the protobuf IDL is the transport contract for gRPC clients.

## Approach and Feedback Request

We aligned early that a REST to gRPC field by field mirror is not a good design goal by itself. REST and gRPC solve different transport and client needs, so strict endpoint symmetry can make both sides worse.

Instead, the approach is semantic parity: the same document meaning, the same options, and the same outcomes, exposed through a gRPC native API shape. I'm hoping to get some feedback on the implementation I took with this approach - how it feels for maintainability and client usability.

Sidenote - so far it's been useful.  The changes since I initially coded this to maintain the mapping from this design were not at all difficult to do - about 20 minutes of total work on my end.

## How Mapping and Parity Work

At startup, the gRPC server validates schema compatibility by crawling the Pydantic model and comparing it to protobuf descriptors. This gives fast feedback when model changes happen, and it fails hard on unsafe type drift.

To avoid breakage while the codebase evolves, we explicitly track intentional differences and keep that set small. For example, fallback fields like `label_raw` are proto only on purpose so unknown future enum values do not break clients.

At runtime, conversion is model driven. The server hydrates protobuf messages from Pydantic objects, not from ad hoc JSON transforms. This keeps behavior consistent with the existing application paths and reduces duplicate logic.

In tests, new fields are caught in two places: conversion tests for field level correctness and startup schema validation tests for type/cardinality drift. So when the model changes, both runtime and CI surface mismatches quickly.

Feature parity is preserved because gRPC and REST both execute the same underlying conversion and chunking pipeline. Additional format options are still available, but protobuf remains the primary structured payload.

## Testing

So far my tests have been a grouping of my own documents as well as the ~200 docs you use for your own tests.  I'll be glad to add more with the goal to gain confidence with the Docling team that this is a solid design.

## Feedback Cadence / Methods

I'm OK with any change suggestions - the purpose of putting this out there is to start the conversation and keep the ball rolling.  I'm also open for any calls you might want to have to discuss.

## Future Direction

Once this is introduced - I'll he glad to help on a next step: deeper streaming support over gRPC, built around incremental pipeline output:

- page by page yielding during parse and enrichment
- document part yielding for tables, pictures, and text blocks
- live status and progress monitoring streams
- richer partial result streaming for long running jobs

This would let gRPC clients start consuming useful results earlier, rather than waiting for full document completion.  It'll also open up the possibility of remote services to handle these pipelines fast with a multi-language compatibility for other language tool integration.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discussion: grpc-service thread #563

gRPC Architecture Discussion Draft

What This Is

Approach and Feedback Request

How Mapping and Parity Work

Testing

Feedback Cadence / Methods

Future Direction

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Discussion: grpc-service thread #563

Description

gRPC Architecture Discussion Draft

What This Is

Approach and Feedback Request

How Mapping and Parity Work

Testing

Feedback Cadence / Methods

Future Direction

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions