Thinking blocks support #1219

igordayen · 2025-12-31T00:31:47Z

Add Comprehensive Thinking Extraction Support

Summary

This PR implements complete end-to-end thinking extraction functionality,
enabling LLMs clients with reasoning content alongside structured outputs.
The implementation spans API design, integration with Spring AI, manual
converter chains, and comprehensive test coverage.

New API Surface

 // Fluent API for thinking extraction

ThinkingResponse<Person> response = 
      promptRunner
     .withThinking()
     .createObject("Analyze this person", Person.class);

//  Response Structure

 data class ThinkingResponse<T>(
     val result: T?,                      // Converted object
     val thinkingBlocks: List<ThinkingBlock>  // Extracted reasoning
 )

Execution Flow

  User Code
    ↓
  PromptRunner.withThinking() 
     ↓
  OperationContextPromptRunner
   ↓
  ThinkingPromptRunnerOperationsImpl
    ↓
  ChatClientLlmOperations.doTransformWithThinking()
    ↓
  Manual Converter Chain:
    - WithExampleConverter (with examples)
    - SuppressThinkingConverter (JSON cleaning)
    - FilteringJacksonOutputConverter (parsing)
    ↓
  extractAllThinkingBlocks(rawResponse)
    ↓
  ThinkingResponse<T>

Key Components

API Layer

PromptRunner - core thinking functionality withThinking
operations
ThinkingPromptRunnerOperations: Interface defining thinking-aware
operations
ThinkingResponse: Response wrapper containing result + thinking
blocks

Implementation Layer

ThinkingPromptRunnerOperationsImpl: Core implementation routing to
ChatClientLlmOperations
ChatClientLlmOperations: Enhanced with doTransformWithThinking()
methods
Manual converter chains: Bypassing responseEntity() to preserve
thinking blocks

Infrastructure

SuppressThinkingConverter: Cleans thinking blocks from JSON parsing
ThinkingDetector: Used in Streaming; Refactored to use centralized ThinkingTags
definitions

Technical Approach

Manual Converter Chain Pattern

  // Extract thinking BEFORE converter chain
  val thinkingBlocks = extractAllThinkingBlocks(rawText)

  // Clean conversion without responseEntity() 
  val result = converter.convert(rawText)

  // Combine both in thinking-aware response
ThinkingResponse(result, thinkingBlocks)

Thinking Block Preservation

Extract thinking blocks from raw LLM response text first
Use manual converter chains to avoid single-consumption constraints
Preserve thinking blocks even in failure scenarios via
ThinkingException for createObjectIfPossible

Test Coverage

Unit Tests

ThinkingPromptRunnerOperationsTest: API contract testing
ThinkingPromptRunnerOperationsExtractionTest: Thinking extraction
validation
ChatClientLlmOperationsThinkingTest: Core implementation testing
ThinkingPromptRunnerBuilderTest: Java builder pattern validation

Integration Tests

LLMAnthropicThinkingBuilderIT: End-to-end with Anthropic Claude models
LLMOllamaThinkingBuilderIT: End-to-end with Ollama models

Files Modified

Enhanced: ChatClientLlmOperations.kt, OperationContextPromptRunner.kt
Fixed: SuppressThinkingConverter.kt, ThinkingDetector.kt
Updated: LlmOptions.kt, existing test files

Usage Examples

Java

    PromptRunner runner = ai.withLlm("claude-sonnet-4-5")
                                .withToolObject(Tooling.class)
                                 .withGenerateExamples(true);

        String prompt = """
                What is the hottest month in Florida and  provide its temperature.
                Please respond with your reasoning using tags <reason>.
                
                The name should be the month name, temperature should be in Fahrenheit.
                """;

        // When: Use runner to create object with thinking
        ThinkingResponse<MonthItem> response = runner
                .withThinking()
                .createObject(prompt, MonthItem.class);


19:27:56.460 [main] INFO  LLMAnthropicThinkingBuilderIT - Created object: MonthItem{name='August', temperature=91}
19:27:56.460 [main] INFO  LLMAnthropicThinkingBuilderIT - Extracted [ThinkingBlock(content=Florida's hottest month is typically August, when temperatures peak during the summer season. The average high temperature in August across Florida is approximately 91-92°F, though it can vary slightly by region. In many areas, particularly inland and southern Florida, temperatures regularly reach into the low to mid-90s during this month. I'll use 91°F as a representative average high temperature for Florida in August., tagType=TAG, tagValue=reason)] thinking blocks

Addressed code complexity in ChatClientLlmOperations by moving exception handling blocks and prompt builders into separate private methods,

This implementation provides complete thinking extraction capabilities
while maintaining backward compatibility and comprehensive test coverage
across multiple LLM providers

Note: this PR depends on another PR:
embabel/embabel-common#99

ThinkingBlocks abstraction suppot PromptRunner API withThinking Introduced ChatResponseWithThinking for PromptRunner createObject APIs Delegation from OperationContextPromptRunner to ChatClientLLMOperations for thinking support Comprehensive unit and integration testing

…separate functions to reduce complexity

johnsonr

Great stuff. Really important functionality.

embabel-agent-api/src/main/java/com/embabel/agent/api/thinking/ThinkingPromptRunnerBuilder.java

...ent-api/src/main/kotlin/com/embabel/agent/api/common/support/OperationContextPromptRunner.kt

embabel-agent-api/src/main/kotlin/com/embabel/agent/api/common/thinking/ThinkingExtensions.kt

embabel-agent-api/src/main/kotlin/com/embabel/chat/ChatResponseWithThinking.kt

renamed and moved Reponse With Thinking and Thinking Exception to recommended package factored API withThinking into PromptRunner introduced tag interface Thinking Capability to ensure thinking gets applied only to proper prompt runner operations ensure createObject and other APIs will not compile on prompt runners that do not implement thinking functionality more fluent java builder API

PromptRunner withThinking returns ThinkingPromptRunnerOperations - single critical change Removed Thinking Extensions Renamed ResponseWithThinking to ThinkingResponse Updated java Thinking IT tests to use Core PromptRunner API, rather than builder Thinking Builder - not deprecated, for supporting builder pattern, does not rely on extensions anylonger Updated documentation with reference to Core API rather than builder

igordayen · 2026-01-05T17:15:29Z

commit 688a53a (HEAD -> thinking-blocks-support, origin/thinking-blocks-support)
Author: Igor Dayen [email protected]
Date: Mon Jan 5 12:05:54 2026 -0500

Include Thinking into Core PromptRunner API:

PromptRunner withThinking returns ThinkingPromptRunnerOperations - single critical change
Removed Thinking Extensions
Renamed ResponseWithThinking to ThinkingResponse
Updated java Thinking IT tests to use Core PromptRunner API, rather than builder
Thinking Builder - not deprecated, for supporting builder pattern, does not rely on extensions anylonger
Updated documentation with reference to Core API rather than builder

johnsonr

Very close now. Good stuff

embabel-agent-api/src/main/kotlin/com/embabel/agent/api/common/PromptRunner.kt

embabel-agent-api/src/main/java/com/embabel/agent/api/thinking/ThinkingPromptRunnerBuilder.java

...ent-api/src/main/kotlin/com/embabel/agent/api/common/support/OperationContextPromptRunner.kt

embabel-agent-docs/src/main/asciidoc/reference/thinking/page.adoc

embabel-agent-api/src/main/java/com/embabel/agent/api/thinking/ThinkingPromptRunnerBuilder.java

Enhanced documentation Updated multi-line text

sonarqubecloud · 2026-01-06T03:45:52Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
76.4% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

igordayen added 2 commits December 30, 2025 18:24

Unwanted file removal

a2cdbaa

igordayen requested review from johnsonr and poutsma December 31, 2025 00:33

igordayen added 10 commits December 30, 2025 20:00

Fix Thinking Detector Test and thinkingPatterns

a68b43e

Align StreamingJacksonOutputConverterTest with common Thinking Tags

5cd6c83

Increase Code Coverage per Sonar Report

2da0c6e

Even More New Code Coverage per Sonar Report

12c87c5

Even More and More New Code Coverage per Sonar Report

38b0839

Lines and Branch Coverage Increase

1f88b6b

Address Sonar Violations

eba91e5

NOSONAR directive for safe cast

cd78f47

Refactored ChatClientLlmOperations by moving exception handling into …

e813eae

…separate functions to reduce complexity

Increase Code Coverage for newly added private methods

2d4e150

johnsonr requested changes Jan 3, 2026

View reviewed changes

igordayen added 4 commits January 4, 2026 13:05

Include Thinking section into User Guide

bbf9d5e

Restored Streaming Section

248e9fa

johnsonr requested changes Jan 5, 2026

View reviewed changes

Removed Thinking Prompt Runner Builder and also:

421e284

Enhanced documentation Updated multi-line text

johnsonr approved these changes Jan 6, 2026

View reviewed changes

johnsonr merged commit 5dcf26f into main Jan 6, 2026
13 checks passed

johnsonr deleted the thinking-blocks-support branch January 6, 2026 05:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Thinking blocks support #1219

Thinking blocks support #1219

Uh oh!

igordayen commented Dec 31, 2025 •

edited

Loading

Uh oh!

johnsonr left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

igordayen commented Jan 5, 2026

Uh oh!

johnsonr left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented Jan 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Thinking blocks support #1219

Thinking blocks support #1219

Uh oh!

Conversation

igordayen commented Dec 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add Comprehensive Thinking Extraction Support

Summary

New API Surface

Execution Flow

Technical Approach

Thinking Block Preservation

Test Coverage

Unit Tests

Integration Tests

Files Modified

Usage Examples

Uh oh!

johnsonr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

igordayen commented Jan 5, 2026

Uh oh!

johnsonr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented Jan 6, 2026

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

igordayen commented Dec 31, 2025 •

edited

Loading