fix(responses): store complete assistant response in Redis sessions #89

Open
josemaria-vilaplana wants to merge 1 commit into upstream-sync/v1.81.0-stable from fix/redis-session-store-complete-response
Conversation

@josemaria-vilaplana

Summary

  • Fix multi-turn conversations with Gemini thinking models (2.5/3) when using tool calling via the Responses API
  • Previously, Redis session storage only stored input messages, missing the assistant response with tool_calls and provider_specific_fields
  • This caused "Base64 decoding failed for thought_signature" errors on follow-up requests

Changes

  • Extract and store complete assistant message including tool_calls in Redis sessions
  • Preserve provider_specific_fields containing thought_signatures (critical for Gemini thinking models)
  • Handle both streaming and non-streaming response paths
  • Add comprehensive tests for thought_signature preservation (17 new tests)

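The fix described above can be sketched roughly as follows. This is an illustrative, self-contained sketch (plain dicts and a pluggable Redis client), not LiteLLM's actual internals; the helper names `extract_assistant_message` and `store_session` are hypothetical, while the message fields (`tool_calls`, `provider_specific_fields`) follow the OpenAI-style chat schema referenced in the PR.

```python
import json


def extract_assistant_message(response: dict) -> dict:
    """Build a complete assistant message from a non-streaming chat
    completion response, keeping tool_calls and provider_specific_fields.
    (Hypothetical helper; field names follow the OpenAI-style schema.)"""
    choice = response["choices"][0]["message"]
    assistant_msg = {"role": "assistant", "content": choice.get("content")}
    if choice.get("tool_calls"):
        assistant_msg["tool_calls"] = choice["tool_calls"]
    # thought_signatures for Gemini thinking models live here; dropping
    # them is what caused "Base64 decoding failed for thought_signature"
    # on the follow-up turn.
    if choice.get("provider_specific_fields"):
        assistant_msg["provider_specific_fields"] = choice["provider_specific_fields"]
    return assistant_msg


def store_session(redis_client, session_id: str, input_messages, response: dict) -> None:
    """Persist the input messages PLUS the full assistant reply, so the
    next turn replays the complete history (hypothetical sketch, not
    LiteLLM's real session-store API)."""
    messages = list(input_messages)
    messages.append(extract_assistant_message(response))
    redis_client.set(f"session:{session_id}", json.dumps(messages))
```

Before the fix, only `input_messages` reached Redis; the streaming path would need the same treatment after the chunks are re-assembled into a final response.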
Test plan

  • Run unit tests: pytest tests/test_litellm/responses/litellm_completion_transformation/test_thought_signature_preservation.py (17 passed)
  • Run related tests: pytest tests/test_litellm/responses/litellm_completion_transformation/ (98 passed, 1 skipped due to missing dependency)
  • Manual test with Gemini 2.5/3 thinking model + tool calling + multi-turn conversation

🤖 Generated with Claude Code

Previously, Redis session storage only stored input messages, missing the
assistant response with tool_calls and provider_specific_fields. This caused
Gemini thinking models (2.5/3) to fail on multi-turn tool calling conversations
with "Base64 decoding failed for thought_signature" errors.

Changes:
- Extract and store complete assistant message including tool_calls
- Preserve provider_specific_fields containing thought_signatures
- Handle both streaming and non-streaming response paths
- Add comprehensive tests for thought_signature preservation

Fixes multi-turn conversations with Gemini thinking models when using
tool calling via the Responses API.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Collaborator

@mateo-di mateo-di left a comment

@josemaria-vilaplana can we check if we already have a test that verifies this carto-customization in the smoke-ai tests in cloud-native?

@josemaria-vilaplana
Copy link
Author

josemaria-vilaplana commented Feb 6, 2026

> @josemaria-vilaplana can we check if we already have a test that verifies this carto-customization in the smoke-ai tests in cloud-native?

There is no way to cover this with our current testing approach. The failure is intermittent: it only occurs when the base64-encoded thought_signature is larger than 2K, which happens only occasionally depending on the encoded value.
