fix: pass num_retries to streaming completion path by saivedant169 · Pull Request #9460 · stanfordnlp/dspy

saivedant169 · 2026-03-16T17:48:53Z

Description

The streaming completion path via _get_stream_completion_fn called litellm.acompletion(stream=True) without num_retries or retry_strategy, so rate limit errors (HTTP 429) were raised immediately instead of being retried with exponential backoff. The non-streaming path correctly passes both parameters.

Changes

Added num_retries parameter to _get_stream_completion_fn()
Pass num_retries and retry_strategy="exponential_backoff_retry" to litellm.acompletion(stream=True) inside the stream closure
Updated litellm_completion() caller to forward num_retries
Updated alitellm_completion() caller to forward num_retries and headers (headers were also previously missing from the async streaming path, which was inconsistent with the sync caller and the non-streaming path)
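To illustrate the shape of the change listed above, here is a minimal sketch of threading num_retries through a closure factory. It is a simplified stand-in, not the code in dspy/clients/lm.py: fake_acompletion and get_stream_completion_fn are hypothetical names, the signature is reduced, and no real litellm call is made.

```python
import asyncio
from typing import Any, Optional

# Hypothetical stand-in for litellm.acompletion: it only records the keyword
# arguments it receives, so the forwarding can be inspected without a network call.
received: dict = {}


async def fake_acompletion(**kwargs: Any) -> str:
    received.update(kwargs)
    return "ok"


def get_stream_completion_fn(
    request: dict,
    cache_kwargs: dict,
    num_retries: int,
    headers: Optional[dict] = None,
):
    """Simplified, hypothetical version of the factory described in the PR."""

    async def stream_completion() -> str:
        # The closure captures num_retries and forwards it together with the
        # retry strategy, mirroring the "AFTER" call shown in the description.
        return await fake_acompletion(
            cache=cache_kwargs,
            stream=True,
            num_retries=num_retries,
            retry_strategy="exponential_backoff_retry",
            headers=headers,
            **request,
        )

    return stream_completion


stream_fn = get_stream_completion_fn(
    {"model": "example-model"},
    {"no-cache": True},
    num_retries=3,
    headers={"x-example": "1"},
)
result = asyncio.run(stream_fn())
```

After the call, received holds the keyword arguments that reached the stand-in, which makes it easy to check that nothing was dropped between the caller and the closure.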

Before vs After

# BEFORE: streaming path has no retries, fails immediately on 429
response = await litellm.acompletion(
    cache=cache_kwargs,
    stream=True,
    headers=headers,
    **request,
)

# AFTER: streaming retries match non-streaming path
response = await litellm.acompletion(
    cache=cache_kwargs,
    stream=True,
    num_retries=num_retries,
    retry_strategy="exponential_backoff_retry",
    headers=headers,
    **request,
)
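For readers unfamiliar with the behavior being enabled, the following is a rough sketch of what retrying with exponential backoff means in general. It is not litellm's implementation: RateLimitError, call_with_backoff, the base delay, and the doubling schedule are all assumptions chosen for illustration.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's HTTP 429 error."""


def call_with_backoff(
    fn: Callable[[], T],
    num_retries: int,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying up to num_retries times on RateLimitError.

    The wait doubles after each failed attempt: base_delay, 2x, 4x, ...
    """
    attempt = 0
    while True:
        try:
            return fn()
        except RateLimitError:
            if attempt >= num_retries:
                raise  # retries exhausted, surface the error to the caller
            sleep(base_delay * 2**attempt)
            attempt += 1


calls = {"count": 0}


def flaky() -> str:
    # Fails with a simulated 429 on the first two calls, then succeeds.
    calls["count"] += 1
    if calls["count"] <= 2:
        raise RateLimitError("429 Too Many Requests")
    return "done"


# Record the requested delays instead of actually sleeping.
delays: List[float] = []
outcome = call_with_backoff(flaky, num_retries=3, sleep=delays.append)
```

With num_retries=0 the first RateLimitError propagates straight to the caller, which corresponds to the behavior the streaming path had before this change.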

Testing

ruff check and ruff format pass clean.

The streaming path via _get_stream_completion_fn called litellm.acompletion(stream=True) without num_retries or retry_strategy, so rate limit errors (429) crashed immediately instead of retrying with exponential backoff. Thread num_retries through _get_stream_completion_fn and pass it along with retry_strategy to the litellm.acompletion call, matching the non-streaming path behavior. Also fix alitellm_completion to pass headers to the streaming path (previously missing, inconsistent with litellm_completion). Fixes stanfordnlp#9459

isaacbmiller · 2026-03-16T18:21:06Z

dspy/clients/lm.py

 def _get_stream_completion_fn(
    request: dict[str, Any],
    cache_kwargs: dict[str, Any],
+    num_retries: int = 0,


Let's have num_retries as a required function parameter
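One likely motivation for this request, sketched below under assumptions: with a default of 0, a caller that forgets to forward num_retries silently gets no retries, which is the same class of bug this PR fixes. Making the parameter required turns that omission into an immediate TypeError. The function names with_default and with_required are hypothetical.

```python
def with_default(request: dict, cache_kwargs: dict, num_retries: int = 0) -> int:
    # A forgotten argument falls back to 0, so retries are silently disabled.
    return num_retries


def with_required(request: dict, cache_kwargs: dict, num_retries: int) -> int:
    # A forgotten argument is rejected at call time.
    return num_retries


# Omitting num_retries is accepted and quietly yields 0.
silently_zero = with_default({}, {})

# Omitting num_retries raises TypeError, so the mistake cannot go unnoticed.
try:
    with_required({}, {})
    forgot_detected = False
except TypeError:
    forgot_detected = True
```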

isaacbmiller · 2026-03-16T18:21:27Z

dspy/clients/lm.py

    headers = request.pop("headers", None)
-    stream_completion = _get_stream_completion_fn(request, cache, sync=False)
+    stream_completion = _get_stream_completion_fn(
+        request, cache, num_retries=num_retries, sync=False, headers=_add_dspy_identifier_to_headers(headers)


Make the syntax match lm:355

isaacbmiller · 2026-03-16T18:21:45Z

dspy/clients/lm.py

-        messages: list[dict[str, Any]] | None = None,
-        **kwargs
-    ):
+    def forward(self, prompt: str | None = None, messages: list[dict[str, Any]] | None = None, **kwargs):


Undo this change unless it is necessary for ruff

…evert formatting - Make num_retries a required parameter in _get_stream_completion_fn - Match alitellm_completion syntax with litellm_completion (line 355 style) - Revert unrelated ruff formatting change on forward() signature

saivedant169 · 2026-03-24T19:22:39Z

Hey @isaacbmiller, wanted to see if there's anything else you need from me on this. Happy to make changes if something's off.

saivedant169 · 2026-04-03T16:49:17Z

Hey @isaacbmiller, all three items from your review are in. num_retries is required, the async caller matches the sync style, and the formatting is reverted. Let me know if anything else needs changing.

saivedant169 · 2026-04-05T01:27:54Z

ForgeArena Review

Decision: PASS (score: 89/100)

Evaluation

Tests: 0 passed, 0 failed
Lint: passed
Build: passed
Commands: 2/2 passed

Risk Signals

None detected

Policy

No violations

approved by ForgeArena - the merge gate for AI-generated code

isaacbmiller requested changes Mar 16, 2026


saivedant169 requested a review from isaacbmiller March 17, 2026 05:34

saivedant169 mentioned this pull request Mar 17, 2026

Add docstrings to dspy/clients/lm.py #9438

Open

saivedant169 wants to merge 2 commits into stanfordnlp:main from saivedant169:fix/streaming-retry-passthrough


2 participants
