@shmuelk shmuelk commented Oct 29, 2025

This PR backports llm-d-inference-scheduler PR 403 to this repo so that we can release a quick fix.

@nirrozenbaum nirrozenbaum left a comment

/lgtm
/approve

@nirrozenbaum nirrozenbaum merged commit e7151bd into llm-d:main Oct 29, 2025
1 check passed
pierDipi pushed a commit to pierDipi/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
…8d (llm-d#67)

Signed-off-by: konflux-internal-p02 <170854209+konflux-internal-p02[bot]@users.noreply.github.com>
Co-authored-by: konflux-internal-p02[bot] <170854209+konflux-internal-p02[bot]@users.noreply.github.com>
Jooho added a commit to opendatahub-io/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
…e3db496c26d-v0.3

[0.3] Ensure max_completion_tokens=1 for prefill (llm-d#67)