🐛 use 512 tokens instead of 256 #509

joerunde · 2025-10-07T19:51:35Z

Description

Recent updates are causing issues when running Granite models at a context length of 256. This updates the unit tests to default to 512 instead.

Signed-off-by: Joe Runde <[email protected]>

github-actions · 2025-10-07T19:51:43Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀


Description

Following up on #509, this fixes up a few remaining tests that were setting their own max model lengths instead of using the default. Using the default makes it much easier to respond to changes in hardware support.

Signed-off-by: Joe Runde <[email protected]>
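
To illustrate why a shared default is easier to maintain than per-test values, here is a minimal sketch. It is not taken from the vLLM Spyre test suite: the names `DEFAULT_MAX_MODEL_LEN`, `resolve_max_model_len`, and the `TEST_MAX_MODEL_LEN` environment variable are all hypothetical and chosen only for this example.

```python
import os

# Hypothetical constant: a single place to change when hardware support
# shifts (for example, from 256 to 512 tokens).
DEFAULT_MAX_MODEL_LEN = 512


def resolve_max_model_len(override=None, env=None):
    """Return the context length a test should use.

    An explicit override wins, then the (hypothetical) TEST_MAX_MODEL_LEN
    environment variable, then the shared default.
    """
    env = os.environ if env is None else env
    if override is not None:
        return override
    return int(env.get("TEST_MAX_MODEL_LEN", DEFAULT_MAX_MODEL_LEN))


if __name__ == "__main__":
    # A test that does not pass an override picks up the shared default.
    print(resolve_max_model_len(env={}))
```

With this shape, a test that hard-codes its own length keeps that value when the default changes, which is the situation the follow-up describes; tests that pass no override track the default automatically.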

🐛 use 512 tokens instead of 256

6315085

Signed-off-by: Joe Runde <[email protected]>

joerunde requested review from prashantgupta24, rafvasq and sducouedic as code owners October 7, 2025 19:51

maxdebayser approved these changes Oct 7, 2025


🐛 fixup test ordering

b191492

Signed-off-by: Joe Runde <[email protected]>

joerunde merged commit 0c9b971 into main Oct 7, 2025
18 checks passed

joerunde deleted the 512-tokens branch October 7, 2025 21:11

joerunde mentioned this pull request Oct 8, 2025

🐛 fixup more tests to use the default max model length #512

Merged
