
Conversation

@prashantgupta24 (Collaborator) commented Aug 19, 2025

Description

For the model ibm-ai-platform/micro-g3.3-8b-instruct-1b, test_openai_serving with max_num_seqs 2, TP 4, and CB seems to be failing with:

numValidElems: 139984597168000 larger than attn mask vector size: 256
dataformat_src_: IEEE_INT64

Some notes:

Earlier we only tested BS 2 with:

  1. test_openai_serving, which tested SB with TP 1, 2, and 4, and
  2. test_openai_serving_cb, which tested CB but without TP.

Both passed separately, but TP 4 with CB and BS 2 for test_spyre_online is failing (while TP 2 with CB and BS 2 passes).

The full model also passes:

tests/e2e/test_spyre_online.py::test_openai_serving[max_model_len(256)-max_num_seqs(2)-cb-sendnn-TP(4)-ibm-granite/granite-3.3-8b-instruct] PASSED
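
For reference, the test IDs above encode a configuration matrix (max_model_len, max_num_seqs, batching mode, backend, TP size, model). Below is a minimal, hypothetical sketch of how such a matrix could be expressed with pytest parametrization; the parameter names, values, and function signature are illustrative assumptions, not the actual code in tests/e2e/test_spyre_online.py.

import pytest

# Hypothetical sketch only -- not the repository's actual test code.
@pytest.mark.parametrize("max_model_len", [256])
@pytest.mark.parametrize("max_num_seqs", [2])
@pytest.mark.parametrize("cb", [True, False])  # continuous vs. static batching
@pytest.mark.parametrize("tp_size", [1, 2, 4])
@pytest.mark.parametrize(
    "model",
    [
        "ibm-ai-platform/micro-g3.3-8b-instruct-1b",
        "ibm-granite/granite-3.3-8b-instruct",
    ],
)
def test_openai_serving(model, tp_size, cb, max_num_seqs, max_model_len):
    # Spin up an OpenAI-compatible server with this configuration and check
    # completions. The failure reported above corresponds to cb=True,
    # tp_size=4, max_num_seqs=2 on the micro model; tp_size=2 with the same
    # settings passes, as does tp_size=4 with the full granite-3.3-8b model.
    ...
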

@github-actions

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure your code passes all the linting checks, otherwise your PR can't be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

@prashantgupta24 prashantgupta24 enabled auto-merge (squash) August 19, 2025 15:52
@github-actions github-actions bot added the ready label Aug 19, 2025
@joerunde (Collaborator) left a comment


F

@prashantgupta24 prashantgupta24 merged commit 470a049 into main Aug 19, 2025
23 of 31 checks passed
@prashantgupta24 prashantgupta24 deleted the max-seqs-4 branch August 19, 2025 16:12