Skip to content

Conversation

@prashantgupta24
Copy link
Collaborator

@prashantgupta24 prashantgupta24 commented Aug 12, 2025

Description

  • Added DISABLE_ASSERTS to the scheduling tests - It's very useful if you want to debug without actually asserting the values
  • Fixed a bug where server_args.extend should have be called in tests

@github-actions
Copy link

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

from spyre_util import get_spyre_backend_list, get_spyre_model_list


def _check_result(client, model, max_tokens=8, temperature=0.0, n=1) -> None:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🌶️ !

@prashantgupta24
Copy link
Collaborator Author

bot:test
TEST_FILE=tests/e2e/test_spyre_online.py MARKERS="spyre"

@prashantgupta24
Copy link
Collaborator Author

bot:test
TEST_FILE=tests/e2e/test_spyre_online.py MARKERS="spyre"

Signed-off-by: Prashant Gupta <[email protected]>
"VLLM_SPYRE_DYNAMO_BACKEND": backend
}
server_args = [
server_args.extend([
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

found a 🐛 !

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 prashantgupta24 changed the title 🧪 add failing test for batch requests fix some stuff? Aug 14, 2025
@yannicks1
Copy link
Collaborator

just seen this... the modification to the model runner can be removed (see #384). The two internal issues on the tracker were coupled I guess. does this solve the remaining part of (internal) issue 977?

with vllm_spyre:0.7.0
```
tests/e2e/test_spyre_basic.py::test_output[max_num_seqs(4)-sendnn-TP(4)-ibm-granite/granite-3.3-8b-instruct-cb
```
fails with
```
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522]   File "/usr/local/lib/python3.12/site-packages/torch_sendnn/backends.py", line 2440, in update_graph
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522]     self.meta["gl_op"] = GraphLoaderOp(self.meta["g2"], self)
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522]                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522]   File "/usr/local/lib/python3.12/site-packages/torch_sendnn/backends.py", line 1819, in __init__
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522]     raise RuntimeError(f"GraphLoader CompileGraph failed: {s}")
(VllmWorker rank=1 pid=62989) ERROR 08-15 17:09:36 [multiproc_executor.py:522] RuntimeError: GraphLoader CompileGraph failed: compile_graph failed
```
so we don't need specific tests!

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 prashantgupta24 changed the title fix some stuff? 🐛 fix a bugs in tests, add DISABLE_ASSERTS Aug 15, 2025
@wallashss
Copy link
Collaborator

Thanks for rename the title, it is much better! 😄
(I was about to suggest it)

@prashantgupta24 prashantgupta24 changed the title 🐛 fix a bugs in tests, add DISABLE_ASSERTS 🐛 fix a bug in tests, add DISABLE_ASSERTS Aug 15, 2025
@maxdebayser
Copy link
Collaborator

Can you add more information in the PR description as to why the DISABLE_ASSERT is needed?

@prashantgupta24
Copy link
Collaborator Author

DISABLE_ASSERT

We already have it in tests/spyre_util.py. It's very useful if you want to debug without actually asserting the values. Let me update the PR description

@prashantgupta24 prashantgupta24 enabled auto-merge (squash) August 18, 2025 15:34
@github-actions github-actions bot added the ready label Aug 18, 2025
Copy link
Collaborator

@maxdebayser maxdebayser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@prashantgupta24 prashantgupta24 merged commit 46c6ac4 into main Aug 18, 2025
17 of 23 checks passed
@prashantgupta24 prashantgupta24 deleted the cb-batch-failing branch August 18, 2025 16:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants