[SB][CB] mark input tensors refactoring #198
Conversation
…hing case Signed-off-by: Yannick Schnider <[email protected]>
👋 Hi! Thank you for contributing to vLLM support on Spyre.
FYI: I did something similar in this other draft PR. My intention is to share common code between static batching and continuous batching, and after that to have a structure that is closer to vLLM upstream.
Thanks for letting me know (I usually do not check draft PRs :)
```diff
-        torch._dynamo.mark_static(model_input.slot_mapping, 1)  # always 1
-        torch._dynamo.mark_static(model_input.input_positions,
-                                  1)  # always 1
+        self._mark_input_tensors(model_input)
```
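For context, a minimal sketch of what the extracted helper might contain, assuming it simply consolidates the `mark_static` calls from the removed lines; the enclosing class name here is hypothetical:

```python
import torch


class SpyreModelRunner:  # hypothetical class name, for illustration only
    def _mark_input_tensors(self, model_input) -> None:
        """Consolidate the torch._dynamo.mark_static calls in one place.

        Dimension 1 of these tensors is marked static so torch.compile
        does not treat it as dynamic; per the comments in the original
        code, that dimension is always 1.
        """
        torch._dynamo.mark_static(model_input.slot_mapping, 1)
        torch._dynamo.mark_static(model_input.input_positions, 1)
```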
Do we really need this function to be using `self`? It only interacts with the `model_input` param.
You are totally right, I copied it from the static batching code, where it also uses `self`, which can be removed. FYI: @wallashss
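Following the review suggestion, a sketch of how the helper could drop `self`, for example as a `@staticmethod` (a module-level function would work equally well); this is an illustration under the same hypothetical class name, not the actual patch:

```python
import torch


class SpyreModelRunner:  # hypothetical class name, for illustration only
    @staticmethod
    def _mark_input_tensors(model_input) -> None:
        # No instance state is needed; the helper only touches model_input.
        torch._dynamo.mark_static(model_input.slot_mapping, 1)
        torch._dynamo.mark_static(model_input.input_positions, 1)
```

A `@staticmethod` keeps existing call sites like `self._mark_input_tensors(model_input)` working unchanged while making explicit that the helper uses no instance state.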
If @wallashss can confirm the above change, I will close this PR and he can address everything in his.
I'm back to working on this refactoring, so sure, I can address those.
Thanks Wallas, closing this one now.
WIP