@sducouedic sducouedic commented Jul 10, 2025

This PR introduces a brief overview on how to debug and test the continuous batching functionality in vLLM. It pinpoints the main testing functions and script for inference with continuous batching.

Signed-off-by: Sophie du Couédic <[email protected]>
@github-actions

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure your code passes all the linting checks, otherwise your PR can't be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

@sducouedic sducouedic marked this pull request as ready for review July 10, 2025 21:54
@sducouedic sducouedic requested a review from rafvasq as a code owner July 10, 2025 21:54
Signed-off-by: Sophie du Couédic <[email protected]>
@sducouedic sducouedic force-pushed the doc_cb_script_and_tests branch from 7336865 to 120cd73 Compare July 10, 2025 21:56
@sducouedic sducouedic enabled auto-merge (squash) July 11, 2025 11:14
@github-actions github-actions bot added the ready label Jul 11, 2025
@rafvasq rafvasq left a comment

Quick note that the doc needs to be included in .nav to be shown on the site, see #226.

Also wonder if it fits better under Developer Guide and not User Guide since it seems primarily about debugging and testing.

Collaborator

I wonder if all this can be added as docstring to the top of the respective test files themselves instead of having it sit separately (which is prone to being stale and an extra step for anyone trying to understand the code)

Collaborator Author

Having one doc file encompassing the different CB testing/debugging functions is a request from the compiler team actually

Collaborator

Makes sense. I wonder if we can add all the text from this PR into the test files and have some sort of table/visual representation encompassing the various configurations tested within the docs?

I worry that when we change something for the CB tests (which happens so frequently these days), we will forget to update the docs for the same which will lead to outdated stuff pretty fast.

@sducouedic sducouedic Jul 14, 2025

The table/visual is a good idea, and actually also a request :)
But do you have in mind automation that would update the table/visuals in the docs from the parameters found in the test functions? Also, would the table/visual be in addition to the text in the docs, or replace it?

@prashantgupta24 prashantgupta24 Jul 14, 2025

Ideally the visuals would be in addition to the docstrings, but I don't know if this can be automated; it seems complex.

@rafvasq rafvasq Jul 14, 2025

Just to jump in, I also agree that a lot of the info here should be contained in the files themselves, and this CB developer doc could point to where the script and tools are, with high-level descriptions and debugging notes similar to the ones here, including the table you both mentioned.

I also noticed some "TODO"-type notes in this guide. If they're important, they should probably be turned into tracking issues rather than left as notes in the docs.

Signed-off-by: Sophie du Couédic <[email protected]>
sducouedic pushed a commit that referenced this pull request Jul 15, 2025
# Description

Uses `mkdocstrings` to create a doc outlining CB tests and parameters.

Related to #300

---------

Signed-off-by: Rafael Vasquez <[email protected]>
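For context, `mkdocstrings` renders docstrings into a MkDocs page via an identifier directive; a minimal sketch of such a page might look like the following (the file path and module identifier below are hypothetical, for illustration only):

```markdown
<!-- docs/contributing/continuous_batching/overview.md -->
# Continuous Batching Tests

<!-- Pull the test module's docstring into the page;
     "tests.e2e.test_spyre_cb" is a made-up identifier. -->
::: tests.e2e.test_spyre_cb
    options:
      show_source: false
```

This way the prose lives in the test files' docstrings and the docs page stays in sync automatically.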
@sducouedic sducouedic force-pushed the doc_cb_script_and_tests branch from 12556b3 to aa5baec Compare July 16, 2025 11:40
@sducouedic sducouedic disabled auto-merge July 18, 2025 15:19
@prashantgupta24
Collaborator

Do we still need this PR?

Signed-off-by: Sophie du Couédic <[email protected]>
@sducouedic sducouedic force-pushed the doc_cb_script_and_tests branch from 23917c8 to 7c6efb3 Compare July 18, 2025 19:07
Signed-off-by: Sophie du Couédic <[email protected]>
@sducouedic sducouedic force-pushed the doc_cb_script_and_tests branch from d93e182 to b10b8c9 Compare July 18, 2025 19:21
@sducouedic
Collaborator Author

@prashantgupta24
I just shortened it and added links to the docstring, please check: https://vllm-spyre--300.org.readthedocs.build/en/300/contributing/continuous_batching/overview.html

@sducouedic sducouedic enabled auto-merge (squash) July 18, 2025 20:50
@yannicks1 yannicks1 self-requested a review July 21, 2025 14:41
@yannicks1 yannicks1 left a comment

lgtm

@sducouedic sducouedic merged commit 3a7cc4b into main Jul 21, 2025
18 checks passed
@sducouedic sducouedic deleted the doc_cb_script_and_tests branch July 21, 2025 14:41
Comment on lines +1 to +3
# Continuous Batching tests / inference scripts in vLLM

Brief overview of what has been implemented so far in VLLM to test / debug continuous batching

Suggested change
# Continuous Batching tests / inference scripts in vLLM
Brief overview of what has been implemented so far in VLLM to test / debug continuous batching
# Continuous Batching Testing and Debugging
Overview of current tools for testing and debugging continuous batching in vLLM.

* `--max_prompt_len`: maximum prompt length (prompts will have lengths up to `max_prompt_len`)
* `--max-tokens` cannot be specified: the number of generated tokens is set automatically from `max_model_len` and the prompt lengths

## CB tests through unit tests

Suggested change
## CB tests through unit tests
## Unit Tests

Comment on lines +8 to +9
* `examples/offline_inference/cb_spyre_inference.py`
* `examples/offline_inference/long_context.py`

These could be hyperlinked to the scripts in the repo

* **Purpose:** Debugging (i.e., via manual execution)

### Description
* Runs inference on a set of prompts with continuous batching enabled (number of prompts is parametrizable)

Suggested change
* Runs inference on a set of prompts with continuous batching enabled (number of prompts is parametrizable)
* Runs inference on a set of prompts with continuous batching enabled

### Description
* Runs inference on a set of prompts with continuous batching enabled (number of prompts is parametrizable)
* Prints the generated text for each sequence.
* All the requested sequences are defined in the beginning, there is no requests joining the waiting queue while the decoding of some other request has already started.

Suggested change
* All the requested sequences are defined in the beginning, there is no requests joining the waiting queue while the decoding of some other request has already started.
* All requested sequences are defined at the start; no new requests join the queue once decoding has started for others.

* Prints the generated text for each sequence.
* All the requested sequences are defined in the beginning, there is no requests joining the waiting queue while the decoding of some other request has already started.
* The exact sequence of prefill and decode steps depends on the parameter values `max_num_seqs`, `num-prompts`, `max-tokens`.
* If `--compare-with-CPU` is set, then the output text is compared to the one of hugging face, running on CPU. Note that here the logprobs are not compared, only tokens.

Suggested change
* If `--compare-with-CPU` is set, then the output text is compared to the one of hugging face, running on CPU. Note that here the logprobs are not compared, only tokens.
* If `--compare-with-CPU` is set, the output text is compared to that of Hugging Face running on CPU. Note that only the tokens are compared, logprobs are not.
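The `--compare-with-CPU` check described above boils down to a token-by-token equality test against a Hugging Face generation on CPU; a minimal sketch (the function and variable names here are illustrative, not the script's actual API):

```python
# Hypothetical sketch of the token-level check behind --compare-with-CPU:
# only generated token ids are compared, logprobs are ignored.
def outputs_match(vllm_outputs, hf_outputs):
    """Return True iff every sequence's generated token ids match exactly."""
    if len(vllm_outputs) != len(hf_outputs):
        return False
    return all(v == h for v, h in zip(vllm_outputs, hf_outputs))

# Dummy token ids standing in for real generations
vllm_tokens = [[101, 7592, 2088], [101, 2023, 2003]]
hf_tokens = [[101, 7592, 2088], [101, 2023, 2003]]
print(outputs_match(vllm_tokens, hf_tokens))  # → True
```

A mismatch in any sequence (or a differing number of sequences) makes the comparison fail, which is why the check is robust to batching order only if outputs are aligned by request.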


See [Output Tests](tests/output_tests.md)

Output tests checks the correctness of the output of CB on a set of prompts. For now, the number of prompts and the prompts themself are hardcoded, as well as the max requested tokens per prompt (constant and set to 20). The output from vllm is compared to this of Hugging Face on CPU.

Suggested change
Output tests checks the correctness of the output of CB on a set of prompts. For now, the number of prompts and the prompts themself are hardcoded, as well as the max requested tokens per prompt (constant and set to 20). The output from vllm is compared to this of Hugging Face on CPU.
Output tests check the correctness of the output of CB on a set of prompts. For now, the number of prompts and the prompts themselves are hardcoded, as is the maximum number of requested tokens per prompt (constant, set to 20). The output from vLLM is compared to that of Hugging Face on CPU.

@rafvasq
Collaborator

rafvasq commented Jul 21, 2025

Ah, I reviewed too late 😅. I can offer some of these updates myself later.

@sducouedic
Collaborator Author

that would be great @rafvasq thanks
