Conversation

@mcalman (Contributor) commented May 21, 2025

This PR implements the required methods to support profiling vLLM with PyTorch Profiler.

Using PyTorch Profiler with vLLM

Offline profiling

Enable the torch profiler by setting the trace output directory (this can also be set on the command line):

os.environ["VLLM_TORCH_PROFILER_DIR"] = "./vllm_profile"

Start and stop profiling:

llm.start_profile()
outputs = llm.generate(prompts, sampling_params)
llm.stop_profile()
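Putting the two offline steps together, a minimal sketch (the helper name is illustrative; `start_profile()`, `stop_profile()`, and `generate()` are the methods this PR adds/uses):

```python
import os

# Must be set before the LLM is constructed so the profiler is initialized
# (equivalently, export VLLM_TORCH_PROFILER_DIR in the shell).
os.environ["VLLM_TORCH_PROFILER_DIR"] = "./vllm_profile"

def generate_with_profile(llm, prompts, sampling_params):
    """Wrap a single generate() call in start_profile()/stop_profile()
    so the trace covers exactly that call."""
    llm.start_profile()
    try:
        outputs = llm.generate(prompts, sampling_params)
    finally:
        # Stop even on error so the trace file is flushed to disk.
        llm.stop_profile()
    return outputs
```

With vLLM itself this would be called with an `LLM(...)` instance and `SamplingParams`; the trace files then land under `./vllm_profile`.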

Online profiling

Start server with profiling enabled:

VLLM_RPC_TIMEOUT=1800000 VLLM_TORCH_PROFILER_DIR=./vllm_profile python3 -m vllm.entrypoints.openai.api_server --model /models/llama-7b-chat --max-model-len=2048 --block-size=128

Start and stop the profiler from the client side:

import requests

# `client` is any OpenAI-compatible client pointed at the server above
requests.post("http://0.0.0.0:8000/start_profile")
client.completions.create(...)
requests.post("http://0.0.0.0:8000/stop_profile")
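The client-side pattern above can be wrapped in a small helper. This is a sketch; the function name and the injectable `post` argument are illustrative, not part of the vLLM API — only the `/start_profile` and `/stop_profile` endpoints come from the PR:

```python
# Illustrative client-side helper: wrap any request between the
# start/stop profile endpoints so the trace covers exactly that call.
BASE_URL = "http://0.0.0.0:8000"

def with_profiling(make_request, post, base_url=BASE_URL):
    """POST /start_profile, run the request, then POST /stop_profile.

    `post` is any callable taking a URL (e.g. requests.post); it is
    injected so the helper stays transport-agnostic and testable.
    """
    post(f"{base_url}/start_profile")
    try:
        return make_request()
    finally:
        # Stop even if the request fails so the trace is written out.
        post(f"{base_url}/stop_profile")
```

Usage with the snippet above would look like `with_profiling(lambda: client.completions.create(...), requests.post)`.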

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure your code passes all of the linting checks, otherwise your PR cannot be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv pip install --group lint

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

@marceloamaral marceloamaral left a comment


Looks great @mcalman!

Just want to mention that Spyre events will only be collected if PyTorch is using a Kineto build that supports Spyre events.

@marceloamaral

@dpatel-ops could you please have a look at this PR?

@joerunde (Collaborator)

Not gonna lie, I'm not an expert in torch profiling.

With the internal package installed, I spun this up, profiled it, and loaded the profile in Perfetto:
[screenshot]

Then in TensorBoard I see the GPU listed as AIU 🌶️🌶️🌶️
[screenshot]

Reverting the profiler install and running a new profile, I see only CPU devices listed:
[screenshot]

so, LGTM!

@joerunde (Collaborator) left a comment

🚀

@joerunde joerunde enabled auto-merge (squash) May 23, 2025 22:16
@github-actions github-actions bot added the ready label May 23, 2025
@joerunde joerunde merged commit 0d0d611 into vllm-project:main May 23, 2025
22 checks passed
@mcalman mcalman deleted the profiler branch July 28, 2025 19:39