
Fix failed to rename bug in torch profiler #3708


Open: wants to merge 2 commits into base: main


@Fridge003 Fridge003 commented Apr 20, 2025

When launching on multiple GPUs with the torch profiler enabled, the "failed to rename" error described here is thrown.

This PR fixes the bug by restricting profiling to rank 0.
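As a rough illustration of the idea (not the PR's actual diff), the sketch below gates the torch profiler on rank 0 so that only one process writes a trace file. The helper names `get_rank` and `maybe_profile`, the use of the `RANK` environment variable, and the trace path are all assumptions made for this example; the real code may obtain the rank differently (for instance through MPI).

```python
import contextlib
import os


def get_rank() -> int:
    # Assumption: the launcher exports RANK (as torchrun does).
    # The project's own rank helper may differ.
    return int(os.environ.get("RANK", "0"))


@contextlib.contextmanager
def maybe_profile(rank: int, trace_path: str):
    """Run the torch profiler only on rank 0; other ranks do nothing."""
    if rank != 0:
        # Non-zero ranks skip profiling, so they never touch trace_path.
        yield None
        return

    # Imported lazily so ranks that do not profile never load the profiler.
    import torch.profiler as tp

    with tp.profile(activities=[tp.ProfilerActivity.CPU]) as prof:
        yield prof
    # Only rank 0 reaches this point, so there is a single writer.
    prof.export_chrome_trace(trace_path)


if __name__ == "__main__":
    # Hypothetical trace location, chosen only for this example.
    with maybe_profile(get_rank(), "/tmp/trace_rank0.json") as prof:
        pass  # the profiled workload would go here
```

The trade-off is that the trace covers only rank 0. If per-rank traces are needed, giving each rank its own output path would be an alternative way to avoid the collision.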

@Fridge003
Author

/bot run

@poweiw poweiw requested review from a team, lucaslie and HuiGao-NV and removed request for a team June 5, 2025 19:42
@poweiw poweiw added the triaged, Community want to contribute, and Generic Runtime labels Jun 5, 2025
Labels
Community want to contribute (PRs initiated from Community)
Generic Runtime (General operational aspects of TRTLLM execution not in other categories.)
triaged (Issue has been triaged by maintainers)