Skip to content

[FEA]: Ability to list the kernel execution queue #698

Answered by leofang
Ind1x1 asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @Ind1x1 thanks for reaching out 👋 Unfortunately we do not have any public access to hook into the kernel queue. Even if we do, I am not sure how it'd actually help you debug faulty GPU issues, which is what you really are concerned about.

I would like to mention, in case you don't know already, that NVIDIA DCGM is designed to help such use cases. It can be used to provide GPU diagnostics. See their docs here: https://docs.nvidia.com/datacenter/dcgm/latest/index.html and they can be reached out on GitHub here: https://github.com/NVIDIA/DCGM.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by leofang
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #696 on June 09, 2025 16:40.