Traces CUDA GPU kernel functions via BPF and offers analysis through visualization and LLM-augmented insight. #14

aather · 2025-05-20T22:29:56Z

Traces CUDA GPU kernel functions via BPF and provides in-depth analysis through visualizations and optional LLM (Large Language Model)-powered summaries.

facebook-github-bot · 2025-05-20T22:30:03Z

Hi @aather!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

facebook-github-bot · 2025-05-20T23:06:29Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

RihamSelim · 2025-05-21T23:36:33Z

strobelight/src/profilers/gpuevent_snoop/GpuEventSnoop.cpp

-    if (link) {
-      links.emplace_back(link);
+ /* Attach Uprobes for CUDA API tracepoints */
+  for (const auto& symbol : kCudaSymbols) {


Can we make the list of events to trace optional (passed as an argument to the program) instead of collecting all events by default?
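One way the request above could look in user space is sketched below. This is only an illustration, not the profiler's actual code: `parseEventFilter`, `selectSymbols`, the `kKnownSymbols` contents, and the idea of a comma-separated argument value are all assumptions introduced here.

```cpp
#include <set>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for kCudaSymbols; the real list is not shown in this review.
static const std::vector<std::string> kKnownSymbols = {
    "cudaLaunchKernel", "cudaMalloc", "cudaFree"};

// Split a comma-separated argument value into a set of names, skipping empty entries.
std::set<std::string> parseEventFilter(const std::string& csv) {
  std::set<std::string> out;
  std::stringstream ss(csv);
  std::string item;
  while (std::getline(ss, item, ',')) {
    if (!item.empty()) {
      out.insert(item);
    }
  }
  return out;
}

// Choose which symbols to attach uprobes to. With no explicit request, fall
// back to a (hypothetical) default subset instead of tracing every known symbol.
std::vector<std::string> selectSymbols(
    const std::vector<std::string>& known,
    const std::set<std::string>& requested,
    const std::vector<std::string>& fallback) {
  if (requested.empty()) {
    return fallback;
  }
  std::vector<std::string> picked;
  for (const auto& symbol : known) {
    if (requested.count(symbol) != 0) {
      picked.push_back(symbol);
    }
  }
  return picked;
}
```

The attach loop would then iterate over the result of `selectSymbols(...)` rather than over the full list, so unknown names in the request are silently ignored and the default stays small.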

RihamSelim · 2025-06-02T18:08:29Z

strobelight/src/profilers/gpuevent_snoop/bpf/gpuevent_snoop.bpf.c

  bool capture_args;
  bool capture_stack;
 } prog_cfg = {
-    // These defaults will be overridden from user space


Can we leave the comments?
The purpose of the defaults is to let us exercise these specific code paths with veristat and avoid verifier errors.

RihamSelim · 2025-06-02T18:08:54Z

strobelight/src/profilers/gpuevent_snoop/bpf/gpuevent_snoop.bpf.c

      bpf_printk(fmt, ##__VA_ARGS__); \
  })

-// The caller uses registers to pass the first 6 arguments to the callee.  Given


Can we keep this comment as well?

RihamSelim · 2025-06-02T18:12:34Z

strobelight/src/profilers/gpuevent_snoop/bpf/gpuevent_snoop.bpf.c

-      bpf_probe_read_user(&e->args[i], sizeof(arg_addr), arg_addr);
+
+    struct gpukern_sample* e = bpf_ringbuf_reserve(&rb, sizeof(*e), 0);
+    if (!e) return 0;


The bpf_printk_debug("Failed to allocate ringbuf entry"); call can be useful for debugging, especially if we overrun the ring buffer. It should not cause noise or increase the instruction count as long as the .debug flag is set to false from user space.
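The pattern being asked for can be sketched as a user-space mock. Everything here is a stand-in and an assumption: `printk_debug_mock` imitates the debug macro with `printf`, `reserve_mock` imitates `bpf_ringbuf_reserve` failing, and `g_debug_lines` exists only so the behavior can be checked. It shows the control flow only; it says nothing about what the BPF verifier does with the disabled branch.

```cpp
#include <cstdio>

// Mock of the program config; in the BPF program the flag is set from user space.
struct ProgCfg {
  bool debug;
};
static ProgCfg prog_cfg = {false};

// Counts emitted debug lines so the gating can be observed (mock only).
static int g_debug_lines = 0;

// Debug-gated logging: prints only when prog_cfg.debug is true.
// Uses the GNU ##__VA_ARGS__ extension, as the macro in the diff does.
#define printk_debug_mock(fmt, ...)            \
  do {                                         \
    if (prog_cfg.debug) {                      \
      std::printf(fmt "\n", ##__VA_ARGS__);    \
      ++g_debug_lines;                         \
    }                                          \
  } while (0)

// Stand-in for a ring buffer reservation: returns nullptr when the buffer is full.
void* reserve_mock(bool full) {
  static char slot[64];
  return full ? nullptr : slot;
}

// Returns 1 if a sample slot was reserved, 0 if the event was dropped.
int handle_event(bool ring_full) {
  void* e = reserve_mock(ring_full);
  if (!e) {
    printk_debug_mock("Failed to allocate ringbuf entry");
    return 0;
  }
  return 1;
}
```

With `prog_cfg.debug` left at `false`, a failed reservation drops the event silently; flipping the flag makes the same path report the overrun.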

RihamSelim · 2025-06-02T18:13:22Z

strobelight/src/profilers/gpuevent_snoop/bpf/gpuevent_snoop.bpf.c

+            bpf_probe_read_user(&arg_addr, sizeof(u64), (const void*)(argv + i * sizeof(u64)));
+            bpf_probe_read_user(&e->args[i], sizeof(arg_addr), arg_addr);
+        }
+    }


Same about leaving the comments :)

RihamSelim · 2025-06-02T18:23:55Z

strobelight/src/profilers/gpuevent_snoop/GpuEventSnoop.cpp

-    for (auto& frame : stack) {
-      frame.print();
+    // Print function arguments if requested
+    if (env.args) {


This probably should be specific to EVENT_CUDA_LAUNCH_KERNEL.
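A minimal sketch of that gating, assuming a simplified sample layout: `Sample`, `EVENT_OTHER`, `kMaxArgs`, and `formatArgs` are hypothetical names introduced for illustration, and only `EVENT_CUDA_LAUNCH_KERNEL` comes from the review itself.

```cpp
#include <cstdint>
#include <sstream>
#include <string>

// Hypothetical event kinds; only EVENT_CUDA_LAUNCH_KERNEL is named in the review.
enum EventType { EVENT_CUDA_LAUNCH_KERNEL = 0, EVENT_OTHER = 1 };

constexpr int kMaxArgs = 4;  // assumed bound, for the sketch only

// Simplified stand-in for the sample delivered from the BPF side.
struct Sample {
  EventType type;
  int nargs;
  uint64_t args[kMaxArgs];
};

// Format kernel arguments only when they were requested AND the event is a
// kernel launch; every other event type yields an empty string.
std::string formatArgs(const Sample& e, bool argsRequested) {
  if (!argsRequested || e.type != EVENT_CUDA_LAUNCH_KERNEL) {
    return "";
  }
  const int n = e.nargs < kMaxArgs ? e.nargs : kMaxArgs;
  std::ostringstream os;
  for (int i = 0; i < n; ++i) {
    if (i > 0) {
      os << ' ';
    }
    os << "arg" << std::dec << i << "=0x" << std::hex << e.args[i];
  }
  return os.str();
}
```

Checking the event type next to the `env.args` test keeps argument printing from running on events that never carried kernel arguments.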

Traces CUDA GPU kernel functions via BPF and provides in-depth analys…

fab7cbb

…is through visualizations and optional LLM (Large Language Model)-powered summaries

facebook-github-bot added the CLA Signed label May 20, 2025

RihamSelim reviewed Jun 2, 2025


3 participants
