Skip to content
View amathews-amd's full-sized avatar

Block or report amathews-amd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Advanced Profiling and Analytics for AMD Hardware

Python 140 51 Updated Feb 28, 2025

Omnitrace: Application Profiling, Tracing, and Analysis

C++ 308 28 Updated Feb 28, 2025

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,606 376 Updated Dec 4, 2024
VBA 3 Updated Jan 24, 2023

Benchmarks to capture important workloads.

Python 29 23 Updated Jan 31, 2025

Convert nvprof profiles into about:tracing compatible JSON files

Python 68 13 Updated Apr 9, 2021
Showing results