Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make an efficient CUDA microbenchmark framework #11

Open
Red-Portal opened this issue Dec 20, 2017 · 1 comment
Open

Make an efficient CUDA microbenchmark framework #11

Red-Portal opened this issue Dec 20, 2017 · 1 comment

Comments

@Red-Portal
Copy link
Member

Red-Portal commented Dec 20, 2017

Make a efficient CUDA micro benchmark framework

The current workflow of writing/optimizing CUDA kernels is very difficult because there is no proper, consistent way of measuring the performance of kernels.
A simple and consistent tool to measure and profile CUDA kernels is required.

Requirements

  • Automatic measuring of FLOPS (probably using nvprof)
  • Measuring of parallel scaling
  • Simple, nutshell API
  • Plotting the benchmark reports (probably using pyplot, gnuplot)
@Red-Portal Red-Portal changed the title Make a efficient CUDA micro benchmark framework Make a efficient CUDA microbenchmark framework Dec 20, 2017
@Red-Portal Red-Portal changed the title Make a efficient CUDA microbenchmark framework Make an efficient CUDA microbenchmark framework Dec 27, 2017
@Red-Portal
Copy link
Member Author

working on this on a separate repository https://github.com/MGfoundation/mgbench

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant