Make an efficient CUDA microbenchmark framework #11

Red-Portal · 2017-12-20T14:16:01Z

Make an efficient CUDA microbenchmark framework

Writing and optimizing CUDA kernels is currently difficult because there is no consistent way to measure kernel performance.
A simple, consistent tool for measuring and profiling CUDA kernels is required.

Requirements

Automatic measurement of FLOPS (probably using nvprof)
Measurement of parallel scaling
Simple, minimal API
Plotting of benchmark reports (probably using pyplot or gnuplot)
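
One possible shape for the timing, FLOPS, and scaling requirements above, as a minimal host-side C++ sketch. All names here (`BenchResult`, `run_benchmark`, `scaling_efficiency`) are hypothetical and are not taken from mgbench. The sketch also assumes the FLOP count is supplied by the caller rather than collected automatically through nvprof, and that a CUDA kernel would be wrapped in a callable that launches it and then calls `cudaDeviceSynchronize()`, so the host timer covers the whole execution.

```cpp
#include <algorithm>
#include <chrono>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical result type; the field names are illustrative only.
struct BenchResult {
    double min_seconds;   // fastest timed run
    double mean_seconds;  // average over all timed runs
    double flops;         // flop_count / min_seconds, in FLOP/s
};

// Runs `fn` `warmup` times untimed, then `reps` times timed.
// `flop_count` is an assumption supplied by the caller
// (for example 2*N for an N-element SAXPY).
template <typename Fn>
BenchResult run_benchmark(Fn&& fn, double flop_count,
                          std::size_t warmup = 3, std::size_t reps = 10) {
    using clock = std::chrono::steady_clock;
    reps = std::max<std::size_t>(reps, 1);  // always take at least one sample

    for (std::size_t i = 0; i < warmup; ++i) fn();

    std::vector<double> samples;
    samples.reserve(reps);
    for (std::size_t i = 0; i < reps; ++i) {
        const auto start = clock::now();
        fn();
        const auto stop = clock::now();
        samples.push_back(std::chrono::duration<double>(stop - start).count());
    }

    const double min_s = *std::min_element(samples.begin(), samples.end());
    const double mean_s =
        std::accumulate(samples.begin(), samples.end(), 0.0) / samples.size();
    // Guard against a zero reading from a coarse clock.
    return {min_s, mean_s, min_s > 0.0 ? flop_count / min_s : 0.0};
}

// Strong-scaling efficiency: measured speedup over the baseline divided by
// the ratio of resources used (threads, blocks, or devices). 1.0 is ideal.
inline double scaling_efficiency(double base_seconds, double base_units,
                                 double seconds, double units) {
    const double speedup = base_seconds / seconds;
    return speedup / (units / base_units);
}
```

A call such as `run_benchmark([&] { launch_kernel(); }, 2.0 * n)` would return the timings and an estimated FLOP/s figure, where `launch_kernel` is a placeholder for user code. Host-side timing with synchronization is only one option; CUDA events (`cudaEventRecord`, `cudaEventElapsedTime`) are a common alternative that times on the device side.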

Red-Portal · 2018-01-14T16:30:55Z

Working on this in a separate repository: https://github.com/MGfoundation/mgbench

Red-Portal added C++ CUDA TODO labels Dec 20, 2017

Red-Portal changed the title ~~Make a efficient CUDA micro benchmark framework~~ Make a efficient CUDA microbenchmark framework Dec 20, 2017

Red-Portal changed the title ~~Make a efficient CUDA microbenchmark framework~~ Make an efficient CUDA microbenchmark framework Dec 27, 2017
