-
Notifications
You must be signed in to change notification settings - Fork 582
Description
Environment
Ubuntu 16.04.4 LTS
gcc/g++ 5.4.0 20160609
Python 2.7
CUDA 8.0
cuDNN 7.1.4
gpgpu-sim_distribution dev branch
pytorch-gpgpu-sim (removed the git dependencies of nervanagpu @ d4eefd5, since the repo is no longer there)
PYTORCH_BIN /usr/lib/x86_64-linux-gnu/libcudnn.so
The configurations used are from the configs folder of gpgpu-sim dev branch
MNIST
I use the MNIST sample from here. (Following deval281shah 's suggestions in another discussion. )
https://github.com/gpgpu-sim/gpgpu-sim_simulations
Config
I use TITANV config.
I also tried TITANX config, but a deadlock happened with that configuration.
The simulation runs for 39 minutes, and I checked it has stimulated a number of kernels, and it reported some related information such as IPC.
In the beginning, it generated a large amount of .. it cannot find all device function required.
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_ct
Warning: cannot find deviceFun maxwell_zgemmBatched_64x32_raggedMn_ct
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_cn
Warning: cannot find deviceFun maxwell_zgemmBatched_64x32_raggedMn_cn
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_tc
Warning: cannot find deviceFun maxwell_zgemmBatched_64x32_raggedMn_tc
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_tt
Warning: cannot find deviceFun maxwell_zgemmBatched_64x32_raggedMn_tt
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_tn
Warning: cannot find deviceFun maxwell_zgemmBatched_64x32_raggedMn_tn
Warning: cannot find deviceFun maxwell_zgemmBatched_32x32_raggedMn_nc
Therefore, cudaLaunchKernel fails to find the device function.
However, it ends up with
...
GPGPU-Sim PTX: Setting up arguments for 4 bytes starting at 0x7ffc0b96aa68..
GPGPU-Sim PTX: cudaLaunch for 0x0x4321f0 (mode=performance simulation) on stream 0
GPGPU-Sim PTX: ERROR launching kernel -- no PTX implementation found for 0x4321f0
Does anyone encounter this problem before? Any suggestions?
Thank you so much for your help.