I am changing the GPU configs to test the performance under different configurations. When I use a config with increased dram_atom_size, I found that all banks report an access count of 0, even if they actually have memory accesses.
I found that the issue is caused by trying to use ceil() on the division of two unsigned values, which does not help at all.
The same problem could also apply to other statistics.