-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mallocMC with alpaka is slow #183
Comments
It looks like compile time parameter will be passed at runtime and loaded from the global memory: The code above show a snapshot where https://github.com/ComputationalRadiationPhysics/mallocMC/blob/31f9b3c3f1cf919a96312a30ab9aa223f41fed67/src/include/mallocMC/creationPolicies/Scatter_impl.hpp#L74-L98 is load
|
I forgot to remove the |
It is not a mallocMC issue. The CMake CUDA module is not forwarding flags from
|
Since #173 we added alpaka to mallocMC the performance is lower than before.
I tested https://github.com/ComputationalRadiationPhysics/mallocMC/tree/31f9b3c3f1cf919a96312a30ab9aa223f41fed67 with patches from #180
The performance with PIConGPU for the spec example is 28% lower.
mallocMC with alpaka
current PIConGPU with old alpaka
I currently investigating where the performance decrease is coming from.
The text was updated successfully, but these errors were encountered: