Description
There were 3 separate threads about this issue on the Open MPI mailing list:
- https://mail-archive.com/[email protected]/msg30581.html
- https://www.mail-archive.com/[email protected]//msg30593.html
- https://www.mail-archive.com/[email protected]/msg30732.html
Short version:
It looks like this app (CP2K -- https://www.cp2k.org/) is experiencing a large slowdown with regards to MPI_ALLOC_MEM/MPI_FREE_MEM. Users cite that profiling has shown that it is spending 70% of its time in ALLOC_MEM, for example.
It's not immediately clear if MPI_ALLOC_MEM is being called too often (e.g., for buffers that don't really need to be registered), or if simply the switch to actively register/deregister buffers for every alloc / dealloc is significantly more expensive than Open MPI's usual lazy model of registration / deregistration.
Opening this issue to track the progress.
@loveshack @hiliev @JingchaoZhang @ggouaillardet @hjelmn @bosilca Feel free to tag others if they are interested.