Skip to content

Memory issues with larger numbers of particles and replicated meshes #34

@Waqar-ukaea

Description

@Waqar-ukaea

Right now AEGIS relies upon a replicating the mesh for each MPI rank. For large enough meshes and enough MPI ranks this causes me to run out of memory on my local system (~32GB). I don't recall running into memory issues when working on HPC systems such as CSD3 though.

It might be worth investigating methods of reducing this memory overhead. Options to consider:

  • Use valgrind-massif to investigate memory usage at runtime.
  • Use valgrind-memcheck to check for memory leaks.
  • Implementing a hybrid MPI + OpenMP parallel algorithm so that a single MPI rank is allocated per node and OpenMP threads are used to access the same mesh available to each MPI rank.
  • MOAB has some parallel mesh related operations in its documentation - so another route could be to explore what options I have here. Could be a route in for a distributed mesh?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions