-
Notifications
You must be signed in to change notification settings - Fork 33
Open
Description
cuda_memtest seems to abort with "out of memory" (line 148 in cuda_memtests.cu) when run in a container (nvidia-docker1 and 2) on V100 GPUs.
The problem might be a general one or just triggered in PIConGPU. Needs investigation. Maybe just multiple-times assigned from mpiInfo...
Occurred with a 4 & 8 GPU PIConGPU lwfa example on a DGX-1.