Mandelbrot set - via GPU

Warning: your GPU will run at 100% utilization at up to 90% TDP. You will need at least a 1600 W PSU, and two separate 15 A / 1800 W breakers on two lines if running multiple servers.
To build on the NVIDIA DGX Spark:
nvcc -o kernel kernel.cu
or on the RTX-3500 Ada, running Visual Studio on the Lenovo P1 Gen 6:
nvcc.exe -gencode=arch=compute_52,code=\"sm_52,compute_52\" --use-local-env -ccbin "C:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.42.34433\bin\HostX64\x64" -x cu -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.6\include" -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.6\include" --keep-dir fractal\x64\Release -maxrregcount=0 --machine 64 --compile -cudart static -DWIN32 -DWIN64 -DNDEBUG -D_CONSOLE -D_MBCS -Xcompiler "/EHsc /W3 /nologo /O2 /FS /MD " -Xcompiler "/Fdfractal\x64\Release\vc143.pdb" -o C:\wse_github\ObrienlabsDev\fractals\fractal\fractal\x64\Release\kernel.cu.obj "C:\wse_github\ObrienlabsDev\fractals\fractal\kernel.cu"
- Block size 16 x 16; 5000 iterations (runs), each generating a 4096 x 4096 image with a maximum Mandelbrot iteration count of 8192
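
The parameters above can be sketched as a single kernel launch. This is a minimal illustration, not the repository's `kernel.cu`: the names `escapeTime` and `mandelbrotKernel`, the viewport (real axis -2 to 1, imaginary axis -1.5 to 1.5), and the use of double precision are all assumptions. It renders one 4096 x 4096 image with 16 x 16 thread blocks and a cap of 8192 iterations; the benchmark repeats that work 5000 times.

```cuda
// Sketch only: not the repository's kernel.cu. The viewport, double precision,
// and every name below are assumptions made for illustration.
#include <cstdio>
#include <cuda_runtime.h>

// Escape-time count for c = cx + i*cy: iterate z = z^2 + c from z = 0
// until |z| > 2 or maxIter is reached.
__host__ __device__ inline int escapeTime(double cx, double cy, int maxIter) {
    double x = 0.0, y = 0.0;
    int i = 0;
    while (i < maxIter && x * x + y * y <= 4.0) {
        const double xt = x * x - y * y + cx;
        y = 2.0 * x * y + cy;
        x = xt;
        ++i;
    }
    return i;
}

// One thread per pixel; threads outside the image return early.
__global__ void mandelbrotKernel(int* out, int width, int height, int maxIter) {
    const int px = blockIdx.x * blockDim.x + threadIdx.x;
    const int py = blockIdx.y * blockDim.y + threadIdx.y;
    if (px >= width || py >= height) return;
    // Assumed viewport: real axis [-2, 1), imaginary axis [-1.5, 1.5).
    const double cx = -2.0 + 3.0 * px / width;
    const double cy = -1.5 + 3.0 * py / height;
    out[py * width + px] = escapeTime(cx, cy, maxIter);
}

int main() {
    const int width = 4096, height = 4096, maxIter = 8192;
    const size_t bytes = static_cast<size_t>(width) * height * sizeof(int);

    int* dOut = nullptr;
    if (cudaMalloc(&dOut, bytes) != cudaSuccess) {
        std::fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    // 16 x 16 threads per block; round the grid up to cover the whole image.
    const dim3 block(16, 16);
    const dim3 grid((width + block.x - 1) / block.x,
                    (height + block.y - 1) / block.y);
    mandelbrotKernel<<<grid, block>>>(dOut, width, height, maxIter);

    // Check for launch errors first, then for errors during execution.
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess) err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        cudaFree(dOut);
        return 1;
    }

    // Copy back the centre pixel as a spot check.
    int centre = 0;
    const size_t centreIdx = static_cast<size_t>(height / 2) * width + width / 2;
    cudaMemcpy(&centre, dOut + centreIdx, sizeof(int), cudaMemcpyDeviceToHost);
    std::printf("centre pixel iteration count: %d\n", centre);

    cudaFree(dOut);
    return 0;
}
```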
| perf (vs RTX-5000) | sec | sec/run | # GPUs | % GPU | Watts | % TDP | Chip | Cores | GPU spec |
|---|---|---|---|---|---|---|---|---|---|
| 11.7 | 23 | .0092 | 2 | 99 | 904 | 94 | AD102 | 32768 | dual RTX-4090 Ada 48G (no NVLink) |
| 5.85 | 46 | .0092 | 1 | 99 | 452 | 94 | AD102 | 16384 | RTX-4090 Ada 24G |
| 3.44 | 78 | .0312 | 2 | 99 | 388 | 97 | GA102 | 14336 | dual RTX-A4500 with NVLink (not used) 40G |
| 2.66 | 100 | .02 | 1 | 99 | 304 | 102 | GA102 | 10752 | RTX-A6000 48G |
| 2.09 | 128 | .0256 | 1 | 91 | 103 (197 system) | ? | GB10 | 6144 | DGX Spark 128G - CUDA 13.0 |
| 1.72 | 156 | .0312 | 1 | 99 | 194 | 97 | GA102 | 7168 | RTX-A4500 20G old |
| 1.49 | 180 | | 2 | 92 | ? | ? | M3 Ultra 60 | 7680 | Mac Studio 3 M3Ultra 96G |
| 1.41 | 191 | .0382 | 1 | 99-68 | 102 | ? | AD104 | 5120 | RTX-3500 Ada 12G (thermal throttling) |
| 1.29 | 208 | .0416 | 1 | 99 | 143 | 102 | GA104 | 6144 | RTX-A4000 16G old |
| 1.16 | 231 | .0462 | 1 | 98 | 120 | ? | M4 Max 40 | 5120 | Macbook Pro 16 M4Max 48G |
| 1 | 269 | .0538 | 1 | 99 | 105 | ? | TU104 | 3072 | RTX-5000 16G |
| 0.78 | 344 | .0688 | 2 | 96 | 120 | ? | M2 Ultra 60 | 7680 | Mac Studio 2 M2Ultra 64G |
| 0.47 | 571 | .1142 | 1 | 79-98 | ? | ? | M4 Pro 16 | 2048 | Mac Mini M4 Pro 24G |
| 0.39 | 693 | .1386 | 1 | 95 | ? | ? | M1 Max 32 | 4096 | Macbook Pro 16 M1Max 32G |
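
The first three columns appear to be related by simple arithmetic. This is an inference from the numbers, not something the source states: the seconds column is the wall time for all 5000 iterations, the per-run time is that total divided by 5000, and the perf value is the speedup relative to the RTX-5000 row (269 s). For the single RTX-4090 row:

$$
\text{per-run time} = \frac{46\ \text{s}}{5000} = 0.0092\ \text{s},
\qquad
\text{perf} = \frac{269\ \text{s}}{46\ \text{s}} \approx 5.85
$$

The dual-GPU NVIDIA rows list the same per-run time as their single-GPU counterparts, which suggests that figure is per GPU in those rows.
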
Current draw of 10.5 A with two RTX-4090 GPUs generating Mandelbrot set images

