-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Invalid results of type 1 transform into (64, 64, 64) grid on A100 GPU #575
Comments
Smaller reproducer with just one point:
The spectra should be 1 everywhere, which it is for |
What happens if we use GM instead of SM? https://finufft.readthedocs.io/en/latest/c_gpu.html#options-for-gpu-code gpu_method should be supported in python too. |
With
A100:
On T4 all good:
|
@janden could you provide the command to do a debug build with pip? I saw this type of errors when using debug symbols. In my tests if I compile with @pavel-shmakov could you try a bigger eps? 1e-2 or 1e-3? |
|
@pavel-shmakov for the local compilation which version of CUDA are you using? If we move to email we could share binary wheels with different flags to narrow down the issue |
We've encountered an issue where
cufinufft.nufft3d1
outputs wildly incorrect results for very specific inputs and only on certain GPUs. This can be reproduced by running the following code on an A100 GPU:Here's an archive with
points.pt
andvalues.pt
: inputs.zipThe value is many orders of magnitude greater than it should be. It also grows quickly with decreasing
eps
.Notes:
libcufinufft.so
is built, that would be helpful, and we can investigate further!The text was updated successfully, but these errors were encountered: