Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Removing alloca on GPU #618

Open
wants to merge 7 commits into
base: master
Choose a base branch
from

Conversation

DiamonDinoia
Copy link
Collaborator

@paquiteau and @Lenoush have noticed that alloca made things slower in their benchmarks while greatly reducing memory consumption.

Details are in #570 and mind-inria/mri-nufft-benchmark#5

Instead of using opts.gpu_* to switch with the old implementation it is better to use kernel dispatch and have pre-compiled kernels for the various scenarios. As per CPU code. One less parameter that the user has to worry about and it can obtain both higher performance and low memory consumption at the same time.

@paquiteau, @Lenoush can you benchmark this branch and let us know how it fares? I could not measure a meaningful difference with my custom code.

@DiamonDinoia DiamonDinoia requested a review from blackwer January 30, 2025 20:53
@DiamonDinoia DiamonDinoia changed the title Removing alloca to GPU Removing alloca on GPU Jan 30, 2025
@DiamonDinoia DiamonDinoia requested a review from janden January 30, 2025 21:02
@paquiteau
Copy link

paquiteau commented Jan 31, 2025

Hello @DiamonDinoia ! Interesting stuff :)
I will have a look in the next coming days with @chaithyagr as well

PS: @Lenoush's contract ended so she does not work on nuffts anymore

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants