Implement Fast Fourier Transform CUDA kernel #2
Comments
@mujjingun Do you consider this issue solved?
Not yet. There is still some work to be done, namely implementing padded (upsampling) FFTs and 2D/3D FFTs.
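For readers unfamiliar with the term, here is a minimal CPU sketch of a padded FFT. It assumes "padded (upsampling)" means zero-padding the input before the transform so that the spectrum is sampled on a finer grid; the thread does not confirm that reading. The names `next_pow2`, `fft_radix2`, and `padded_fft` are hypothetical and are not part of this library, and the code is plain C++ rather than a CUDA kernel.

```cpp
#include <cmath>
#include <complex>
#include <cstddef>
#include <utility>
#include <vector>

using cd = std::complex<double>;

// Smallest power of two that is >= n (returns 1 for n == 0).
std::size_t next_pow2(std::size_t n) {
    std::size_t p = 1;
    while (p < n) p <<= 1;
    return p;
}

// In-place iterative radix-2 Cooley-Tukey FFT.
// Assumption: a.size() is a power of two.
void fft_radix2(std::vector<cd>& a) {
    const std::size_t n = a.size();
    // Bit-reversal permutation.
    for (std::size_t i = 1, j = 0; i < n; ++i) {
        std::size_t bit = n >> 1;
        for (; j & bit; bit >>= 1) j ^= bit;
        j ^= bit;
        if (i < j) std::swap(a[i], a[j]);
    }
    const double pi = std::acos(-1.0);
    // Butterfly passes over blocks of length 2, 4, ..., n.
    for (std::size_t len = 2; len <= n; len <<= 1) {
        const cd wlen = std::polar(1.0, -2.0 * pi / static_cast<double>(len));
        for (std::size_t i = 0; i < n; i += len) {
            cd w(1.0, 0.0);
            for (std::size_t k = 0; k < len / 2; ++k) {
                const cd u = a[i + k];
                const cd v = a[i + k + len / 2] * w;
                a[i + k] = u + v;
                a[i + k + len / 2] = u - v;
                w *= wlen;
            }
        }
    }
}

// Zero-pads `input` to `factor` times the next power of two, then transforms.
// Assumption: `factor` is itself a power of two (1, 2, 4, ...).
std::vector<cd> padded_fft(const std::vector<cd>& input, std::size_t factor) {
    std::vector<cd> buf(input);
    buf.resize(next_pow2(input.size()) * factor, cd(0.0, 0.0));
    fft_radix2(buf);
    return buf;
}
```

With `factor = 2`, every second output bin coincides with a bin of the unpadded transform, and the bins in between are interpolated samples of the same spectrum.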
@mujjingun OK.
@mujjingun cuFFT has some serious "warming up" issues. On multi-GPU servers, even construction takes 1000 ms or so. I think implementing our own parallel FFT, with upsampling built in, is an option worth considering.
@Red-Portal Implementing our own FFT would be great. But cuFFT offers advanced features, such as FFTs of non-power-of-two sizes, that could be a challenge for us to implement, so I think it would be better to work around the problem somehow, for example by caching.
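The caching workaround could look roughly like the sketch below, which assumes the startup cost is paid each time a plan is constructed and that reusing one plan per transform size is acceptable. `FftPlan` and `PlanCache` are hypothetical names, not part of this library or of cuFFT; in real code `FftPlan` would wrap the cuFFT handle, whereas here it only records its size so the example runs without a GPU.

```cpp
#include <cstddef>
#include <map>
#include <memory>
#include <mutex>

// Hypothetical stand-in for an FFT plan that is expensive to construct.
struct FftPlan {
    std::size_t size;
    explicit FftPlan(std::size_t n) : size(n) {
        // The expensive setup would happen here.
    }
};

// Keeps one plan per transform size so setup is paid once per size.
class PlanCache {
public:
    // Returns the cached plan for size n, constructing it on first use.
    std::shared_ptr<FftPlan> get(std::size_t n) {
        std::lock_guard<std::mutex> lock(mutex_);
        auto it = plans_.find(n);
        if (it != plans_.end()) return it->second;
        auto plan = std::make_shared<FftPlan>(n);
        ++constructions_;
        plans_.emplace(n, plan);
        return plan;
    }

    // Number of plans actually constructed so far.
    std::size_t constructions() const {
        std::lock_guard<std::mutex> lock(mutex_);
        return constructions_;
    }

private:
    mutable std::mutex mutex_;
    std::map<std::size_t, std::shared_ptr<FftPlan>> plans_;
    std::size_t constructions_ = 0;
};
```

A cache like this only hides the cost of repeated construction; the first request for each size still pays the full setup time, so it may need to be combined with eager construction at startup.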
Implement Fast Fourier Transform CUDA kernel or add cuFFT into the library.
preliminaries