VkFFT now supports radix 3 and 5 in FFTs for better sequence lengths coverage

Follow the full discussion on Reddit.
Hello, I am the creator of VkFFT library. This update brings another important milestone - radix 3 and 5 support, which greatly increases sequence length coverage of VkFFT. For example, it is now possible to do FFT of modern display resolutions, like 1920x1080, which will be useful in VkResample in the future. If needed, adding other radix kernels (7, 11... etc) requires only kernel code now, as the engine is enhanced to handle non-full thread warps and can stagnate some threads during execution (as different radix kernels require different amount of registers - hence not all threads will be active for bigger radix kernels). The amount of used registers changes depending on the radix kernels used in one stage and precision. In the future it can be refined for different vendors and architectures as well.

Comments

There's unfortunately not much to read here yet...

Discover the Best of Machine Learning.

Ever having issues keeping up with everything that's going on in Machine Learning? That's where we help. We're sending out a weekly digest, highlighting the Best of Machine Learning.

Join over 900 Machine Learning Engineers receiving our weekly digest.

Best of Machine LearningBest of Machine Learning

Discover the best guides, books, papers and news in Machine Learning, once per week.

Twitter