Introducing ToPe-FFT: An OpenCL-based FFT library targeting GPUs.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE(2017)

引用 3|浏览32
暂无评分
摘要
In this paper, we present our implementation of the fast Fourier transforms on graphic processing unit (GPU) using OpenCL. This implementation of the FFT (ToPe-FFT) is based on the Cooley-Tukey set of algorithms with support for 1D and higher dimensional transforms using different radices. Factorization for mix-radices enables our code to target FFTs of near arbitrary length. In systems with multiple graphic cards (GPUs), the library automatically balances the FFT computation thus achieving maximum resource utilization and higher speedup. Based on profiling and micro-benchmarking of ToPe-FFT, it is observed that the average speedup of our library for different sizes is 48x faster than the single CPU-based code using FFTW and 3x faster than NVIDIA's GPU-based cuFFT library.
更多
查看译文
关键词
fast fourier transform,GPU,high performance computing,OpenCL
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要