Research on FFT Algorithm Based on OpenCL (基于OpenCL的FFT算法研究)

Cited by: 2
Abstract: The fast Fourier transform (FFT) is a commonly used computational tool in image processing, especially in image restoration algorithms. It converts time-domain computation into frequency-domain computation and is of great significance in engineering applications. By partitioning the work into multithreaded blocks and applying a parallel mapping method, the FFT algorithm can be parallelised to the greatest possible degree. With optimisations targeted at the OpenCL memory hierarchy and at the algorithm itself, a marked speedup is obtained on an AMD GPU platform. The performance of the optimised algorithm improves by 7 times over a CPU platform of the same processing capability and by 4 times over CUDA of the same processing capability.
Authors: Jia Ge (贾格), Peng Xianrong (彭先蓉), Zuo Haorui (左颢睿) (Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, Sichuan, China; Graduate University of Chinese Academy of Sciences, Beijing 100039, China)
Source: Computer Applications and Software (《计算机应用与软件》), 2017, No. 3, pp. 233-237, 283 (6 pages)
Keywords: fast Fourier transform; OpenCL; GPU; parallel speedup
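
The abstract's claim that the FFT can be mapped almost entirely onto parallel threads can be illustrated with a minimal sketch. This is not the authors' OpenCL implementation, and it uses none of their memory-hierarchy optimisations; it is a plain Python radix-2 decimation-in-time FFT, written on the assumption that each butterfly index `k` stands in for one GPU work-item. The function names (`bit_reverse`, `butterfly`, `fft_radix2`, `naive_dft`) are hypothetical and chosen only for this illustration. Within one stage, every butterfly reads and writes its own pair of elements, so all `n/2` butterflies of that stage are independent; only the stages themselves must run in order.

```python
import cmath


def bit_reverse(i, bits):
    """Reverse the lowest `bits` bits of i (input reordering for an in-place FFT)."""
    r = 0
    for _ in range(bits):
        r = (r << 1) | (i & 1)
        i >>= 1
    return r


def butterfly(data, stage, k):
    """Apply butterfly number k of the given stage, in place.

    Each k touches a distinct (top, bottom) pair, so the n/2 butterflies of one
    stage do not depend on each other. In a GPU kernel, k would be the
    work-item index (an assumption of this sketch).
    """
    half = 1 << stage                 # distance between the paired elements
    span = half << 1                  # size of each sub-transform at this stage
    group, j = divmod(k, half)
    top = group * span + j
    bottom = top + half
    w = cmath.exp(-2j * cmath.pi * j / span)   # twiddle factor
    t = w * data[bottom]
    data[bottom] = data[top] - t
    data[top] = data[top] + t


def fft_radix2(x):
    """Iterative radix-2 FFT; the length of x must be a power of two."""
    n = len(x)
    bits = n.bit_length() - 1
    if n == 0 or (1 << bits) != n:
        raise ValueError("length must be a power of two")
    data = [complex(x[bit_reverse(i, bits)]) for i in range(n)]
    for stage in range(bits):         # stages run one after another
        for k in range(n // 2):       # these iterations could run in parallel
            butterfly(data, stage, k)
    return data


def naive_dft(x):
    """O(n^2) reference DFT, used only to check the FFT result."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * f * t / n) for t in range(n))
        for f in range(n)
    ]


if __name__ == "__main__":
    signal = [1, 2, 3, 4, 0, -1, -2, 5]
    fast = fft_radix2(signal)
    slow = naive_dft(signal)
    print(max(abs(a - b) for a, b in zip(fast, slow)) < 1e-9)
```

The inner loop over `k` is the part that a block-partitioned GPU mapping would distribute across threads; the sequential loop over `stage` corresponds to the points where a real kernel would need synchronisation between work-items.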