期刊文献+

基于异构多核可编程系统的大点FFT卷积设计与实现 被引量:15

Design and implementation of large FFT convolution on heterogeneous multicore programmable system
下载PDF
导出
摘要 如今FFT卷积广泛应用于数字信号处理,并且过去几年证实了异构多核可编程系统(HMPS)的发展。另外,HMPS已经成为DSP领域的主流趋势。因此,研究基于HMPS大点FFT卷积的高效地实现显得非常重要。基于重叠相加FFT卷积方法,设计一款针对输入数据流的高效流水重叠相加滤波器。介绍了基于HMPS的大点FFT卷积实现,获得了高精度的滤波效果。此外,采用流水技术的滤波器设计,提高系统处理速度、数据吞吐率和任务并行度。基于Xilinx XC7V2000T FPGA开发板上的实验表明,参与运算的采样点越大,系统的任务并行度、处理速度和数据吞吐率就会越高。当采样点达到1M时,系统的平均任务平行度达到了5.33,消耗了2.745×10~6个系统时钟周期数,并且绝对误差精度达到10^(-4)。 Nowadays FFT convolution is widely applied to digital signal processing(DSP), and the past few years have witnessed the development of the heterogeneous muhicore programmable system (HMPS). In addition, HMPS has been the mainstream in the field of DSP. So it is very important to study the high efficient implementation of large FFT convolution on the HMPS. In this paper, a high efficient pipelined overlap-add filter based on the overlap-add FFT convolution method is designed for the input streaming data. This paper introduces the implementation of large FFT convolution on the HMPS and achieves the high accuracy of filter re- sult. Furthermore, a pipeline technology is adopted for the filter design to improve processing speed, throughout and parallelism of tasks. The Xilinx XC7V2000T FPGA verification result shows that the larger sampling points are involved in computing, the higher task parallelism, processing speed and throughout will be obtained. When the sample points reach 1M, the system average task par- allelism is 5.33 with 2.745×10^6 clock cycles and the precision of 10^-4.
出处 《电子技术应用》 北大核心 2017年第3期16-20,共5页 Application of Electronic Technique
基金 国家自然科学基金(61106020)
关键词 FFT卷积 重叠相加 算法映射 任务并行度 异构多核 FFT convolution overlap-add algorithm mapping task parallelism heterogeneous muhicore
  • 相关文献

参考文献5

二级参考文献69

  • 1Mei Wen Nan Wu Hai-Yan Li Chun-Yuan Zhang.Multiple-Morphs Adaptive Stream Architecture[J].Journal of Computer Science & Technology,2005,20(5):635-646. 被引量:3
  • 2Kumar R, Tullsen D M, Ranganathan P, et al. Single-ISA Heterogeneous Multi Core Architectures for Multithreaded Workload Performance[C]//Proc of the 31st International Symposium on Computer Architecture, 2004 : 64-75.
  • 3Sun Fei, Ravi S, Raghunathan A. Application Specific Het erogeneous Multiprocessor Synthesis Using Extensible Pro cessors[J]. IEEE Transactions on CAD of Integrated Circuits and Systems, 2006, 25(9):1589-1602.
  • 4Sun Fei, Ravi S, Raghunathan A, et al. Synthesis of Application-Specific Heterogeneous Multiprocessor Architectures Using Extensible Processors[C]//Proc of the 18th International Conference on VLSI Design Held Jointly with 4th International Conference on Embedded Systems Design (VLSID' 05), 2005 : 551-556.
  • 5Prakash S,Parker A C. SOS: Synthesis of Application-Specific Heterogeneous Multiproeessor Systems[J]. Journal of Parallel and Distributed Computing, 1992,16(4):38-51.
  • 6Sarkar A, Chakrabarti P P, Kumar R. Frame Based Fair Multiprocessor Scheduler: A Fast Fair Algorithm for Real- Time Embedded Systems[C]//Proc of the 19th International Conference on VLSI Design held jointly with 5th International Conference on Embedded Systems Design (VLSID' 06), 2006:677-682.
  • 7Fisher N, Anderson J H, Baruah S. Task Partitioning upon Memory-Constrained Multiprocessors[C]//Proc of the llth IEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA'05), 2005 : 416-421.
  • 8Krste A. The Landscape of Parallel Computing Research: A View from Berkeley[R]. Technical Report No. UCB/EECS-2006 183, University of California,2006.
  • 9Wang D T. The CELL Microprocessor[EB/OL]. [2005-02-10]. http://www. realwordtech. com.
  • 10Micheli G D, Benini L. Networks-on-Chips: A New SoC Paradigm [J]. Computer, 2002, 35(1):70-78.

共引文献34

同被引文献51

引证文献15

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部