期刊文献+

软件流水向量化算法研究 被引量:2

Software Pipelining Vectorization Algorithm
下载PDF
导出
摘要 描述了开源高性能编译器Open64及向量化框架,给出了多循环层进行依赖关系分析算法和收益分析方法。结合面向DSP架构的SLP向量化框架,给出了一种应用软件流水的向量化算法。实验结果表明软件流水向量化算法适用于某些计算密集的DSP应用,最高加速比达到14.2。 The open source high performance compiler Open64 and its vectorization framework are described and analyzed. The dependency analysis algorithm and benefit analysis for loop nest are presented. Based on SLP vectorization framework,a software pipelining vectorization algorithm is presented for DSP Architecture. Experimental results show that the software pipelining vectorization algorithm is suitable for compute-intensive DSP applications,with the maximum speedup hitting 14. 2.
出处 《信息工程大学学报》 2015年第6期753-759,共7页 Journal of Information Engineering University
基金 国家863计划资助项目(2009AA01220)
关键词 SLP 循环展开 软件流水 寄存器重用 superword level parallel loop unrolling software pipelining register reuse
  • 相关文献

参考文献9

  • 1Franchetti F, Kral S, Lorenz J, et al. Efficient utilization of SIMD extensions [ C ]//Proceedings of the IEEE. 2005 : 409-425.
  • 2Instruments T. TMS320C6000 CPU and instruction set ref- erence guide [ C ]//Literature Number : SPRU 189F, 2000 : 62 -67.
  • 3古志民,田小永.基于StarCore SC140混合编程的方法[J].计算机应用,2002,22(9):129-130. 被引量:1
  • 4Fridman J, Greenfield Z. The tigersharc DSP architecture [J]. IEEE micro, 2000, 20(1): 66-76.
  • 5Naishlos D. Autovectorization in GCC [ C ]//Proceedings of the 2004 GCC Developers Summit. 2004: 105-118.
  • 6Allen R, Kennedy K. Optimizing compilers for modern architectures: a dependence-based approach [ M ]. San Francisco: Morgan Kaufmann, 2002.
  • 7Samuel Larsen, Saman Amarasinghe. Exploiting superword level parallelism with multimedia instruction sets [ C ]// Proceedings of the ACM SIGPLAN Conference on Pro- gramming Language Design and Implementation. 2000: 145-156.
  • 8Nuzman D, Zaks A. Outer-loop vectorization: revisited for short simd architectures [ C ]//Proceedings of the 17th international conference on Parallel architectures and compilation techniques. 2008 : 2-11.
  • 9Tenllado C, Pinuel L, Prieto M, et al. Pack Transposi- tion: Enhancing Superword Level Parallelism Exploitation [ C ]//ParCo. 2005 : 573-580.

二级参考文献2

  • 1.CodeWarriorMetrowerksEnterpriseCCompilerUser′sManual[].wwwmetroworkscom.
  • 2.SC140DSPCoreReferenceManual[].wwwmotorolacom/sps/dsp.

同被引文献11

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部