期刊文献+

分簇VLIW DSP上支持单双字模式选择的SIMD编译优化 被引量:2

SIMD compiler optimization by selecting single or double word mode for clustered VLIW DSP
下载PDF
导出
摘要 BWDSP100是一款采用超长指令字(VLIW)和单指令多数据流(SIMD)架构的针对高性能计算领域而设计的32位静态标量数字信号处理器,其指令级并行(ILP)主要是通过其特殊的分簇体系结构和SIMD指令来实现,然而现有的编译框架无法对这些特殊的SIMD指令提供支持。由于BWDSP100拥有丰富的SIMD向量化资源,且其所运用的雷达数字信号处理领域对程序的性能要求极高,因此针对BWDSP100结构的特点,在传统Open64编译器中SIMD编译优化框架的基础上提出并实现了一种支持单双字模式选择的SIMD编译优化算法,通过该算法可以显著提高一些在DSP上有着广泛运用计算密集型程序的性能。实验结果表明,与优化前相比,该算法方案在BWDSP编译器上的实现能够平均取得5.66的加速比。 BWDSP100 is a 32-bit static scalar Digital Signal Processor (DSP) with Very Long Instruction Word (VLIW) and Single Instruction Multiple Data (SIMD) features, which is designed for high-performance computing. Its Instruction Level Parallelism (ILP) is acquired though clustering and special SIMD instructions. However, the existing compiler framework can not provide support for these SIMD instructions. Since BWDSPIO0 has much SIMD vectorization resources and there are very high requirements in radar digital signal processing for the program performance, an SIMD optimization which surpported the selection of single or double word mode was put forward based on the traditional Open64 compiler according to the characteristics of BWDSP100 structure, and it can significantly improve the performance of some compute-intensive programs which are widely used in DSP field. The experimental results show that this algorithm can achieve speedup of 5.66 on average compared with before optimization.
出处 《计算机应用》 CSCD 北大核心 2015年第8期2371-2374,共4页 journal of Computer Applications
基金 国家"核高基"重大专项(2012ZX01034-001-001)
关键词 编译优化 指令级并行 分簇体系数字信号处理器 超长指令字 单指令多数据流 Open64编译器 compiler optimization Instruction Level Parallelism (ILP) multi-cluster Digital Signal Processor (DSP) Very Long Instruction Word (VLIW) Single Instruction Multiple Data (SIMD) Open64 compiler
  • 相关文献

参考文献12

  • 1CETC38.BWDSPl00硬件用户手册[R].合肥:中国电子科技集团第三十八研究所,2011:1-2.
  • 2CETC38.BWDSPl00软件用户手册[R].合HE:中国电子科技集团第三十八研究所,2011:181-191.
  • 3SUI Y. Open64 introduction [ EB/OL]. [ 2015- 03- 17]. http:// www. cse. unsw. edu. au/- ysui/saber/open64, pdf.
  • 4Open64. Open64 compiler WHIRL intermediate representation [ EB/ OL]. [2015-03-17]. http://www, mcs. anl. gov/OpenAD/open64A. pdf.
  • 5Open64. Using the x86 Open64 compiler suite [ EB/OL]. [ 2015- 03-17]. http://amd-dev, wpengine, netdna-cdn, eom/wordpress/ media/2012/10/open64, pdf.
  • 6SIMD [ EB/OL]. [ 2015-03-17]. http://en, wikipedia, org/wiki/ SIMD.
  • 7CHENG G, LAM M S. An optimizer for multimedia instruction sets [ C/OL] // Proceedings of the 2nd SUIF Compiler Workshop. Stan- ford: Stanford University, 1997 [ 2015 - 01 - 16]. http://www- suif. stanford, edu/suifconf/suifcont2/.
  • 8KRALL A, LELAIT S. Compilation techniques for multimedia pro- cessors [ J]. International Journal of Parallel Programming, 2000, 28(4): 347 -361.
  • 9LARSEN S, AMARASINGHE S. Exploiting superword level paral- lelism with multimedia instruction sets [ J]. ACM SIGPLAN No- tices, 2000, 35(5) : 145 - 156.
  • 10王昊,黄光红,王向前.基于BWDSP100的传播分簇算法研究与实现[J].中国集成电路,2014,23(8):24-28. 被引量:4

二级参考文献1

共引文献8

同被引文献7

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部