
Extending the Vector Instruction Set for High-Performance DSP Matrixes Based on GCC (cited by: 9)
Abstract: Autovectorization is a compiler optimization that improves program parallelism. As computing platforms built on SIMD processors have become widespread, autovectorization has become an active topic in compiler research. GCC is an open-source, cross-platform compiler. Building on GCC's internal autovectorization algorithm and on the architecture and instruction set characteristics of the Matrix chip, this paper extends the GCC back end with the Matrix vector instruction set and implements basic autovectorization support. Test results show that the extended compiler supports the Matrix vector instruction set, performs basic autovectorization, and supports developing Matrix-based parallel programs through built-in functions.
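The two development styles named in the abstract, automatic vectorization of loops and explicit vector programming, can be illustrated with a minimal sketch. The abstract does not list the Matrix built-in functions, so this sketch assumes GCC's generic vector extension (`vector_size`) as a stand-in. The names `add_scalar`, `add_vector`, and `v4sf` are hypothetical and do not come from the paper.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Scalar loop with no loop-carried dependence. The restrict qualifiers
 * tell the compiler the arrays do not overlap, which makes the loop a
 * candidate for GCC's autovectorizer (e.g. -O3, or -O2 -ftree-vectorize). */
void add_scalar(float *restrict dst, const float *restrict a,
                const float *restrict b, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = a[i] + b[i];
}

/* GCC generic vector type: four floats packed into 16 bytes.
 * Assumption: this stands in for a target-specific vector type. */
typedef float v4sf __attribute__((vector_size(16)));

/* Explicitly vectorized version: processes four elements per iteration.
 * For simplicity, n must be a multiple of 4. */
void add_vector(float *dst, const float *a, const float *b, size_t n)
{
    assert(n % 4 == 0);
    for (size_t i = 0; i < n; i += 4) {
        v4sf va, vb, vc;
        memcpy(&va, a + i, sizeof va);   /* load without alignment assumptions */
        memcpy(&vb, b + i, sizeof vb);
        vc = va + vb;                    /* element-wise add on the vector type */
        memcpy(dst + i, &vc, sizeof vc); /* store the four results */
    }
}
```

In a back end such as the one the paper describes, the element-wise add would be matched to a target vector instruction. On other targets GCC lowers it to whatever SIMD or scalar code is available, so both functions produce the same results.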
Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, Peking University Core Journal, 2012, No. 1, pp. 58-63 (6 pages)
Keywords: GCC; Matrix; autovectorization

References (4)

  • 1. Naishlos D. Autovectorization in GCC [C] ∥ Proc of GCC Developers' Summit, 2003.
  • 2. Eichenberger A E, Wu Peng, O'Brien K. Vectorization for SIMD Architectures with Alignment Constraints [C] ∥ Proc of the ACM SIGPLAN'04, 2004.
  • 3. Nuzman D, Henderson R. Multi-Platform Auto-Vectorization [C] ∥ Proc of the International Symposium on Code Generation and Optimization, 2006: 26-29.
  • 4. Stallman R M. GCC Inter for Version 4.4.5 [M]. The GCC Developer Community, GNU Press, 2008.

