期刊文献+

基于申威GCC编译器的间接预取算法

Indirect Prefetching Algorithm Based on Shenwei GCC Compiler
下载PDF
导出
摘要 对间接存储器的访问延迟往往会影响应用程序的执行性能,一种有效的解决方案是使用预取技术.国产申威平台中支持常规访问模式的软件预取和硬件预取机制,但是其GCC编译器中缺少为间接存储器访问模式自动插入预取的方法.为了解决这个问题,基于申威GCC开发了一个完整间接预取优化遍,它利用深度优先搜索算法查找引用循环归纳变量的间接内存引用并为之生成合适的软件预取.在一组内存受限的基准测试中,自动预取遍对SW1621处理器的平均加速比达到1.16倍. The delayed access to indirect memory often affects the execution performance of applications.An effective solution is to resort to the prefetching technology.Although the Shenwei platform developed in China supports the software and hardware prefetching mechanisms for conventional access modes,the compilers in its GNU compiler collection(GCC)lack the method of automatically inserting prefetches for indirect memory access.A complete indirect prefetching optimization pass is developed on the basis of the Shenwei GCC to solve this problem,and it uses a depthfirst search algorithm to find indirect memory references that refer to loop induction variables and generate appropriate software prefetches for them.In a set of memory-bound benchmark tests,the average speed-up ratio of the automatic prefetching pass on the SW1621 processor reaches 1.16 times.
作者 余龙龙 韩林 YU Long-Long;HAN Lin(Research Institute of Frontier Information Technology,Zhongyuan University of Technology,Zhengzhou 450007,China)
出处 《计算机系统应用》 2022年第8期203-211,共9页 Computer Systems & Applications
关键词 存储器访问 申威处理器 GCC 软件预取 不规则访存 memory access Shenwei processor GNU compiler collection(GCC) software prefetch irregular memory access
  • 相关文献

参考文献3

二级参考文献25

  • 1郭勇,尉红梅,漆锋滨.基于局部性分析数据预取在GCC上的实现[J].计算机工程与科学,2006,28(z2):21-23. 被引量:1
  • 2黄娟娟,李春江,徐颖.GCC中自动向量化代价模型剖析[C]//第17届计算机工程与工艺年会暨第三届微处理器技术论坛论文集.长沙:国防科技大学出版社,2013:259-268.
  • 3Hennessy J L,Patterson D A.Computer architecture:a quantitative approach[M].[S.l.]:Elsevier,2012.
  • 4Hwang S J,Park M S.Efficient software data prefetching for a loop with large arrays[C]//EURO-PAR’95 Parallel Processing.Berlin Heidelberg:Springer,1995:301-312.
  • 5Byna S,Chen Y,Sun X H.A taxonomy of data prefetching mechanisms[C]//International Symposium on Parallel Architectures,Algorithms,and Networks,I-SPAN 2008,2008:19-24.
  • 6Free Software Foundation,Inc.GCC,the GNU compiler collection[EB/OL].(2014-12-23).https://gcc.gnu.org/.
  • 7Free Software Foundation,Inc.GCC 4.9 release series[EB/OL].(2014-10-30).http://gcc.gnu.org/gcc-4.9/.
  • 8Smith A J.Sequential program prefetching in memory hierarchies[J].Computer,1978,11(12):7-21.
  • 9Callahan D,Kennedy K,Porterfield A.Software prefetching[J].ACM SIGOPS Operating Systems Review,1991,25:40-52.
  • 10Intel.Intel itanium architecture software developer’s manual,vol.3:instruction set reference[Z].Rev 2.3,May 2010.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部