期刊文献+

分簇结构高效谓词机制及编译优化方法

Effective predicate mechanism for clustering and its compiling optimization method
下载PDF
导出
摘要 谓词执行是在控制流存在的条件下可以有效挖掘指令级并行性的硬件机制,而在分簇结构上实现谓词机制,可以提高分簇结构上条件的执行效率。针对分簇结构展开谓词体系结构的研究,提出了分簇结构部分谓词的高效实现方法,以及基于循环展开的分簇结构部分谓词支持框架。实验表明,提出的分簇结构部分谓词及编译框架可以很好地提高条件执行程序的执行效率,部分典型算法核心的加速比可以达到350%500%。 Predicate execution is a valid method to develop instruction parallelism in the programs with control flow.And implementation of predicate for clustering can improve efficiency of conditional statement.Researching on predicate mechanism for clustering,this paper presented an efficient partial predicate mechanism for clustering,and proposed a compiling implementation framework based on loop unrolling for partial predicate.Experiments show that the predicate implementation framework can better enhance executive efficiency of programs with conditional statement.The speed for some classic algorithms kernels can reach 350%~500%.
作者 王向前 洪一 郑启龙 Wang Xiangqian;Hong Yi;Zheng Qilong(No.38 Research Institute,China Electronics Technology Group Corporation,Hefei 230088,China;School of Computer&Information,Hefei University of Technology,Hefei 230009,China;School of Computer Science&Technology,University of Science&Technology of China,Hefei 230027,China)
出处 《计算机应用研究》 CSCD 北大核心 2018年第1期126-129,共4页 Application Research of Computers
基金 国家重大专项资助项目(2012ZX01034001-001)
关键词 谓词 部分谓词 分簇结构 循环展开 predicate partial predicate clustering loop unrolling
  • 相关文献

参考文献3

二级参考文献19

  • 1黄元峰,黄学达.TMS320VC5416器件的G.722.2的研究与实现[J].电子技术应用,2007,33(7):46-47. 被引量:1
  • 2Alexandre E Eichenberger, Edward S Davidson. Register allocation for predicated code [C]. The 28th Annual Int'l Symp on Microarchitecture, Ann Arbor, Michigan, USA, 1995
  • 3David M Gillies, Dz-ching Roy Ju, Richard Johnson, et al.Global predicate analysis and its application to register allocation[ C]. The 29th Annual Int' l Symp on Microarchitecture, Pairs,1996
  • 4John W Sias, Wen-mei W Hwu, David I August. Accurate and efficient predicate analysis with binary decision diagrams [C].The 33rd Annual Int'l Symp on Microarchitecture, Haifa,Israel, 2000
  • 5J C Park, M S Schlansker. On predicated execution [R].Hewlett Packard Laboratories. Tech Rep: HPL-91-58, 1991
  • 6L Cater, B Simon, B Calder, et al. Predicated static single assignment [C]. Int'l Conf on Parallel Architectures and Compilation Techniques, Newport Beach, California, 1999
  • 7Preston Briggs. Register allocation via graph coloring: [ Ph D dissertation] [ D]. Houston, TX : Rice University, 1992
  • 8S A Mahlke, D C Lin, W Y Chen, et al. Effective compiler support for predicated execution using the hyperblock [C]. The 25th Int'l Symposium on Micro Architecture, Portland,Oregon, 1992
  • 9M Guthaus, J Ringenberg, T Austin, et al. Mibench: A free, commercially representative embedded benchmark suite [C]. IEEE 4th Annual Workshop on Workload Characterization Austin, TX, 2001
  • 10Hardware reference manual for Ezalm 5900 [EB/OL]. [2014 - 11 - 13]. http://www, onsemi, cn/PowerSolutions/content, do?id = 16488.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部