期刊文献+

渐进式智能回溯向量化代码调优方法

Method of Progressive Intelligent Backtracking Vector Code Adjustment and Optimization
下载PDF
导出
摘要 为了充分发挥高性能计算机的计算能力,缓解程序员设计和编写并行程序的压力,扩充可用软件集合,设计并实现了利用交互界面深入挖掘程序中的可向量化语句,优化生成代码中的向量化语句,提高生成代码的执行效率。该方法对充分发挥高性能计算机的计算能力,增强系统可用性和扩展应用范围具有重要的意义,同时能够提供有效的辅助手段和工具支持。渐进式智能回溯向量化代码调优架构通过对用户提交的串行程序进行程序分析和变换,采用串行程序分析、数据依赖分析、向量化分析等技术手段,根据分析结果对程序进行变换和优化,自动生成最终的向量化代码。该方法通过分析串行程序中潜在的并行性,将其自动变换为等价的向量化代码形式,大大简化了程序员的工作。 In order to fully develop the computing ability of high-performance computer and relieve the pressure of designing and writing parallel programs for programmers,with the expansion of available software sets we designed and realized the vector program through the interactive interface.Using this method,we could optimize the generated vectorcode to improve the efficiency for the implementation of generated code.Our process is of great significance to express the ability of high-performance computing,enhance the availability and extend range of application.Furthermore,it can provide available supplementary means and tool support.The method of progressive intelligent backtracking vector code adjustment and optimization can automatically generate the parallel code after analysis and transformation of the serial code given by the user using the following methods such as serial code analysis,data dependence analysis and parallelization analysis,etc.The work of this article can greatly reduce the work of programmers by transforming the serial code into parallel code automatically with the analysis of parallelization in the serial code.
出处 《计算机科学》 CSCD 北大核心 2015年第1期50-53,58,共5页 Computer Science
基金 核高基国家科技重大专项(2009ZX01036)资助
关键词 渐进式 静态调优 动态调优 动静结合 Progressive Static adjustment Dynamic adjustment Static combined dynamic
  • 相关文献

参考文献6

二级参考文献47

  • 1Peleg A, Weiser U. MMX technology extension to the intel architecture[J]. IEEE Micro. August 1996, 16(4):42-50.
  • 2Intel Corporation. IA-32 Intel architecture software developer's manual [ Z ]. lntel Corporation, http://developer, intel, com, 1997.
  • 3Diefendorff K,Dubey P K,et al. Altivec extension to PowerPC accelerates media processing [ J ]. IEEE Micro, March-April 2000, 20(2) :85-95.
  • 4Martinez C,Pinnamaneni M,John E B. Multimedia workloads versus SPEC CPU2000 [C]. In 2006 SPEC Benchmark Workshop, January 2006 , The University of Texas at San Antonio.
  • 5Lee R B. Subword parallelism with MAX-2[J]. IEEE Micro, July/Aug, 1996.
  • 6Randy Allen, Ken Kennedy. Automatic translation of FORTRAN programs to vector form [ J ]. ACM Transactions on Programming Languages and Systems, October 1987, 9(4) :491-542.
  • 7Intel Corp. Intel C/C + + and Intel fortran compilers for Linux EB/OL ]. http ://www. intcl, com/software/products/compilers. 2007.
  • 8Martinez C, Pinnamaneni M, John E B. Multimedia workloads versus SPEC CPU2000 [ C]. In 2006 SPEC Benchmark Workshop, The University of Texas at San Antonio, January 2006 .
  • 9Samuel Larsen, Saman Amarasinghe. Exploiting superword level parallelism with multimedia instruction sets[ C]. In Proc. Of the ACM SIGPLAN Conference on Programming Language Design and Implementation, June 2000,145-156.
  • 10Shi Hui. Dynamic data regrouping[ D]. Key Laboratory of Computer System and Architecture, Chinese Academy of Sciences, June 2007.

共引文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部