期刊文献+

计算流体力学程序单核指令级优化方法

A Single-Core Instruction-Level Optimization Method for Computational Fluid Dynamics Programs
下载PDF
导出
摘要 针对目前大多数计算流体力学程序对系统的单核计算能力利用不足,提出一种针对计算流体力学程序的单核指令级优化方法。该方法首先分析程序的性能指标存在潜在的性能不足,根据分析结果进行优化;依据容器的存储特性和系统的访存特性,对程序的存储结构和访存顺序进行调整,以优化空间开销和访存性能;对CPU的流水机制进行分析,在循环和分支中消除指令的控制相关和数据相关从而达到减少流水中断率的目的;分析编译器对高级语言的处理特点并结合系统中的运行时栈在指令级作出分析,优化指令结构从而减少指令冗余和降低指令复杂度。实验结果表明,在TIANHE-1A超级计算机系统上进行测试,与优化前程序相比,优化后的程序执行时间约减少68.34%,空间消耗约减少55.43%。通过对程序性能各项指标进行分析的结果表明,程序在流水中断率、缓存命中率及机器指令数等性能指标上均有大幅地提升,该方法优化覆盖范围多于目前其他优化方法,有较好的优化效果,在计算流体力学程序优化研究中具有一定的借鉴价值。 A single-core instruction-level optimization method for computational fluid dynamics(CFD)programs is proposed to overcome the shortage of most current CFD programs in utilizing the single-core computing power of a system.The method first analyzes the performance indicators of a program to find potential performance deficiencies,and then optimizes them according to the analysis results.Then the memory structure and memory access sequence are adjusted according to the memory access characteristics of the system and the container to optimize memory access performance and space overhead.The pipeline mechanism of CPU is analyzed,and the control correlation and data correlation of instructions in loops and branches are eliminated to reduce the pipeline interruption rate.Both the characteristics of a compiler processing the high-level language and the runtime stack at the instruction level are analyzed to optimize the instruction structure and to reduce instruction redundancy and duplication.Experimental results show that the performance of the optimized program is greatly improved.Testings on TIANHE-1A supercomputer system show that the execution time of the program reduces by 68.34%and the space consumption reduces by 55.43%.Analyses show that the performance of the program is greatly improved in pipeline interruption rate,cache hit rate and number of machine instructions.It shows that the proposed method has more coverage than other existing optimization methods and better optimization effect,and has a good reference value.
作者 刘闯 何峰 肖兮 董小社 张兴军 LIU Chuang;HE Feng;XIAO Xi;DONG Xiaoshe;ZHANG Xingjun(Department of Computer Science and Technology,Xi'an Jiaotong University, Xi'an 710049,China)
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2018年第12期77-83,共7页 Journal of Xi'an Jiaotong University
基金 国家重点研发计划资助项目(2016YFB0200902)
关键词 流体力学程序 指令级优化 访存优化 流水优化 computational fluid dynamics program instruction-level optimization memory-access optimization pipeline optimization
  • 相关文献

参考文献5

二级参考文献37

  • 1曹小林,莫则尧,张景琳,陈其峰.基于“块-单元”数据结构的分子动力学并行计算[J].计算物理,2004,21(5):377-385. 被引量:14
  • 2曹小林,莫则尧.一种基于实测的高维动态负载平衡方法[J].计算机学报,2005,28(9):1440-1446. 被引量:4
  • 3金丽,包志华,陈海进.基于ARM嵌入式系统的C程序优化设计方法[J].南通大学学报(自然科学版),2006,5(3):61-64. 被引量:8
  • 4Fitch B G, Rayshubskiy A, Eleftheriou M, et al. Blue Matter: Strong Scaling of Molecular Dynamics on Blue Gene/L [R]. IBM Research Technical Report RC3688, 2005.
  • 5Kumar S, Huang G C, Kale L V. Achieving Strong Scaling with NAMD on Blue Gene/L[R]. PPL Technical Report 05- 13, University of Illinois at Urbana-Champaign, 2005.
  • 6Optimizing SAGE on the Intel Itanium 2 Processor-Based Platform[R]. Technical White Paper from Intel Solution Services, 2003.
  • 7Intel Corp. Introduction to Microarchitectural Optimization for Itanium 2 Processors [EB/OL]. [2009-06-12]. http:// www. intel. com/software/products/vtune/techtopic/software_optimization. pdf.
  • 8Intel Corp. Intel Itanium 2 Processor Reference Manual for Software Development and Optimization[EB/OL]. [2009-06-12]. http://www. intel. com/design/itanium2/manuals/ 251110. htm.
  • 9Cosentino G B.Computational Fluid Dynamics Analysis Success Stories of X-plane Design to Flight Test[R].NASA/TM-2008-214636.
  • 10Resch M M,Küster U.HPC Processor Technologies and Their Impact on Simulation[J].Computational Science and High Performance Computing Ⅳ,Notes on Numerical Fluid Mechanics and Multidisciplinary Design,2011,115:17-28.

共引文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部