期刊文献+

格点QCD基础求解器及其异构计算实现的性能优化

Performance Optimization of Lattice QCD Solver and Its Heterogeneous Computation
下载PDF
导出
摘要 格点量子色动力学(格点QCD)是研究夸克、胶子等微观粒子间相互作用的重要理论和方法.通过将时空离散化为四维结构网格,并将量子色动力学的基本场量定义在网格上,让研究人员可以使用数值模拟方法,从第一性原理出发研究强子间相互作用和性质,但这个过程中的计算量极大,需要进行大规模并行计算.格点QCD计算的核心基础为格点QCD求解器,是程序运行主要的计算热点模块.本文研究在国产异构计算平台下格点QCD求解器的实现与优化,提出一套格点QCD求解器的设计实现,实现了BiCGSTAB求解器,显著降低了迭代次数;通过对奇偶预处理技术,降低了所求问题的计算规模;针对国产异构加速卡的特点,优化了Dslash模块的访存操作.实验测试表明,相比优化前的求解器获得了约30倍的加速比,为国产异构超算下格点QCD软件性能优化提供了有益的参考价值. Lattice quantum chromodynamics(Lattice QCD)is an important theory and method to study the interaction between microscopic particles such as quarks and gluons.By discretizing the spacetime into a four-dimensional structural grid and defining the basic field quantity of QCD on the grid,researchers can use a numerical simulation method to study hadron interactions and properties from the first principle.However,the computation in this process is time-consuming,and large-scale parallel computing is required.The fundamental module of the Lattice QCD computation is the Lattice QCD solver which is the main hot spot of the program running.This work studies the realization and optimization of Lattice QCD solver from a domestic heterogeneous computing platform and proposes a design method of Lattice QCD solver,which realizes BiCGSTAB solver and significantly reduces the iteration numbers.With the odd/even preprocessing technology,the study reduces the computing scale of the problem and optimizes the Dslash module’s memory access in terms of the characteristics of a domestic heterogeneous accelerator.Experimental tests show that the speedup ratio of the solver is about 30 times higher than that of the unoptimized one,which provides a useful reference for the performance optimization of Lattice QCD software of domestic heterogeneous supercomputers.
作者 杨子江 张克龙 刘倩 徐顺 孙鹏 YANG Zi-Jiang;ZHANG Ke-Long;LIU Qian;XU Shun;SUN Peng(Computer Network Information Center,Chinese Academy of Sciences,Beijing 100190,China;University of Chinese Academy of Sciences,Beijing 100049,China;Nanjing Normal University,Nanjing 210023,China)
出处 《计算机系统应用》 2022年第11期358-364,共7页 Computer Systems & Applications
基金 中国科学院B类先导培育项目(XDPB25) 海光产业生态合作组织基金(ghfund202107011598)
关键词 格点量子色动力学 方程求解器 并行计算 异构计算 Lattice quantum chromodynamics(Lattice QCD) equation solver parallel computing heterogeneous computing
  • 相关文献

参考文献4

二级参考文献5

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部