Implementation of a Particle Accelerator Beam Dynamics Code on Multi-Node GPUs

Abstract: Particle accelerators play an important role in a wide range of scientific discoveries and industrial applications. The self-consistent multi-particle simulation based on the particle-in-cell (PIC) method has been used to study charged particle beam dynamics inside those accelerators. However, the PIC simulation is time-consuming and needs to use modern parallel computers for high-resolution applications. In this paper, we implemented a parallel beam dynamics PIC code on multi-node hybrid architecture computers with multiple Graphics Processing Units (GPUs). We used two methods to parallelize the PIC code on multiple GPUs and observed that the replication method is a better choice for moderate problem size and current computer hardware while the domain decomposition method might be a better choice for large problem size and more advanced computer hardware that allows direct communications among multiple GPUs. Using the multi-node hybrid architectures at Oak Ridge Leadership Computing Facility (OLCF), the optimized GPU PIC code achieves a reasonable parallel performance and scales up to 64 GPUs with 16 million particles.
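The abstract names the two parallelization strategies without describing their implementation. As a rough illustration only, the sketch below contrasts them for the charge-deposition step of a PIC code. It is a 1D, CPU-only NumPy toy and not the authors' GPU code: the function names, the nearest-grid-point weighting, and the use of list entries to stand in for devices are all assumptions made for this example.

```python
import numpy as np

def deposit(positions, n_cells, length):
    """Nearest-grid-point deposition of unit charges onto a 1D periodic grid."""
    cells = np.floor(positions / length * n_cells).astype(int) % n_cells
    return np.bincount(cells, minlength=n_cells).astype(float)

def replicated_deposit(positions, n_cells, length, n_devices):
    """Replication (assumed form): each device holds the full grid and a
    share of the particles; the local grids are then summed."""
    chunks = np.array_split(positions, n_devices)
    local_grids = [deposit(c, n_cells, length) for c in chunks]
    # Stand-in for a global sum reduction across devices.
    return np.sum(local_grids, axis=0)

def decomposed_deposit(positions, n_cells, length, n_devices):
    """Domain decomposition (assumed form): each device owns a contiguous
    slab of cells and deposits only the particles that fall inside it."""
    cells = np.floor(positions / length * n_cells).astype(int) % n_cells
    bounds = np.linspace(0, n_cells, n_devices + 1).astype(int)
    slabs = []
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        # Stand-in for routing particles to the device that owns their cell.
        owned = cells[(cells >= lo) & (cells < hi)]
        slabs.append(np.bincount(owned - lo, minlength=hi - lo).astype(float))
    return np.concatenate(slabs)

rng = np.random.default_rng(0)
pos = rng.uniform(0.0, 1.0, size=1000)
rho_rep = replicated_deposit(pos, n_cells=32, length=1.0, n_devices=4)
rho_dec = decomposed_deposit(pos, n_cells=32, length=1.0, n_devices=4)
print(np.allclose(rho_rep, rho_dec))
```

Both routines produce the same charge density; they differ in what has to be communicated. In this toy picture, replication needs a reduction over the whole grid but never moves particles, while decomposition keeps grid data local but must exchange particles between owners, which is in line with the abstract's remark that domain decomposition may pay off on hardware that allows direct communication among GPUs.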

Authors: Zhicong Liu, Ji Qiang

Affiliations: Lawrence Berkeley National Laboratory; Key Laboratory of Particle Acceleration Physics and Technology

Source: Journal of Software Engineering and Applications, 2019, Issue 9, pp. 321-338 (18 pages)

Keywords: Particle Accelerator; Particle-In-Cell; GPU; Parallel Beam Dynamics Simulation

Classification number: R73 [Medicine and Health: Oncology]

