Accelerating the discontinuous Galerkin method for seismic wave propagation simulations using multiple GPUs with CUDA and MPI (cited by: 3)

Abstract: We have ported an arbitrary high-order discontinuous Galerkin method for solving the three-dimensional isotropic elastic wave equation on unstructured tetrahedral meshes to multiple Graphics Processing Units (GPUs) using NVIDIA's Compute Unified Device Architecture (CUDA) and the Message Passing Interface (MPI). We obtained a speedup factor of about 28.3 for the single-precision version of our code and about 14.9 for the double-precision version. The GPU used in the comparisons is an NVIDIA Tesla C2070 (Fermi), and the CPU is an Intel Xeon W5660. To overlap inter-process communication with computation effectively, we separate the elements on each subdomain into inner and outer elements, complete the computation on the outer elements, and fill the MPI buffer first. While the MPI messages travel across the network, the GPU performs the computation on the inner elements and all other calculations that do not use information about outer elements from neighboring subdomains. A significant portion of the speedup also comes from a customized matrix-matrix multiplication kernel, which is used extensively throughout our program. Preliminary performance analysis of our parallel GPU code shows favorable strong and weak scalability.
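The inner/outer split described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give their data structures, so the names `split_inner_outer`, `owner`, `neighbors`, and `STEP_ORDER` are hypothetical. The sketch assumes that an element counts as "outer" when at least one of its face neighbors belongs to a different subdomain, and it uses a toy one-dimensional chain of elements rather than a tetrahedral mesh.

```python
from typing import Dict, List, Sequence, Tuple


def split_inner_outer(
    owner: Sequence[int],
    neighbors: Dict[int, List[int]],
    rank: int,
) -> Tuple[List[int], List[int]]:
    """Split the elements owned by `rank` into (inner, outer) lists.

    Assumption: an element is "outer" if any face neighbor is owned by a
    different subdomain; otherwise it is "inner".
    """
    inner: List[int] = []
    outer: List[int] = []
    for elem, elem_owner in enumerate(owner):
        if elem_owner != rank:
            continue  # element belongs to another subdomain
        touches_other = any(owner[n] != rank for n in neighbors.get(elem, []))
        (outer if touches_other else inner).append(elem)
    return inner, outer


# Order of work in one time step. The first three entries follow the
# abstract; the last entry is an assumption about how the step is closed.
STEP_ORDER = [
    "compute outer elements",
    "fill MPI buffer and start the exchange",
    "compute inner elements while messages are in flight",
    "wait for the exchange, then finish neighbor-dependent work (assumed)",
]

# Toy 1-D chain of six elements, 0-1-2 | 3-4-5, split across two subdomains.
owner = [0, 0, 0, 1, 1, 1]
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}

inner0, outer0 = split_inner_outer(owner, neighbors, rank=0)
inner1, outer1 = split_inner_outer(owner, neighbors, rank=1)
print("rank 0:", inner0, outer0)
print("rank 1:", inner1, outer1)
```

In this toy case only elements 2 and 3 touch the subdomain boundary, so they are the only ones whose results must be computed before the MPI buffer can be filled. The remaining elements are the work that is available to hide the communication time.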

Authors: Dawei Mu, Po Chen, Liqiang Wang

Affiliations: Department of Geology and Geophysics; Computer Science Department

Source: Earthquake Science (地震学报（英文版）), 2013, Issue 6, pp. 377-393 (17 pages)

Funding: Supported by the School of Energy Resources at the University of Wyoming. The GPU hardware used in this study was purchased using NSF Grant EAR-0930040.

Keywords: Seismic wave propagation; Discontinuous Galerkin method; GPU

Classification number: P315 [Astronomy and Earth Sciences: Seismology]
