动态网格的DSMC方法在GPU上的并行

GPU Based Parallel Method for Dynamic Collision Grid DSMC

下载PDF

导出

摘要直接模拟蒙特卡罗方法(direct simulation Monte Carlo,DSMC)是稀薄气体动力学领域的重要工具。然而,DSMC方法有两个比较主要的缺点:一是复杂的网格处理;另一个是庞大的计算量。使用动态网格的DSMC方法可以根据流场信息,动态生成自适应的碰撞网格,能有效解决前一个缺点;针对后一个缺点,使用统一计算架构(compute unified device architecture,CUDA)编写并行程序,将基于动态网格的DSMC方法移植到图形处理器(graphic processing unit,GPU)上以减少计算时间。在并行实现中,GPU负责绝大部分的计算,而CPU只负责初始化、结果输出等少量工作。使用一个二维超音速横掠平板问题作为算例,验证了并行程序的正确性。对于不同规模的算例,在NVIDIA Fermi C2050之上均获得了10倍以上的加速比;对于相同算例,NVIDIA最新发布的Kepler K20上的速度约为FermiC2050上的1.3～1.6倍。 The direct simulation Monte Carlo （DSMC） method is a powerful computational tool in the field of rarefied gas dynamics. However, there are two main shortages in DSMC method： one is complex gridding processing, the other is large time consumption. The dynamic collision grid DSMC method generates collision grids adaptively according to the flowfield, which overcomes the first shortage. For the other shortage, using compute unified device architecture （CUDA） to write parallel program, the dynamic collision grid DSMC method is ported to graphic pro- cessing unit （GPU） to reduce computing time. During the parallel implementation, the main computation is per- formed on GPU while CPU only deals with the processes of initialization and output. A two-dimensional benchmark problem in different sizes is used to demonstrate the correctness of the parallelization. The results show that 10 times speedup is achieved based on NVIDIA Fermi C2050. For a same case, the performance on NVIDIA newly released Kepler K20 is 1.3-1.6 times higher than that on Fermi C2050.

作者文敏华林新华 Simon Chong Wee See

机构地区上海交通大学高性能计算中心 NVIDIA Corporation

出处《计算机科学与探索》 CSCD 2013年第5期472-479,共8页 Journal of Frontiers of Computer Science and Technology

基金 NVIDIA公司资助项目~~

关键词统一计算架构(CUDA) 图形处理器(GPU) 直接模拟蒙特卡罗方法(DSMC) 动态网格DSMC 并行模拟 compute unified device architecture （CUDA） graphic processing unit （GPU） direct simulation Monte Carlo （DSMC） dynamic collision grid DSMC parallel simulation

分类号 TP39 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1陈颖骝,刘洪,李明禄.动态碰撞网格的DSMC及其并行化[J].计算机应用与软件,2009,26(10):39-42. 被引量：1
2张伟,姜恺,刘洪.直接模拟Monte Carlo方法并行化研究[J].计算机应用与软件,2009,26(9):1-3. 被引量：1

二级参考文献10

1Bird G A. Monte Carlo simulation of gas flows [ J ]. Ann. Rev. Fluid Mech,1978(10) :11 -31.
2Foster I. Designing and Building Parallel Programs [ EB/OL ]. http :// www-unix. mcs. anl. gov/dbpp/.
3BIRD G A. Molecular Gas Dynamics [ M ]. Oxford:Clarendon Press, 1976.
4BIRD G A. Molecular Gas Dynamics and the direct simulation of gas flow [ M ]. Oxford : Clarendon Press, 1994.
5BIRD G A. Application of the DSMC method to the full shuttle geometry[ R]. AIAA -90 - 1692.
6LAUX M. Optimization and parallelization of the DSMC method on unstructured grids [ R]. AIAA -97 -2512.
7KIM M G, KIM H S. A parallel cell based DSMC method with dynamic load balancing using unstructured Adaptive meshes [ R]. AIAA 2003 - 1033.
8姜恺黄良大.直接模拟蒙特卡洛(DSMC)方法的并行化.高性能计算发展与应用,2005,(2).
9Olson S E, Christlieb A J. Gridless DSMC [ J ]. Journal of Computational Physics ,2008,227 ( 17 ).
10BIRD G A. Sophisticated DSMC [ R/OL]. http://www.gab. com. au/ Resources/DSMC07 notes. pdf.

1张伟,姜恺,刘洪.直接模拟Monte Carlo方法并行化研究[J].计算机应用与软件,2009,26(9):1-3. 被引量：1
2董蕾,黄方,卜栓栓,冯杰,周纪.基于CUDA的压缩感知重构算法并行化研究[J].信息技术,2016,40(4):32-36. 被引量：1
3陈颖骝,刘洪,李明禄.动态碰撞网格的DSMC及其并行化[J].计算机应用与软件,2009,26(10):39-42. 被引量：1
4陈彬,陈和平,李晓卉.基于GPU的高效图像协方差矩阵算法与实现[J].计算机工程与设计,2014,35(12):4238-4242. 被引量：2
5杨则正.防止病毒的综合技术手段[J].管理观察,1995,0(4):51-51.
6傅游,花嵘,康继昌.DSMC交互式并行化系统性能预测模型[J].山东科技大学学报（自然科学版）,2005,24(3):65-68.
7傅游,花嵘,康继昌.DSMC并行仿真中的迁移相关分析方法[J].微电子学与计算机,2007,24(5):175-178. 被引量：1
8黄飞虎,兰时勇,吴健,刘东辉.基于CUDA平台的实时去雾[J].计算机应用,2013,33(A02):183-186. 被引量：3
9傅游,花嵘.稀薄气体直接仿真蒙特卡洛方法交互式并行化系统研究与实现[J].山东科技大学学报（自然科学版）,2009,28(5):75-80.
10He Bijiao,He Xiaoying,Zhang Mingxing,Cai Guobiao.Plume aerodynamic effects of cushion engine in lunar landing[J].Chinese Journal of Aeronautics,2013,26(2):269-278. 被引量：9

计算机科学与探索

2013年第5期

浏览历史

内容加载中请稍等...

动态网格的DSMC方法在GPU上的并行

参考文献2

二级参考文献10

相关作者

相关机构

相关主题

浏览历史