期刊文献+

基于非结构网格隐式算法的GPU加速研究 被引量:1

Research on GPU Acceleration of Implicit Schemes Based on Unstructured Grids
下载PDF
导出
摘要 针对非结构网格隐式算法在GPU上的加速效果不佳的问题,通过分析GPU的架构及并行模式,研究并实现了基于非结构网格格点格式的隐式LU-SGS算法的GPU并行加速.通过采用RCM和Metis网格重排序(重组)方法,优化非结构网格的数据局部性,改善非结构网格的隐式算法在GPU上的并行加速效果.通过三维机翼算例验证了本文实现的正确性及效率.结果表明两种网格重排序(重组)方法分别得到了63%和69%的加速效果提高.优化后的LU-SGS隐式GPU并行算法获得了相较于CPU串行算法27倍的加速比,充分说明了本文方法的高效性. With regard to the poor acceleration performance on GPU using the unstructured grids implicit method, this study realizes the GPU acceleration of LU-SGS implicit method based on unstructured grids with the cell-vertex scheme.With introduce the architecture of a GPU and its parallelization method, two grid reordering methods are set forth based on RCM and METIS, to improve data locality of unstructured grids and to improve acceleration performance on GPU using the unstructured grids implicit method. The ONERA M6 Wing test case is carried out to verify and validate this implementation. With two grid reordering methods, the GPU implementations achieve 63% and 69% improvements respectively. The GPU implementation obtains a speedup of 27 times compared to the CPU version running on a single core. It indicates that the proposed GPU implementation has a solid performance.
作者 陈龙 徐添豪 田书玲 CHEN Long;XU Tian-Hao;TIAN Shu-Ling(College of Aerospace Engineering, Nanj ing University of Aeronautics and Astronautics, Nanjing 210016, China)
出处 《计算机系统应用》 2018年第5期238-243,共6页 Computer Systems & Applications
基金 江苏高校优势学科建设工程资助项目
关键词 GPU加速 并行计算 网格排序 计算流体力学 隐式格式 GPU acceleration parallel computing grid reordering computational fluid dynamics implicit schemes
  • 相关文献

参考文献2

二级参考文献14

  • 1Lindholm E,Nickolls J,Oberman S,Montrym J.Nvidia Tesla:A United Graphics and Computing Architecture. .
  • 2Buck I,Foley T,Horn D,Sugerman J,Fatahalian K,HoustonM,Hanrahan P.Brook for GPUs:Stream Computing on Graphics Hardware. SIGGRAPH 2004 .
  • 3http://www.opencl.org/ . 2010
  • 4Brandvik T,Pullan G.An Accelerated 3D Navier-StokesSolver for Flows in Turbomachines. Proceedings of GT2009ASME Turbo Expo 2009:Power for Land,Sea and Air . 2009
  • 5Andrew C,Fernando C,Rainald L,John W. 19th AIAAComputational Fluid Dynamics .
  • 6Jonathan MC,M.Jeroen M.A Fast Double Precision CFDCode using CUDA. .
  • 7http://ati.amd.com/technology/streamcomputing/op encl.html .
  • 8NVIDIA CUDA Compute Unified DeviceArchitecture Programming Guide. http://www.developer.download.nvidia.com .
  • 9Zhang DL.Computational Fluid Dynamics. . 2008
  • 10Elsen E,LeGresley P,Darve E.Large calculation of the flowover a hypersonic vehicle using a GPU. Journal ofComputational Physics . 2008

共引文献17

同被引文献8

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部