期刊文献+

异构系统结构力学计算GPU加速性能分析与应用

Analysis and application of structural mechanical simulation accelerated by GPU with heterogeneous architecture
下载PDF
导出
摘要 高性能计算领域利用GPU加速计算已逐渐发展成为主流应用的普遍功能之一。主流结构力学应用ABAQUS支持GPU通用计算,充分发挥GPU的高性能浮点运算能力与访存带宽,提高软件求解效率。介绍了ABAQUS软件对GPU加速应用的发展历程,在上海超级计算中心三种不同的异构环境下,针对实际算例的GPU加速性能进行了测试,建立计算效率分析方法,分析GPU加速对求解时间、系统资源调用、软件并行效率的影响,并针对采用隐式方法求解百万量级自由度问题时资源的合理使用提出建议。 General computation accelerated by Graphic Processing Unit (GPU) has become popular in today's High Performance Computing (HPC) area. As one of the leading software solving structural mechanical problems, ABAQUS now offers general purpose GPU (GPGPU) simulation module. Benefited from high floating point computational ability and memory bandwidth, GPU is able to accelerate Computer Aided Engineering (CAE) simulation in ABAQUS. History of GPGPU simulation was firstly introduced, followed by software test regarding to engineering cases with three different heterogeneous hardware systems provided in Shanghai Supereomputer Center. The influence on solver time, system resource usage, and parallel efficiency by GPU acceleration was addressed, while suggestions on solving multi-million degree-of-freedom cases by implicit solver were made.
出处 《计算机应用》 CSCD 北大核心 2014年第A01期78-81,共4页 journal of Computer Applications
基金 国家863计划项目(2012AA01A308)
关键词 计算机辅助工程 结构力学 ABAQUS 图形处理器加速 高性能计算 Computer-Aided Engineering (CAE) structural dynamics ABAQUS GPU acceleration High PerformanceComputing (HPC)
  • 相关文献

参考文献12

  • 1吴恩华.图形处理器用于通用计算的技术、现状及其挑战[J].软件学报,2004,15(10):1493-1504. 被引量:141
  • 2GEOGESCU S, CHOW P. GPU accelerated CAE using open solvers and the cloud [ J]. ACM SIGARCH Computer Architecture News, 2011,39(4) : 14 - 19.
  • 3THIBAULT J C, SENOCAK I. Accelerating incompressible flow computations with a pthreads - CUDA implementation on small-foot- print multi-GPU platforms [ J]. The Journal of Supercomputing, 2012,59(2) :693 -719.
  • 4徐新海,林宇斐,易伟.CPU-GPGPU异构体系结构相关技术综述[J].计算机工程与科学,2009,31(A01):24-26. 被引量:10
  • 5BELL N, GARLAND M. Implementing a sparse matrix-vector multi- plication on throughput oriented processors [ C]// SC'09: Proceed- ings of the Conference on High Performance Computing Networking, Storage and Analysis. New York: ACM, 2009: 18.
  • 6GLASKOWSKY P N. NVIDIAs Fermi: The first complete GPU computing architecture[ R]. [ S. l. ] : NVIDIA Corporation, 2009:1 -26.
  • 7CHOI J W, SINGH A, VUDUC R W. Model driven autotuning of sparse matrix-vector multiply on GPUs [ C]// PPoPP 2010: Pro- ceedings of the 15th ACM SIGPLAN Sympesium on Principles and Practice of Parallel Programming. New York: ACM, 2010: 115- 126.
  • 8DIMITROV M, MANTOR M, ZHOU H. Understanding software ap- proaches for GPGPU reliability [ C]//GPGPU-2: Proceedings of the 2nd Workshop on General Purpose Processing on Graphics Process- ing Units. New York: ACM, 2009: 94-104.
  • 9LUCAS R, WAGENBRETH G, DAVIS D. Implementing a GPU- enhanced cluster fur large scale simulations [ C]// I/ITSC 2007: The Interservice/Industry Training, Simulation & Education Confer- ence. [S. l.]: National Defense Industrial Association, 2007: 7437.
  • 10GUNEY M E. High-performance direct solution of finite-element problems on multi-core processors [ D]. Atlanta: Georgia Institute of Technology, 2010.

二级参考文献12

  • 1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:226
  • 2Luebke D, Harris M, Krger J,et al. GPGPU: General Purpose Computation on Graphics Hardware[C]//Proc of ACM SIGGRAPH '04,2004.
  • 3AMD CorporatiorL ATI Stream Computing User Guide 1.4. 0a[EB/OL]. [2009-05-071. http://developer, amd. com/gpu _assets/Stream_Computing_User_Guide. pdf.
  • 4Buck I. Brook Spec v0. 2[R]. Technical Report, Stanford University, 2003.
  • 5NVIDIA Corporation. NVIDIA CUDA Compute Unified Device Architecture Programming Guide [EB/OL]. [2007-06- 23]. developer, download, nvidia, com/compute/cuda/1 0/ NV1DIA_CUDA_Programming_Guide_1.0. pdf.
  • 6Lee S, Min S J, Eigenmann R. Openmp to GPGPU: a Compiler Framework for Automatic Translation and Optimization [C]//Proc of the 14th ACM SIGPLAN Syrnp on Principles and Practice of Parallel Programming, 2008:101-110.
  • 7Han T D, Abdelrahman T S. hiCUDA: a High-level Directive-Based Language for GPU Programming[C]//Proc of the 2nd Workshop on General Purpose Processing on Graphics Processing Units, 2009 : 52-61.
  • 8Wang N, Patel S. ReStore: Symptom Based Soft Error Detection in Microprocessors[C]//Proc of DSN, 2005.
  • 9George N, Laeh J, Gurumurthi S. Towards Transient Fault Tolerance for Heterogeneous Computing Platforms [C]//Workshop on Compiler and Architectural Techniques for Application Reliability and Security, 2008.
  • 10Dimitrov M, Mantor M, Zhou H. Understanding Software Approaches for GPGPU Reliability [C]//Proc of the 2nd Workshop on General Purpose Processing on Graphics Processing Units, 2009:94-104.

共引文献149

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部