OpenMC: Towards Simplifying Programming for TianHe Supercomputers

Abstract: Modern petascale and future exascale systems are massively heterogeneous architectures. Developing productive intra-node programming models is crucial toward addressing their programming challenge. We introduce a directive-based intra-node programming model, OpenMC, and show that this new model can achieve ease of programming, high performance, and the degree of portability desired for heterogeneous nodes, especially those in TianHe supercomputers. While existing models are geared towards offloading computations to accelerators (typically one), OpenMC aims to more uniformly and adequately exploit the potential offered by multiple CPUs and accelerators in a compute node. OpenMC achieves this by providing a unified abstraction of hardware resources as workers and facilitating the exploitation of asynchronous task parallelism on the workers. We present an overview of OpenMC, a prototyping implementation, and results from some initial comparisons with OpenMP and hand-written code in developing six applications on two types of nodes from TianHe supercomputers.
Source: Journal of Computer Science & Technology (计算机科学技术学报, English edition), indexed in SCIE, EI, CSCD; 2014, Issue 3, pp. 532-546 (15 pages)
Funding: Supported by the National High Technology Research and Development 863 Program of China under Grant No. 2012AA01A301 and the National Natural Science Foundation of China under Grant No. 61170049
Keywords: supercomputer, programming model, heterogeneous, MIC