期刊文献+

一种基于大同步并行编程模式的N体问题的优化实现

An Optimized Implementation of N-Body Problem Based on Maximal Synchronous Parallel Programming Model
原文传递
导出
摘要 文章基于多核机群系统对并行编程模型进行了深入研究,实现了多层次并行体系结构的Open MP/MPI混合编程模型的设计。在以SMP机群系统为背景的情况下,实现其节点间和节点内的分层,运用多层次的并行编程模型进行实验与分析。同时对多层次并行编程模型的性能进行深入的研究,提出了一种大同步混合设计新思路。设计了N-Body问题的大同步优化并行算法,并在曙光TC 5000A机群上与传统的并行算法作了性能方面的比较。通过理论研究并结合大量的实验分析统计,得到了多核机群的混合并行编程模型的性能优化的诸多结论。 This paper discusses hybrid programming paradigm and different implementation for the multi-core cluster, and designs an Open MP/MPI hybrid programming model. Considering the SMP cluster system, we design a hierarchical inter-nodes and hierarchical intra-nodes, and conduct a lot of experiments and analysis. The performance of parallel programming model for multi levels is studied, and a new hybrid design idea of maximal synchronous method is proposed. We design a maximal synchronous optimization parallel algorithm on the N-Body problem, and compare its performance with traditional hybrid parallel algorithms on the Dawning 5000 A cluster. Through theoretical research combined with statistical analysis on the basis of large number of experiments, we obtain many conclusions of performance optimization on multi-core cluster hierarchical parallel programming model.
出处 《电子技术(上海)》 2015年第2期28-32,共5页 Electronic Technology
基金 山东省自然科学基金(ZR2013FL015)山东省研究
关键词 N-BODY MPI OPENMP 大同步算法 性能优化 N-Body MPI Open MP maximal synchronous programming performance optimization
  • 相关文献

参考文献9

  • 1白秀秀,董小社,刘超,曹海军,李亮.面向异构多核架构的自适应编译框架[J].计算机学报,2014,37(7):1548-1559. 被引量:2
  • 2Voronin K V. A numerical study of an MPI/OpenMP implementation based on asynchronous threads for a three-dimensional splitting scheme in heat transfer problems[J].Joumal of Applied and Industrial Mathematics,2014,8(3):436-443.
  • 3刘志强,宋君强,卢风顺,赵娟.基于线程的MPI通信加速器技术研究[J].计算机学报,2011,34(1):154-164. 被引量:11
  • 4Perla F, Zanetti R Performance analysis of an hybrid OpenMP/MPI ALM software for life insurance policies on multi-core architectures[C].8th International Workshop on OpenMP,2012: 250-253.
  • 5江洁,凌思睿.一种投票式并行RANSAC算法及其FPGA实现[J].电子与信息学报,2014,36(5):1145-1150. 被引量:4
  • 6Tsuji M, Sato M. Performance evaluation of OpenMP and MPI hybrid programs on a large scale multi-care multi-socket cluster, T2K Open Supercomputer[C]. 2009 International Conference on Parallel Processing Workshops, 2009.
  • 7Miki Y, Takahashi D, Morid M, et al. Highly scalable implementation of an N-body code on a GPU cluster[J].Computer Physics Communications,2013(184):2159-2168.
  • 8Capuzzo-Dolcetta R, Spera M, Punzo D. A fully parallel, high precision, N-body code running on hybrid computing platforms[J].Journal of Computational Physics, 2013(236): 580-593.
  • 9祝永志,张丹丹,曹宝香,禹继国.基于SMP机群的层次化并行编程技术的研究[J].电子学报,2012,40(11):2206-2210. 被引量:9

二级参考文献52

  • 1陈付幸,王润生.一种新的消失点检测算法[J].电子与信息学报,2006,28(8):1458-1462. 被引量:8
  • 2孙红伟.二项分布两种近似计算的讨论[J].河南教育学院学报(自然科学版),2007,16(1):28-29. 被引量:2
  • 3Chai L, Gao Q, Panda D K. Understanding the impact of multi core architecture in cluster computing: A case study with InteI Dual Core system//Proceedings of the CCGrid'07. Rio de Janeiro, Brazil, 2007:471 -478.
  • 4Tang H, Shen K, Yang T. Program transformation and runtime support for threaded MPI execution on shared memory machines. ACM Transactions on Programming Languages and Systems, 2000, 22(4): 673- 700.
  • 5Demaine E D. A threads only MPI implementation for the development of parallel programs//Proceedings of the Ilth In ternational Symposium on High Performance Computing Sys terns. Winnipeg, Manitoba, Canada, 1997:153-163.
  • 6Prakash S, Bagrodia R. MPI -SIM: Using parallel simulation to evaluate MPI programs//Proceedings of the Winter Simula tion. Los Aamitos, CA, USA, 1998:467- 474.
  • 7Saini S, Naraikin A et al. Early performance evaluation of a Nehalem" cluster using scientific and engineering applications//Proceedings of the SC'09. New York, USA, 2009, Article 21,12 pages.
  • 8Diaz Martin J C, Rico Gallego J A et al. An MPI -1 corn pliant thread based implementation//Proceedings o{ the EuroPVM/ MP1 2009. Berlin, Heidelberg, 2009:327- 328.
  • 9Sade Y, Sagiv S, Shaham R. Optimizing C multithreaded memory management using thread local storage//Proceedings of the CC'05. Berlin, Heidelberg, 2005:137-155.
  • 10Jin H W, Sur S, Chai L, Panda D K. LiMIC: Support for high-performance MPI Intra Node communication on Linux cluster//Proceedings of the ICPP'05. Washington, DC,USA, 2005, 184- 191.

共引文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部