期刊文献+

面向Open64的OpenMP程序优化

Open MP Program Optimization Based on Open64
下载PDF
导出
摘要 Open MP规范了一系列的编译制导、环境变量和运行库,具有简单、可移植、支持增量并行等优点.但同时,采用FORK-JOIN模型所引起的频繁的线程管理开销也是制约Open MP程序性能的瓶颈之一.本文讨论了如何利用并行区的合并与扩展,实现并行区的重构,并在此基础上利用Open64的IPA优化部件所提供的全局间过程分析能力,实现跨越过程边界的并行块的合并.最终实验表明,该方法有效地改进了Open MP程序的运行性能. Open MP regulates a series of compilation guidance, environment variables and runtime routines, having the advantages of simple operation, portability and supporting incremental parallel. But at the same time, the use of frequent thread management overhead FORK-JOIN model is one of the bottlenecks caused by Open MP program performance constraints.. This article discusses how to use the merge and extend parallel zone, reconstruct the parallel section, and on this basis, using global process analysis ability which is provided by IPA, one of the Open64 optimization components, realize the parallel block mergeing which acrosses process boundaries. The final experimental results show that the method improves the performance of Open MP programs effectively.
出处 《计算机系统应用》 2016年第1期154-159,共6页 Computer Systems & Applications
基金 "核高基"重大专项(2012ZX01034-001-001)
关键词 增量化并行 线程管理 IPA 并行区扩张 incremental parallelization thread management IPA parallel region expansion
  • 相关文献

参考文献10

  • 1Chapman B, Jost G, Van Der Pas R. Using OpenMP:Portable Shared Memory Parallel Programming. MIT Press, 2008.
  • 2Intel C, User C. Reference Guides. The Intel Compiler Homepage. http://software.intel.com/en-us/intel-compilers,2009.
  • 3Sato M, Satoh S, Kusano K, et al. Design of OpenMP compiler for an SMP cluster. Proc. of the 1st European Workshop on OpenMP. 1999. 32-39.
  • 4Dimakopoulos VV, Georgopoulos A. The OMPi OpenMP/C Compiler. Proc. PCI2005, 10th Panhellenic Conference on Informatics. Volos, Greece. 2005. 153-162.
  • 5Balart J, Duran A, Gonzàlez M, et al. Nanos mercurium:A research compiler for openmp. Proc. of the European Workshop on OpenMP. 2004, 8.
  • 6Murphy M. NVIDIA\'s experience with Open64. Open64 Workshop at CGO. 2008, 8.
  • 7Lattner C, Adve V. LLVM:A compilation framework for lifelong program analysis & transformation. International Symposium on Code Generation and Optimization(CGO). IEEE. 2004. 75-86.
  • 8Darema F. The spmd model:Past, present and future. Recent Advances in Parallel Virtual Machine and Message Passing Interface. Springer Berlin Heidelberg. 2001. 1-1.
  • 9Zhu W, Del Cuvillo J, Gao GR. Performance characteristics of OpenMP language constructs on a many-core-on-a-chip architecture. OpenMP Shared Memory Parallel Programming. Springer Berlin Heidelberg, 2008:230-241.
  • 10Van der Wijngaart RF, Wong P. NAS parallel benchmarks version 2.4[Technical Report]. NAS, NAS-02-007, 2002.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部