期刊文献+

基于值-剖面的OpenMP运行时优化系统

An OpenMP Runtime Optimization System Based on Value-Profiling
下载PDF
导出
摘要 本文设计并实现了一个基于值-剖面的Open MP运行时优化系统CCRG OpenMP。它能够根据常见的值的组合优化并行区域,并且在运行时只有并行区代码需要重编译和管理。CCRG OpenMP基于动态重编译技术,避免了目前静态多版本技术的不足。同时,值-剖面的收集和分析由独立的动态优化器线程完成,降低了动态重编译引入的开销。SPEC OMP2001基准测试表明,我们基于值-剖面的Open MP优化系统能够较大地提高程序性能。 This paper presents the design and implementation of an OpenMP runtime optimization system based on valueprofiling, called CCRG OpenMP, where the parallel regions of OpenMP programs can be well optimized based on the common combinations of values. Moreover, only the code within the parallel regions needs to be re-compiled and managed at runtime. A distinguished feature of CCRG OpenMP is that a dynamic re-compile technique is adopted to overcome the shortcomings of the static muhi-versional approach. A separate dynamic optimizer is developed to collect and analyze value-profiles for reducing the overhead introduced by re-compile. The SPEC OpenMP benchmarks show the presented value-profile based optimization for OpenMP can improve the performance significantly.
作者 黄春 杨学军
出处 《计算机工程与科学》 CSCD 2006年第12期124-128,共5页 Computer Engineering & Science
基金 国家863计划资助项目(2004AA1Z2210 2002AA1Z2101)
关键词 OPENMP 值-剖面 运行时优化 OpenMP value-profile runtime optimization
  • 相关文献

参考文献14

  • 1The OpenMP Forum.OpenMP Fortran Application Program Interface.Version 2.0[EB/OL].http://www.OpenMP.org,2000-11.
  • 2B Calder,P Feller,A Eustace.Value Profiling[A].The 30th IEEE/ACM Int'l Symp on Microarchitecture (MICRO)[C].1997.259-269.
  • 3M Byler,J R B Daview,C Huson,et al.Multiple Version loops[A].Int'l Conf on Parallel Processing[C].1987.312-318.
  • 4R Gupta,R Bodik.Adaptive Loop Transformations for Scientific Programs[A].IEEE Symp on Parallel and Distributed Processing[C].1995.368-375.
  • 5X Zhang,Z Wang,N Gloy,et al.System Support for Automatic Profiling and Optimization[A].Proc 16th ACM Symp on Operating Systems Principles[C].1997.15-26.
  • 6V Bala,E Duesterwald,S Banerjia.Dynamo:A Transparent Runtime Optimization System[A].ACM SIGPLAN 2000 Conf on Programming Language Design and Implementation[C].2000.
  • 7E Mehofer,B Scholz.A Novel Probabilistic Data Flow Framework[A].Int'l Conf on Compiler Construction[C].2001.
  • 8E Mehofer,B Scholz.Probabilistic Communication Optimizations and Parallelization for Distributed-Memory Systems[A].9th Euromicro Workshop on Parallel and Distributed Processing[C].2001.
  • 9Dimitrios S Nikolopoulos.Quantifying and Resolving Remote Memory Access Contention on Hardware DSM Multiprocessors[A].Proc of the 16th IEEE Int'l Parallel and Distributed Processing Symp[C].2002.
  • 10M Burcea,M J Voss.A Runtime Optimization System for OpenMP[A].Workshop on OpenMP Applications and Tools[C].2003.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部