期刊文献+

基于区域图数据流分析的通信优化算法 被引量:5

A Communication Optimization Algorithm Based on Data-Flow Analysis of Region Graph
下载PDF
导出
摘要 减少通信开销对于并行化编译器生成高效的分布代码是非常重要的.首先提出了一个冗余并行执行模型(RPEM)作为通信优化算法生成的目标程序的执行模型,之后给出了区域图的概念和区域最大化算法,在最大化区域图的基础上进行数据流分析可以增大数据流分析粒度,提高分析的效率,同时也有助于通信的提前与合并.最后提出了一种基于区域图数据流分析的通信优化算法.该算法能够进行跨循环、跨过程的数据流分析,提高分析的精度,改善通信优化效果.实验结果表明,该算法对于通信量较大的程序能够有效地减少通信的次数和通信量,具有良好的可扩展性. Reducing communication overhead is extremely important for parallelizing compiler to generate efficient codes for distributed-memory systems. In this paper, a redundant parallel execution model (RPEM) is proposed as an execution model for target programs optimized by the new algorithm. The region graph is introduced, and an effective algorithm is proposed to maximize the regions in the region graph. A region-based data-flow analysis algorithm is proposed to perform communication optimization. The overhead of data-flow analysis can be reduced by performing analysis on the maximized region graph. The coarse grain analysis also helps to communication lift up and aggregation. This communication optimization algorithm is able to perform inter-loop and inter-procedure analysis. Experimental results show that this algorithm is effective in reducing both communication volume and number of messages in programs with a large communication amount.
出处 《软件学报》 EI CSCD 北大核心 2003年第2期175-182,共8页 Journal of Software
基金 国家自然科学基金~~
关键词 区域图 数据流分析 通信优化算法 并行编译 分布代码 计算机 communication optimization data-flow analysis region graph distributed memory system
  • 相关文献

参考文献11

  • 1[1]Balasundaram V, Fox G, Kennedy K, Kremer U. An interactive environment for data partitioning and distribution. In: Charleston SC, ed. Proceedings of the 5th Distributed Memory Computing Conference. New York: ACM Press, 1990.
  • 2[2]Banerjee P, Chandy JA, Gupta M, Hodges IVEW, Holm JG, Lain A, Palermo DJ, Ramaswamy S, Su E. The PARADIGM compiler for distributed-memory multicomputers. IEEE Computer, 1995,28(10):37~47.
  • 3[3]Adve V, Mellor-Crummey J, Sethi A. An integer set framework for HPF analysis and code generation. Technical Report, TR97-275, Computer Science Department, Rice University. 1997.
  • 4[4]Knuth DE. An empirical study of FORTRAN programs. Software-Practice and Experience, 1971,1(2):105~134.
  • 5[5]Johnson SP, Ierotheou CS, Cross M. Automatic parallel code generation for message passing on distributed memory systems. Parallel Computing, 1996,22(2):227~258.
  • 6[6]Allen FE, Cocke J. A program data flow analysis procedure. Communications of the ACM, 1978,19(3):137~174.
  • 7[7]Tarjan RE. Finding dominators in directed graphs. SIAM Journal of Computing, 1974,3(1):62~89.
  • 8[8]Atkinson DC, Griswold WG. Implementation techniques for efficient data-flow analysis of large programs. In: Anneliese AA, Aniello C, eds. Proceedings of the IEEE International Conference on Software Maintenance. Italy: University of Sannio Press, 2001. 52~61.
  • 9[9]Kandemir M, Banerjee P, Choudhary A, et al. A global communication optimization technique based on data-flow analysis and linear algebra. ACM Transactions on Programming Languages and Systems (TOPLAS), 1999,21(6):1251~1297.
  • 10[10]Hall MW, Murphy BR, Amarasinghe SP, Liao S, Lam MS. Interprocedural analysis for parallelization. In: Huang CH, Sadayappan P, Banerjee U, eds. Proceedings of the 8th International Workshop on LCPC. Columbus, Ohio: Springer-Verlag, 1995. 61~80.

同被引文献24

  • 1夏军,杨学军.基于数据空间融合的全局计算与数据划分方法[J].软件学报,2004,15(9):1311-1327. 被引量:7
  • 2王轶然,陈莉,冯晓兵,张兆庆.全局部分重复计算划分[J].计算机研究与发展,2006,43(12):2158-2165. 被引量:2
  • 3董春丽,韩林,赵荣彩.并行编译中一种线性数据和计算划分算法[J].计算机工程,2006,32(24):26-28. 被引量:5
  • 4Stanford SUIF Compiler Group. SUIF: A Parallelizing Optimizing Research Compiler[R]. Computer Systems Lab, Stanford University, Tech, Rep.: CSL-TR-94-620, 1994-05.
  • 5Hurson A, Lim J T, Parallelization of DOALL and DOACROSS Loops A Survey. [S. l.]: Academic Press, 1997.
  • 6Feautrier P. Dataflow analysis of array and scalar references[ J]. In- ternational Journal of Parallel Programming, 1991,2 ( 1 ) :23-53.
  • 7Maydan D E, Amarasinghe S P, Lain M S. Army data-flow analysis and its use in array privatization[ C]. Proceedings of the 20th ACM SIGPLAN-SIGACT Symposium on POPL '93,1993:2-15.
  • 8Tomofumi Y, Sanjay R. Canonic multi-projection:memory alloca- tion for distributed memory parallelization [ R]. CSl1-106, Colo- rado: Colorado State University ,2011.
  • 9Wang S S, Zhao R C, Pang J M. Improvement and implementation of accurate array data-flow analysis [ C ]. Proceedings of The 2006 International Conference on Parallel & Distributed Processing Tech- niques and Applications & Conference on Real - Time Computing Systems & Applications (PDPTA'06) ,2006.
  • 10Gu J ,Li Z. Efficient interprocedural array data-flow analysis for au- tomatic program parallelization[ J]. IEEE Transactions on Software Engineering, 2000,26 ( 3 ) : 244-261.

引证文献5

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部