基于区域图数据流分析的通信优化算法被引量：5

A Communication Optimization Algorithm Based on Data-Flow Analysis of Region Graph

下载PDF

导出

摘要减少通信开销对于并行化编译器生成高效的分布代码是非常重要的.首先提出了一个冗余并行执行模型(RPEM)作为通信优化算法生成的目标程序的执行模型,之后给出了区域图的概念和区域最大化算法,在最大化区域图的基础上进行数据流分析可以增大数据流分析粒度,提高分析的效率,同时也有助于通信的提前与合并.最后提出了一种基于区域图数据流分析的通信优化算法.该算法能够进行跨循环、跨过程的数据流分析,提高分析的精度,改善通信优化效果.实验结果表明,该算法对于通信量较大的程序能够有效地减少通信的次数和通信量,具有良好的可扩展性. Reducing communication overhead is extremely important for parallelizing compiler to generate efficient codes for distributed-memory systems. In this paper, a redundant parallel execution model (RPEM) is proposed as an execution model for target programs optimized by the new algorithm. The region graph is introduced, and an effective algorithm is proposed to maximize the regions in the region graph. A region-based data-flow analysis algorithm is proposed to perform communication optimization. The overhead of data-flow analysis can be reduced by performing analysis on the maximized region graph. The coarse grain analysis also helps to communication lift up and aggregation. This communication optimization algorithm is able to perform inter-loop and inter-procedure analysis. Experimental results show that this algorithm is effective in reducing both communication volume and number of messages in programs with a large communication amount.

作者钟洪涛舒继武温冬婵郑纬民

机构地区清华大学计算机科学与技术系

出处《软件学报》 EI CSCD 北大核心 2003年第2期175-182,共8页 Journal of Software

基金国家自然科学基金~~

关键词区域图数据流分析通信优化算法并行编译分布代码计算机 communication optimization data-flow analysis region graph distributed memory system

分类号 TP314 [自动化与计算机技术—计算机软件与理论] TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献11

1[1]Balasundaram V, Fox G, Kennedy K, Kremer U. An interactive environment for data partitioning and distribution. In: Charleston SC, ed. Proceedings of the 5th Distributed Memory Computing Conference. New York: ACM Press, 1990.
2[2]Banerjee P, Chandy JA, Gupta M, Hodges IVEW, Holm JG, Lain A, Palermo DJ, Ramaswamy S, Su E. The PARADIGM compiler for distributed-memory multicomputers. IEEE Computer, 1995,28(10):37～47.
3[3]Adve V, Mellor-Crummey J, Sethi A. An integer set framework for HPF analysis and code generation. Technical Report, TR97-275, Computer Science Department, Rice University. 1997.
4[4]Knuth DE. An empirical study of FORTRAN programs. Software-Practice and Experience, 1971,1(2):105～134.
5[5]Johnson SP, Ierotheou CS, Cross M. Automatic parallel code generation for message passing on distributed memory systems. Parallel Computing, 1996,22(2):227～258.
6[6]Allen FE, Cocke J. A program data flow analysis procedure. Communications of the ACM, 1978,19(3):137～174.
7[7]Tarjan RE. Finding dominators in directed graphs. SIAM Journal of Computing, 1974,3(1):62～89.
8[8]Atkinson DC, Griswold WG. Implementation techniques for efficient data-flow analysis of large programs. In: Anneliese AA, Aniello C, eds. Proceedings of the IEEE International Conference on Software Maintenance. Italy: University of Sannio Press, 2001. 52～61.
9[9]Kandemir M, Banerjee P, Choudhary A, et al. A global communication optimization technique based on data-flow analysis and linear algebra. ACM Transactions on Programming Languages and Systems (TOPLAS), 1999,21(6):1251～1297.
10[10]Hall MW, Murphy BR, Amarasinghe SP, Liao S, Lam MS. Interprocedural analysis for parallelization. In: Huang CH, Sadayappan P, Banerjee U, eds. Proceedings of the 8th International Workshop on LCPC. Columbus, Ohio: Springer-Verlag, 1995. 61～80.

同被引文献24

1夏军,杨学军.基于数据空间融合的全局计算与数据划分方法[J].软件学报,2004,15(9):1311-1327. 被引量：7
2王轶然,陈莉,冯晓兵,张兆庆.全局部分重复计算划分[J].计算机研究与发展,2006,43(12):2158-2165. 被引量：2
3董春丽,韩林,赵荣彩.并行编译中一种线性数据和计算划分算法[J].计算机工程,2006,32(24):26-28. 被引量：5
4Stanford SUIF Compiler Group. SUIF: A Parallelizing Optimizing Research Compiler[R]. Computer Systems Lab, Stanford University, Tech, Rep.: CSL-TR-94-620, 1994-05.
5Hurson A, Lim J T, Parallelization of DOALL and DOACROSS Loops A Survey. [S. l.]: Academic Press, 1997.
6Feautrier P. Dataflow analysis of array and scalar references[ J]. In- ternational Journal of Parallel Programming, 1991,2 ( 1 ) :23-53.
7Maydan D E, Amarasinghe S P, Lain M S. Army data-flow analysis and its use in array privatization[ C]. Proceedings of the 20th ACM SIGPLAN-SIGACT Symposium on POPL '93,1993:2-15.
8Tomofumi Y, Sanjay R. Canonic multi-projection:memory alloca- tion for distributed memory parallelization [ R]. CSl1-106, Colo- rado: Colorado State University ,2011.
9Wang S S, Zhao R C, Pang J M. Improvement and implementation of accurate array data-flow analysis [ C ]. Proceedings of The 2006 International Conference on Parallel & Distributed Processing Tech- niques and Applications & Conference on Real - Time Computing Systems & Applications (PDPTA'06) ,2006.
10Gu J ,Li Z. Efficient interprocedural array data-flow analysis for au- tomatic program parallelization[ J]. IEEE Transactions on Software Engineering, 2000,26 ( 3 ) : 244-261.

引证文献5

1史晓华,吴甘沙,金茂忠,LUEH Guei-Yuan,刘超,王雷.在开放世界中实现逃逸分析[J].软件学报,2008,19(3):522-532. 被引量：1
2龚雪容,陆林生,赵荣彩.分布内存系统中流水并行代码的自动生成[J].计算机工程,2008,34(11):77-79. 被引量：4
3王军委,赵荣彩,李妍.基于Define-Use分析的冗余通信消除算法[J].计算机工程,2009,35(4):85-87. 被引量：1
4丁锐,赵荣彩,韩林.一种基于数组生命期的数据分解算法[J].软件学报,2013,24(12):2843-2858.
5丁锐,赵荣彩,赵捷.一种面向划分的数组数据流分析方法[J].小型微型计算机系统,2014,35(3):532-537.

二级引证文献6

1吴悦,雷超付,杨洪斌.选择性循环的并行方法[J].计算机工程,2010,36(9):35-37. 被引量：1
2闫昭,刘磊.基于任务量划分的紧嵌套循环自动并行化方法[J].吉林大学学报（理学版）,2010,48(4):631-635.
3周雷,陈克非.基于符号运算的归纳变量识别与约化[J].计算机工程,2010,36(24):71-73. 被引量：2
4赵捷,赵荣彩,丁锐,陈达智.基于Define-Use图的MPI通信求解算法[J].计算机工程,2012,38(4):247-250.
5张庆花,赵荣彩,李朋远.一种面向规则DOACROSS循环的自动并行化框架[J].小型微型计算机系统,2016,37(6):1365-1370.
6白开峰,赵宏斌,李妍,韩丽芳,马勇.资源池环境下虚拟机防逃逸监控设计[J].电子技术与软件工程,2018(18):183-184. 被引量：1

1梅园.无线传感网络加密通信优化算法的研究与仿真[J].计算机仿真,2012,29(7):195-198. 被引量：5
2王章静,张琦,罗俊海.无线传感网络通信优化仿真分析[J].计算机仿真,2014,31(1):282-285. 被引量：15
3张云山.xPC实时半物理仿真平台中并行执行模型的设计[J].电子设计工程,2012,20(18):63-66.
4谢立,赵宽奇,陈军.逻辑程序的一种并行执行模型[J].计算机研究与发展,1994,31(5):1-6.
5黄朝,南建国.FC光纤通道中的通信优化算法的仿真分析[J].计算机仿真,2013,30(8):194-197. 被引量：4
6陈桂芳.互联网环境下一种新的非否认协议研究[J].计算机测量与控制,2010,18(2):457-458. 被引量：1
7丁锐,赵荣彩,韩林.程序并行化中基于暴露集生成数据分布代码[J].计算机工程与设计,2009,30(15):3573-3576.
8孙成政,慈云桂.逻辑程序并行执行模型的模拟实验研究[J].计算机学报,1989,12(9):650-662.
9郑宇华,戴非,谢立.一个支持非独立“与”并行的逻辑程序“与”并行执行模型[J].计算机研究与发展,1993,30(11):1-7. 被引量：1
10顾兆军,张友海.并行程序控制流图可视化的实现[J].中国民航学院学报,2000,18(1):43-48.

软件学报

2003年第2期

浏览历史

内容加载中请稍等...

基于区域图数据流分析的通信优化算法被引量：5

参考文献11

同被引文献24

引证文献5

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

基于区域图数据流分析的通信优化算法 被引量：5

参考文献11

同被引文献24

引证文献5

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

基于区域图数据流分析的通信优化算法被引量：5