基于跨基本块变换和循环分布的SLP优化技术

SLP Optimization Algorithm Using Across Basic Block Transformation and Loop Distribution

下载PDF

导出

摘要现有的SLP优化算法无法处理内层循环中存在的依赖环和归约,并且在基本块边界产生大量的冗余拆包和赋值语句,从而导致向量化效率不高。针对该问题,提出了一种基于跨基本块变换和循环分布的SLP优化算法。该算法以控制流图为基础,根据基本块间各数组变量的Define-Use关系以及跨越基本块之间的数据依赖关系进行跨基本块的向量化变换,有序地采用跨基本块变换和循环分布,尽可能发掘最内层循环基本块内语句的并行性,使SLP自动向量化编译器生成具有更多SIMD指令的向量化代码。实验结果表明,该算法能够隐藏更多跨基本块冗余操作的开销,同时利用跨基本块的数据依赖生成更优的SIMD指令,有效地提高了向量化程序的加速比。 The existing SLP algorithms cannot handle dependent ring and the reduction of the inner loop, and generate a large number of redundant packet disassembly and assignment statements in a basic block boundary, which leads to the lower quantization efficiency. In order to solve the problem, this paper proposed a SLP optimization algorithm using cross basic block transformation and loop distribution. Based on the control flow graph, according to the basic blocks of the array variable between Define-Use and across basic block data relation between across basic block, the algorithm makes the quantized transform, orderly uses across basic block transform and loop distribution, and then expands inner loop within a basic block sentence parallelism as far as possible, making SLP automatic vectorization compiler to genera te the vectorization code which has more SIMD instruction. The experimental results show that the algorithm can hide more across basic block redundancy operation cost, at the same time generate better SIMD instructions across basic block data dependence, effectively improving the vectorization program speedup.

作者索维毅赵荣彩姚远张小妹

机构地区解放军信息工程大学解放军

出处《计算机科学》 CSCD 北大核心 2013年第10期24-28,60,共6页 Computer Science

基金核高基重大专项(2009ZX01036-001-001-2)资助

关键词 SLP 跨基本块变换循环分布数据依赖控制流图 Define-Use关系 SLP, Cross basic block, Loop distribution, Data dependence, Control flow graph, Define-Use relationship

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献12

1Franchetti F, Kral S, Lorenz J, et al. Efficiem utilization of SIMD extensions[J]. Proceedings ofthe IEEE, 2005,93(2) : 409-425.
2TMS320C6000 CPU and Instruction Set Reference Guide(Rev. F)[M]. TexasInstruments Inc. 2000.
3SC140 DSP Core Reference Manual[R/OL]. http://cache, free- scale, com/files/dsp/doc/ref_ manual/MNSC140CA:)RE, pdf, 2012-05-20.
4Fridman J, Greenfield Z. The Tiger SHARC DSP Architecture [J]. IEEE Micro, 2000,20(1 ) : 66-76.
5Tanaka H, Ota Y, Matsumoto N, et al. A New Compilation Technique for SIMD Code Generation Across Basic Block Boundaries[C] // Design Automation Conference ( ASP-DAC), 2010 15th Asia and South Pacifi. Jan. 2010:101-106.
6Larsen S, Amarasinghe S. Exploiting superword level parallelism with multimedia instruction sets[C]//Proc of the ACM SIGP- LAiN Conference on Programming Language Design and Imple- mentation. June 2000:145-156.
7Shin J, Hall M, Charne J. Superword-level Parallelism in the Presence of Control Flow[C]//Proc. of the International Sym- posium on Code Generation and optimization. March 2005:165- 175.
8Nuzman D, Zaks A. Outer-loop vectorization: revisited for short simd architectures[C]//Proceedings of the 17th international conference on parallel architectures and compilation techniques, PACT ' 08. New York, NY, USA, ACM, 2008: 2-11.
9Aho A V,Lam M S,Sethi R,et al.编译原理[M].陈火旺,刘春林,谭庆平,等,译.北京: 机械工业出版社,2009.
10陈火旺,刘春林.程序设计语言编译原理(第3版)[M].北京:国防工业出版社,2001.

共引文献2

1徐健峰,张正兰,张明.OIL代码自动生成技术过程中的部分研究[J].信息化纵横,2009(7):7-9.
2窦增杰,王震宇,陈楠,王瑞敏,田佳.基于可执行代码中间表示的控制流分析[J].计算机工程,2010,36(21):31-33. 被引量：2

1曾扬.循环分布及依赖关系破除的优化问题[J].计算机学报,1993,16(6):470-475. 被引量：1
2韩林,徐金龙,李颖颖,王阳.面向部分向量化的循环分布及聚合优化[J].计算机科学,2017,44(2):70-74. 被引量：1
3陈海波.一般循环的分布问题[J].计算机应用与软件,1989,6(2):24-32. 被引量：1
4黄磊,姚远,侯永生,杨明.自动向量化中基于数据依赖分析的循环分布算法[J].计算机科学,2011,38(9):288-293.
5吴荣钦.DBASEⅢ通用数组变量的设计技巧[J].福建电脑,1989(4):33-34.
6余振汉.谈DBASEⅢ数组变量的设计与应用[J].中国纺织大学学报,1991,17(2):54-56.
7涂聪.大数据时代背景下的数据可视化应用研究[J].武昌理工学院学报,2013,8(2):107-108.
8郝国良.浅谈虚拟专用网[J].中国科技博览,2009(16):114-114.
9孙大烈,李建中.基于MapReduce的Skyline-join查询算法[J].哈尔滨工业大学学报,2012,44(1):103-106. 被引量：6
10王秉政,黄亚楼,董恒竞.一种深度优先挖掘Generator表示的有效算法[J].计算机工程,2007,33(8):20-22.

计算机科学

2013年第10期

浏览历史

内容加载中请稍等...

基于跨基本块变换和循环分布的SLP优化技术

参考文献12

共引文献2

相关作者

相关机构

相关主题

浏览历史