摘要
谓词执行是在控制流存在的条件下可以有效挖掘指令级并行性的硬件机制,而在分簇结构上实现谓词机制,可以提高分簇结构上条件的执行效率。针对分簇结构展开谓词体系结构的研究,提出了分簇结构部分谓词的高效实现方法,以及基于循环展开的分簇结构部分谓词支持框架。实验表明,提出的分簇结构部分谓词及编译框架可以很好地提高条件执行程序的执行效率,部分典型算法核心的加速比可以达到350%500%。
Predicate execution is a valid method to develop instruction parallelism in the programs with control flow.And implementation of predicate for clustering can improve efficiency of conditional statement.Researching on predicate mechanism for clustering,this paper presented an efficient partial predicate mechanism for clustering,and proposed a compiling implementation framework based on loop unrolling for partial predicate.Experiments show that the predicate implementation framework can better enhance executive efficiency of programs with conditional statement.The speed for some classic algorithms kernels can reach 350%~500%.
作者
王向前
洪一
郑启龙
Wang Xiangqian;Hong Yi;Zheng Qilong(No.38 Research Institute,China Electronics Technology Group Corporation,Hefei 230088,China;School of Computer&Information,Hefei University of Technology,Hefei 230009,China;School of Computer Science&Technology,University of Science&Technology of China,Hefei 230027,China)
出处
《计算机应用研究》
CSCD
北大核心
2018年第1期126-129,共4页
Application Research of Computers
基金
国家重大专项资助项目(2012ZX01034001-001)
关键词
谓词
部分谓词
分簇结构
循环展开
predicate
partial predicate
clustering
loop unrolling