一种面向众核架构的数据流编译框架被引量：2

A Compilation Framework of Dataflow Programs for Many-Core Architecture

下载PDF

导出

摘要数据流编程模型将程序设计与媒体处理相结合,已大量应用到各个领域.众核处理器已经成为主流和工业标准,如何利用众核架构的特性来提高流应用执行性能已成为目前研究工作的一大难点.文中提出了一个高效的流编译框架来优化流应用的执行,该框架包含3个优化策略:设计一个最优的软件流水调度方法;提出一个高效的数据存储分配算法;并采用合理的众核间的映射策略,减小通信以及同步的开销.文中在Godson-T上实现了该编译器框架,实验结果表明,该方法比优化前有较大性能改进. Domain specific programming like Dataflow Programming Model which combines the features of media applications and programming languages has applied to many fields. Many-core architecture has become the mainstream solution and industry standard, how to use the character- istic of many-core architecture to improve the performance of stream applications has become a difficulty in present research work. In order to solve these problems, we propose an efficient stream compilation framework for many-core architecture to optimize the execution of stream applications, which is composed of three optimization strategy. In the first phrase, rate-optimal software pipelining schedule is constructed to improve parallelism. Then, a buffer allocation algorithm is proposed to allocate the data for pipelining schedule and redundant buffer copy operation is eliminated. The last phase maps the logical cores to the physical cores to reduce the communi- cation overhead. We also implement the framework on Godson-T and the experiments show that our method obtains about an average 58% improvement.

作者魏海涛秦明康于俊清范东睿

机构地区华中科技大学计算机科学与技术学院华中科技大学网络与计算中心中国科学院计算技术研究所计算机体系结构国家重点实验室

出处《计算机学报》 EI CSCD 北大核心 2014年第7期1560-1569,共10页 Chinese Journal of Computers

基金国家"八六三"高技术研究发展计划重点项目(2012AA010902) 高等学校博士学科点专项科研基金(20120142110089) 中国科学院计算技术研究所国家重点实验室开放基金 IBM X10 Innovation基金资助~~

关键词编译框架数据流程序众核处理器软件流水并行 compilation framework； data flow programs many-core processor software pipelining parallelism

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1Waingold E,et al.Baring it all to software:Raw machines.IEEE Computer,1997,30(9):86-93.
2Tan G,Fan D,Zhang J,et al.Experience on optimizing irregular computation for memory hierarchy in manycore architecture//Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP08).Salt Lake City,USA,2008:279-280.
3Howard J,et al.A 48-core IA-32 message-passing processor with DVFS in 45 nm CMOS//Proceedings of the Solid State Circuits Conference Digest of Technical Papers (ISSCC).San Francisco,USA,2010:108-109.
4Wei Haitao,Yu Junqing,Yu Huafei,Gao Guang R.Minimizing communication in rate optimal software pipelining for stream programs//Proceedings of the 2010 International Symposium on Code Generation and Optimization (CGO).Toronto,Canada,2010:210-217.
5Gordon M I,Thies W,Amarasinghe S.Exploiting coarsegrained task,data,and pipeline parallelism in stream programs//Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems.New York,USA,2006:151-162.
6Steinke S,Wehmeyer L,Lee B-S,Marwedel P.Assigning program and data objects to scratchpad for energy reduction//Proceedings of the Conference on Design,Automation and Test in Europe (DATE'02).Paris,France,2002:409-415.
7Lam M.Software pipelining:An effective scheduling technique for VLIW machines//Proceedings of the SIGPLAN' 88 Conference on Programming Language Design and Implementation.Atlanta,USA,1988:318-328.
8Choi Y,Lin Yuan,Chong N,et al.Stream compilation for real time embedded multicore systems//Proceedings of the 2009 International Symposium on Code Generation and Optimization(CGO).Seattle,USA,2009:210-220.
9Verma M,Wehmeyer L,Marwedel P.Dynamic overlay of scratchpad memory for energy minimization//Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis.Stockholm,Sweden,2004:104-109.
10Avissar O,Barua R.An optimal memory allocation scheme for scratchpad-based embedded systems.IEEE Transactions on Embedded Computing Systems,2002,1(1):6-26.

同被引文献10

1吉顺慧,李必信,邱栋.基于XCFG的BPEL数据流属性分析与验证[J].电子学报,2013,41(7):1365-1370. 被引量：3
2张维维,魏海涛,于俊清,李鹤,黎昊,杨秋吉.COStream:一种面向数据流的编程语言和编译器实现[J].计算机学报,2013,36(10):1993-2006. 被引量：10
3张建朋,陈福才,李邵梅,刘力雄.基于密度与近邻传播的数据流聚类算法[J].自动化学报,2014,40(2):277-288. 被引量：28
4赵强利,蒋艳凰,卢宇彤.具有回忆和遗忘机制的数据流挖掘模型与算法[J].软件学报,2015,26(10):2567-2580. 被引量：15
5周爱平,程光,郭晓军,朱琛刚.长持续时间数据流的并行检测算法[J].通信学报,2015,36(11):156-166. 被引量：2
6Omid ABBASZADEH,Ali AMIRI,Ali Reza KHANTEYMOORI.An ensemble method for data stream classification in the presence of concept drift[J].Frontiers of Information Technology & Electronic Engineering,2015,16(12):1059-1068. 被引量：3
7申小伟,叶笑春,王达,张浩,王飞,谭旭,张志敏,范东睿,唐志敏,孙凝晖.一种面向科学计算的数据流优化方法[J].计算机学报,2017,40(9):2181-2196. 被引量：9
8Xu Tan,Xiao-Chun Ye,Xiao-Wei Shen,Yuan-Chao Xu,Da Wang,Lunkai Zhang,Wen-Ming Li,Dong-Rui Fan,Zhi-Min Tang.A Pipelining Loop Optimization Method for Dataflow Architecture[J].Journal of Computer Science & Technology,2018,33(1):116-130. 被引量：2
9Xu Tan,Xiao-Wei Shen,Xiao-Chun Ye,Da Wang,Dong-Rui Fan,Lunkai Zhang,Wen-Ming Li,Zhi-Min Zhang,Zhi-Min Tang.A Non-Stop Double Buffering Mechanism for Dataflow Architecture[J].Journal of Computer Science & Technology,2018,33(1):145-157. 被引量：4
10向陶然,叶笑春,李文明,冯煜晶,谭旭,张浩,范东睿.基于细粒度数据流架构的稀疏神经网络全连接层加速[J].计算机研究与发展,2019,56(6):1192-1204. 被引量：11

引证文献2

1刘红庆,舒底清,刘燕,黄雁.基于加权机制概念漂移的数据流GNB分类检测[J].控制工程,2019,26(3):589-595. 被引量：5
2范志华,李文明,叶笑春,范东睿.数据流计算研究进展与概述[J].数据与计算发展前沿,2021,3(5):65-81. 被引量：1

二级引证文献6

1郭锋锋.大数据背景下引入多重选择机制分类挖掘带概念漂移的高速数据流优化算法[J].九江学院学报（自然科学版）,2019,34(3):76-77.
2熊菊霞,吴尽昭.高维数据流异常节点动态跟踪仿真研究[J].计算机仿真,2020,37(10):445-449. 被引量：3
3韦洁华.基于自适应微簇的任意形状概念漂移数据流聚类[J].计算机应用与软件,2020,37(11):260-267. 被引量：1
4王俊红,郭亚慧.面向动态数据块的非平衡数据流分类算法[J].计算机工程与应用,2021,57(13):124-129. 被引量：4
5李林,马芳平,彭放,孙延黎,徐镭梦.基于知识图谱的数据中心异常数据流检测系统设计[J].电子设计工程,2023,31(7):77-81.
6康旺,寇竞,赵巍胜.存算一体芯片发展现状、趋势与挑战[J].中国科学：信息科学,2024,54(1):16-24. 被引量：3

1郭青,陈国良,陈意云.数据流程序设计语言[J].计算机研究与发展,1990,27(4):22-30. 被引量：2
2刘旸,张兆庆,乔如良.基于域的编译框架[J].计算机学报,2003,26(2):188-194. 被引量：5
3白秀秀,董小社,刘超,曹海军,李亮.面向异构多核架构的自适应编译框架[J].计算机学报,2014,37(7):1548-1559. 被引量：2
4张维维,魏海涛,于俊清,李鹤,黎昊,杨秋吉.COStream:一种面向数据流的编程语言和编译器实现[J].计算机学报,2013,36(10):1993-2006. 被引量：10
5杨秋吉,于俊清,莫斌生,何云峰.面向Storm的数据流编程模型与编译优化方法研究[J].计算机工程与科学,2016,38(12):2409-2418. 被引量：3
6张素平,王冬,丁丽丽,王鹏翔,宫一,于海宁.一种基于SLP的新型编译框架[J].计算机应用研究,2017,34(1):21-26. 被引量：1
7龙舜.一个Java自适应优化编译框架的设计与实现[J].暨南大学学报（自然科学与医学版）,2006,27(5):676-682.
8刘磊,李振国,高艳华,丁岩,申春,刘雷.特定领域语言MISPC及其编译框架实现技术[J].吉林大学学报（理学版）,2016,54(4):805-812. 被引量：3
9魏海涛,于俊清,余华飞,秦明康.一种面向数据流程序的软件流水并行化方法[J].计算机学报,2011,34(5):889-898. 被引量：5
10赵迪,华保健,朱洪军.高阶代码消除性能比较框架的设计与实现[J].计算机应用,2016,36(9):2481-2485. 被引量：1

计算机学报

2014年第7期

浏览历史

内容加载中请稍等...

一种面向众核架构的数据流编译框架被引量：2

参考文献13

同被引文献10

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种面向众核架构的数据流编译框架 被引量：2

参考文献13

同被引文献10

引证文献2

二级引证文献6

相关作者

相关机构

相关主题

浏览历史

一种面向众核架构的数据流编译框架被引量：2