期刊文献+

一种粗粒度可重构体系结构多目标优化映射算法 被引量:4

A Multi-Objective Optimization Mapping Algorithm for Coarse Grained Reconfigurable Architectures
下载PDF
导出
摘要 针对多约束下的行流水粗粒度可重构体系结构的硬件任务划分映射问题,提出了一种多目标优化映射算法.该算法根据运算节点执行时延、依赖度等因素构造了累加概率权值函数,在满足可重构单元面积和互连等约束下,通过该函数值动态调整就绪节点的映射调度次序,当一块可重构单元阵列当前行映射完毕后,就自动换行,当一块阵列被填满,就切换到下一块,当一个数据流图映射完毕后,就自动计算划分块数等参数.实验结果表明,与层贪婪映射算法相比,文中算法平均执行总周期降低了8.4%(RCA4×4)和5.3%(RCA6×6),与分裂压缩内核映射算法相比,文中算法平均执行总周期降低了20.6%(RCA4×4)和21.0%(RCA6×6),从而验证了文中提出算法的有效性. Based on rowpipelining coarse grained reconfigurable architecture( CGRA),we presented MOM( multiobjective optimization mapping) algorithm to solve the multi-constraints hardware task partitioning-mapping problem. The cumulative probability weight function was constructed by the execution delay of computing nodes and the dependence between two nodes,etc. With the constraints of reconfigurable cell area and interconnection,the proposed algorithm could adjust dynamically the scheduling order of the ready nodes by thefunction values. When a rowof the RCA was mapped completely,MOM began on a newrow. When the RCA was filled,MOM switched to the next one. When a DFG( data flowgraph) was mapped completely,the number of modules and etc were calculated automatically in MOM. Experiment results showthat the average execution total cycles of MOM decrease by 8. 4%( RCA4 × 4) and 5. 3%( RCA6 × 6) comparing with LBGM( level based greedy mapping) algorithm. Comparing with SPKM( split-push kernel mapping) algorithm,the average execution total cycles of MOM decrease by 20. 6%( RCA4 × 4) and 21%( RCA6 × 6). Experimental evaluation confirms the efficiency of our approach.
出处 《电子学报》 EI CAS CSCD 北大核心 2015年第11期2151-2160,共10页 Acta Electronica Sinica
基金 国家863高技术研究发展计划(No.2009AA011705,No.2013AA013204) 国家自然科学基金重点项目(No.61432017) 安徽省自然科学基金(No.1408085MF124) 芜湖市科技计划自然科学基金(No.芜科计字[2012]94号) 安徽工程大学国家自然科学预研基金 安徽省高校省级自然科学基金重点项目(No.Kj2015A003)
关键词 可重构单元阵列 时域映射 累加概率权值 异步计算时延 资源约束 reconfigurable cell array temporal mapping accumulation probability weight asynchronous computation delay resource constraint
  • 相关文献

参考文献16

  • 1Cardoso J M P,Diniz C D,Weinhardt M.Compiling for reconfigurable computing:a survey[J]. ACM Computing Surveys,2010,42(4):1301-1365.
  • 2Yoon J W,Lee J,Park S,et al. Architecture customization of on-chip reconfigurable accelerators[J]. ACM Transactions on Design Automation of Electronic Systems,2013,18(4):52:1-52:22.
  • 3Zhao X,Erdogan A T,Arslan T.High-efficiency customized coarse-grained dynamically reconfigurable architecture for JPEG2000[J]. IEEE Transactions on Very Large Scale Integration Systems,2013,21(12):2343-2348.
  • 4Kim Y,Lee J,Shrivastava A,et al.High throughput data mapping for coarse-grained reconfigurable architectures[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2011,30(11):1599-1609.
  • 5杨子煜,严明,王大伟,李思昆.面向CGRA循环流水映射的数据并行优化[J].计算机学报,2013,36(6):1280-1289. 被引量:5
  • 6Han K,Lee G,Choi K.Software-level approaches for tolerating transient faults in a coarse-grained reconfigurable architecture[J]. IEEE Transactions on Dependable and Secure Computing,2014,11(4):392-398.
  • 7Yoon J W,Shrivastava A,Park S,et al.A graph drawing based spatial mapping algorithm for coarse-grained reconfigurable architectures[J]. IEEE Transactions on Very Large Scale Integration Systems,2009,17(11):1565-1578.
  • 8Ferreira R S,Cardoso J M P,Damiany A,et al.Fast placement and routing by extending coarse-grained reconfigurable arrays with Omega Networks[J]. Journal of Systems Architecture,2011,57(8):761-777.
  • 9Lee G,Choi K,Dutt N D.Mapping multi-domain applications onto coarse-grained reconfigurable architectures[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2011,30(5):637-650.
  • 10Ansaloni G,Tanimura K,Pozzi L,et al.Integrated kernel partitioning and scheduling for coarse-grained reconfigurable arrays[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2012,31(12):1803-1816.

二级参考文献48

  • 1潘雪增,孙康,陆魁军,王继民,平玲娣.动态可重构系统任务时域划分算法[J].浙江大学学报(工学版),2007,41(11):1839-1844. 被引量:7
  • 2周博,邱卫东,谌勇辉,彭澄廉.基于簇的层次敏感的可重构系统任务划分算法[J].计算机辅助设计与图形学学报,2006,18(5):667-673. 被引量:12
  • 3Estrin G, Bussel B, Turn R et al. Parallel processing in a re- structurable computer system [ J]. IEEE Transactions on Elec- Ironic Computers, 1963,12(6) : 747 - 755.
  • 4Campi F,Toma M,Lodi A,et al.A VLIW processor with re- configurable instruction set for embedded applications [ J ]. IEEE Journal of Solid-State Circuits, 2003, 38 ( 11 ) : 1876 - 1886.
  • 5Fatahalian K, Houston M . GPUs: A closer look [ J]. ACM Queue,2008,6(2) : 18 - 28.
  • 6Joao M.P. Cardoso, Pedro C. Diniz, Markus Weinha_rdt. Com- piling for reconfigurable computing: A survey [ J]. ACM Com- puting Surveys, 2010.42(4) : 1301 - 1365.
  • 7Karthikeya M, Puma G and Bhatia D. Temporal partitioning and scheduling data flow graphs for reconfigurable computers [J]. IEEE Transactions on Computers, 1999, 48( 6): 579 - 590.
  • 8Jo~o M P Cardoso,Neto H. An enhanced static-list scheduling algorithm for temporal partitioning onto RPUs [ C ]. In: Pro- eeedings of 1999 IFIP International Conference on Very Large Scale Integration, Lisbon, IEEE CS Press, December 1999. 485 - 496.
  • 9Mahmoud M, Masato M.A combined approach to high-level synthesis for dynamically reconfigurable systems [ J ]. IEEE Transactions on Computers,2004,53( 12):1508- 1522.
  • 10Joao M P Cardoso. On combining temporal partitioning and sharing of functional units in compilation for reconfigurable architectures [ J]. IEEE Transactions on Computers, 2003.52 (10) : 1362 - 1375.

共引文献28

同被引文献26

引证文献4

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部