期刊文献+

Efficient Scheduling Mapping Algorithm for Row Parallel Coarse-Grained Reconfigurable Architecture 被引量:4

原文传递
导出
摘要 Row Parallel Coarse-Grained Reconfigurable Architecture(RPCGRA)has the advantages of maximum parallelism and programmable flexibility.Designing an efficient algorithm to map the diverse applications onto RPCGRA is difficult due to a number of RPCGRA hardware constraints.To solve this problem,the nodes of the data flow graph must be partitioned and scheduled onto the RPCGRA.In this paper,we present a Depth-First Greedy Mapping(DFGM)algorithm that simultaneously considers the communication costs and the use times of the Reconfigurable Cell Array(RCA).Compared with level breadth mapping,the performance of DFGM is better.The percentage of maximum improvement in the use times of RCA is 33%and the percentage of maximum improvement in non-original input and output times is 64.4%(Given Discrete Cosine Transfor 8(DCT8),and the area of reconfigurable processing unit is 56).Compared with level-based depth mapping,DFGM also obtains the lowest averages of use times of RCA,non-original input and output times,and the reconfigurable time.
出处 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2021年第5期724-735,共12页 清华大学学报(自然科学版(英文版)
基金 supported by the Natural Science Foundation of Anhui Province(No.1808085MF203) the National Natural Science Foundation of China(No.61432017)。
  • 相关文献

参考文献1

二级参考文献14

  • 1潘雪增,孙康,陆魁军,王继民,平玲娣.动态可重构系统任务时域划分算法[J].浙江大学学报(工学版),2007,41(11):1839-1844. 被引量:7
  • 2周博,邱卫东,谌勇辉,彭澄廉.基于簇的层次敏感的可重构系统任务划分算法[J].计算机辅助设计与图形学学报,2006,18(5):667-673. 被引量:12
  • 3Estrin G, Bussel B, Turn R et al. Parallel processing in a re- structurable computer system [ J]. IEEE Transactions on Elec- Ironic Computers, 1963,12(6) : 747 - 755.
  • 4Campi F,Toma M,Lodi A,et al.A VLIW processor with re- configurable instruction set for embedded applications [ J ]. IEEE Journal of Solid-State Circuits, 2003, 38 ( 11 ) : 1876 - 1886.
  • 5Fatahalian K, Houston M . GPUs: A closer look [ J]. ACM Queue,2008,6(2) : 18 - 28.
  • 6Joao M.P. Cardoso, Pedro C. Diniz, Markus Weinha_rdt. Com- piling for reconfigurable computing: A survey [ J]. ACM Com- puting Surveys, 2010.42(4) : 1301 - 1365.
  • 7Karthikeya M, Puma G and Bhatia D. Temporal partitioning and scheduling data flow graphs for reconfigurable computers [J]. IEEE Transactions on Computers, 1999, 48( 6): 579 - 590.
  • 8Jo~o M P Cardoso,Neto H. An enhanced static-list scheduling algorithm for temporal partitioning onto RPUs [ C ]. In: Pro- eeedings of 1999 IFIP International Conference on Very Large Scale Integration, Lisbon, IEEE CS Press, December 1999. 485 - 496.
  • 9Mahmoud M, Masato M.A combined approach to high-level synthesis for dynamically reconfigurable systems [ J ]. IEEE Transactions on Computers,2004,53( 12):1508- 1522.
  • 10Joao M P Cardoso. On combining temporal partitioning and sharing of functional units in compilation for reconfigurable architectures [ J]. IEEE Transactions on Computers, 2003.52 (10) : 1362 - 1375.

共引文献6

同被引文献22

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部