寄存器堆互连的VLIW结构及其指令调度算法

Instruction Scheduling Algorithm for Register File Connectivity Clustered VLIW Architecture

下载PDF

导出

摘要超长指令字(Very Long Instruction Word,VLIW)处理器一般采用总线互连的多簇结构,每个簇中的功能单元共享一个本地寄存器堆,簇间采用总线传输数据,以避免功能单元增多时,全连通结构的延时、面积和功耗的快速增长;但簇间数据共享时的拷贝和延时,使得处理器在性能上有所下降.文中提出了一种寄存器堆互连的多簇VLIW结构,采用寄存器堆来连接各个簇,从而可以避免簇间数据传输的延时和额外的数据拷贝操作.同时也提出了针对这种结构的指令调度算法,以提高指令调度的性能.实验结果表明,与全连通的VLIW结构相比,寄存器堆互连结构在性能上仅有13%左右的性能下降,代码长度则基本不变;这都优于总线互连的多簇结构. Generally VLIW（Very Long Instruction Word） processors are implemented as busconnectivity clustered architecture, in which the function units in a cluster only access the corresponding local registers and different clusters are connected by buses. This architecture can avoid aggressive growing of delay, area and power in full-connectivity VLIW processors when function units increase. However, performance degradation is induced by its copy operations and latency of communications between clusters. This paper presents a new clustered architecture, in which a register file is used to connect all the clusters so as to turn copy and latency away. This paper also gives instruction scheduling algorithm to improve the performance. The experimental results in- dicate that this new architecture under the help of this scheduling algorithm shows only 13% performance degradation and little code size increase in average compared with those of fully tivity VLIW architecture, which prevails that of bus-connectivity clustered VLIW archite connec cture.

作者周志雄何虎杨旭张延军孙义和

机构地区清华大学微电子学研究所

出处《计算机学报》 EI CSCD 北大核心 2008年第1期127-132,共6页 Chinese Journal of Computers

基金国家自然科学基金(60236020)资助

关键词超长指令字指令调度寄存器堆 VLIW instruction scheduling register file

分类号 TP314 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献9

1Joseph A. Fisher, Paolo Faraboschi and Cliff Young. Embedded computing. California: Morgan Kaufmann, 2005
2Texas Instrument Inc. TMS320C62x/67x CPU and instruction set reference guide. 1998
3Fridman J, Greefield Z. The TigerSharc DSP architecture. IEEE Micro, 2000, 20(1): 66-76
4Zhang Yan Jun, He Hu, Sun Yi-He. A new register file access architecture for software pipelining in VLIW processors// Proceedings of the ASP-DAC. Shanghai, 2005, 1:627-630
5Faraboschi P, Finsher J A, Young C. Instruction scheduling for instruction level parallel processors. Proceedings of the IEEE, 2001, 89(11): 1638-1659
6Ellis J R. Bulldog: A Compiler for VLIW Architectures. London: The MIT Press, 1986
7Capitanio A, Dutt N, Nicolau A. Partitioned register files for VLIWs: A preliminary analysis of tradeoffs//Proceedings of the 25th International Symposium on Microarehiteeture, 1992: 292-300
8Ozer E, Banerjia S, Conte T M. Unified assign and schedule: A new approach to scheduling for clustered register file microarchitectures//Proceedings of the 31st International Symposium on Microarchitecture. Dallas, TX, 1998:308-315
9Rixner S, Dally W J, Khailany Bet al. Register organization for media processing//Proceedings of the 6th International Symposium on High-Performance Computer Architecture. Touluse, 2000:375-286

1甘玲,汤睿.一种面向VLIW芯片的线性指令调度算法[J].微计算机信息,2009,25(2):153-155.
2刘瑞芳,万继光,谭志虎.RAID中零拷贝技术研究[J].华中科技大学学报（自然科学版）,2005,33(z1):161-163. 被引量：1
3李智勇.Windows XP中使用鼠标右键引起CPU100％占用问题[J].电子乐园,2009(4):37-37.
4王红梅,王敏,张铁军,单睿,侯朝焕.面向VLIW结构的寄存器压力敏感表调度算法[J].计算机应用研究,2009,26(11):4039-4041.
5车立新.红外遥控器发射信号的“高效识别方案”[J].黑龙江科技信息,2013(14):12-12.
6金丽,包志华,陈海进.基于ARM嵌入式系统的C程序优化设计方法[J].南通大学学报（自然科学版）,2006,5(3):61-64. 被引量：8
7王丽芳,符意德,纽远.嵌入式系统程序优化方法的研究[J].通讯和计算机（中英文版）,2005,2(4):14-17.
8姚玉钦.一种基于FPGA的ARM与PCI接口设计方案[J].信阳师范学院学报（自然科学版）,2009,22(2):304-306.
9卢海军.最小的多线程框架[J].单片机与嵌入式系统应用,2010,10(4):70-71. 被引量：2
10牛学军,李俊莉.用C语言实现程序与帮助信息分离[J].辽宁师专学报（自然科学版）,2000,2(2):38-40.

计算机学报

2008年第1期

浏览历史

内容加载中请稍等...

寄存器堆互连的VLIW结构及其指令调度算法

参考文献9

相关作者

相关机构

相关主题

浏览历史