Decoded Instruction Cache for Reducing Startup Overhead in Co-Designed Virtual Machines

Cited by: 3

Abstract: Co-designed virtual machines (co-VMs) give processor designers new opportunities for innovation by combining hardware and software. A co-VM uses dynamic binary translation to provide binary compatibility between different instruction set architectures (ISAs), and the cost of interpreting and translating source ISA binaries affects its startup performance. In a co-VM that combines interpretation with superblock translation, we observe that interpreting cold (non-hot) code is the main source of startup overhead, and that redundant decoding of source instructions is the bottleneck of interpretation. We identify interpretation routine locality in co-VMs and propose a hardware decoded instruction cache (DICache) that stores the instruction information decoded during interpretation, exploits this locality, and avoids a large number of repeated decoding operations. DICache can be organized like a normal cache and is maintained by hardware. We implement a co-VM and evaluate DICache on it with a set of commercial application benchmarks from SYSmark 2004 SE. We also evaluate the implementation overhead of DICache, such as area and power consumption. The results show that DICache significantly reduces redundant decoding and speeds up interpretation, improving the startup performance of the co-VM by about 2.4 times on average relative to the baseline co-VM. Compared with related optimization techniques, DICache performs better and is more widely applicable.
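The idea of reusing decoded instruction information can be illustrated with a small software model. The sketch below is not the hardware design from the paper: the 16-bit toy encoding, the direct-mapped organization tagged by source PC, the entry count, and all names (`DICacheModel`, `fetch_decoded`, `interpret`, `PROGRAM`) are assumptions made for illustration. It only shows how a loop in the interpreted code turns repeated decodes into cache hits.

```python
from dataclasses import dataclass

# Hypothetical 16-bit toy encoding: [opcode:4][a:6][b:6].
OPCODES = {0: "addi", 1: "subi", 2: "bnez", 3: "halt"}


@dataclass(frozen=True)
class Decoded:
    op: str  # operation name
    a: int   # register index
    b: int   # immediate value or branch target


def encode(opcode, a=0, b=0):
    """Pack the three fields into one toy instruction word."""
    return (opcode << 12) | (a << 6) | b


def decode(word):
    """Stand-in for the expensive source-instruction decode step."""
    return Decoded(OPCODES[word >> 12], (word >> 6) & 0x3F, word & 0x3F)


class DICacheModel:
    """Direct-mapped cache of decoded instructions, tagged by source PC.

    Assumption: code is never modified while cached, so there is
    no invalidation logic in this sketch.
    """

    def __init__(self, num_entries=8):
        self.num_entries = num_entries
        self.tags = [None] * num_entries
        self.lines = [None] * num_entries
        self.hits = 0
        self.misses = 0

    def fetch_decoded(self, pc, memory):
        index = pc % self.num_entries
        if self.tags[index] == pc:
            self.hits += 1            # decoded form is reused
            return self.lines[index]
        self.misses += 1              # decode once, then remember it
        decoded = decode(memory[pc])
        self.tags[index] = pc
        self.lines[index] = decoded
        return decoded


def interpret(memory, cache):
    """Interpret the toy program, decoding only on cache misses."""
    regs = [0] * 64
    pc = 0
    while True:
        inst = cache.fetch_decoded(pc, memory)
        if inst.op == "addi":
            regs[inst.a] += inst.b
            pc += 1
        elif inst.op == "subi":
            regs[inst.a] -= inst.b
            pc += 1
        elif inst.op == "bnez":
            pc = inst.b if regs[inst.a] != 0 else pc + 1
        else:  # halt
            return regs


PROGRAM = [
    encode(0, 1, 10),  # pc 0: r1 += 10 (loop counter)
    encode(0, 2, 3),   # pc 1: r2 += 3  (loop body)
    encode(1, 1, 1),   # pc 2: r1 -= 1
    encode(2, 1, 1),   # pc 3: if r1 != 0 goto pc 1
    encode(3),         # pc 4: halt
]

cache = DICacheModel(num_entries=8)
regs = interpret(PROGRAM, cache)
print(regs[2], cache.misses, cache.hits)  # prints: 30 5 27
```

In this toy run the interpreter executes 32 dynamic instructions but calls `decode` only 5 times, once per distinct source PC; the remaining 27 fetches hit in the model. The ratio depends entirely on the assumed program and cache size and says nothing about the speedup reported in the abstract.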

Authors: Chen Wei (陈微), Wang Zhiying (王志英), Xiao Nong (肖侬), Shen Li (沈立), Lu Hongyi (陆洪毅)

Affiliation: College of Computer, National University of Defense Technology (国防科学技术大学计算机学院)

Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2011, No. 1, pp. 19-27 (9 pages)

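Funding: National Basic Research Program of China ("973" Program) (2007CB310901); National Natural Science Foundation of China (60803041); National High Technology Research and Development Program of China ("863" Program) (2009AA01Z101)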

Keywords: co-designed virtual machine; dynamic binary translation; interpretation; startup overhead

Classification: TP302 [Automation and Computer Technology: Computer System Architecture]
