期刊文献+

乱序超标量处理器核的性能分析与优化 被引量:1

Counters based performance analysis and optimization of an out-of-order superscalar processor core
下载PDF
导出
摘要 随着处理器微体系结构日益复杂,性能分析在处理器研制过程中的作用越来越重要。常用的性能分析方法是建立性能模型,该方法主要用于研制初期的设计空间探索,如果用于微体系结构级的分析和优化,速度和精度都会成为限制因素。因此,提出一种基于计数器的性能分析方法,该方法以项目组已经完成的一款处理器核的硬件实现代码为基础,在处理器核外部添加一个专用性能监测单元,收集微体系结构分析和优化需要的各种事件,并通过结果分析器对统计的事件进行分析,得到微体系结构实现的性能受限因素。采用此方法,在现场可编程门阵列原型系统上对SPEC CPU2000测试程序运行时的性能受限因素进行分析,并根据分析结果采取相应的优化措施,优化后的处理器核性能得到了明显提升。 ith the ever-increasing design complexity in the processor micro-architecture, performance analysis becomes more and more important in the research and design of processors. Performance models are used widely in the performance analysis, which are more suitable for the design space exploration in the early stage. When used in micro-architecture optimizations, the accuracy and the speed of performance models are the lim iting factors. Therefore, a performance analysis method based on counters was proposed. In this method, the RTL register transfer level code of a processor core was used as a baseline, and a specialized performance monitor unit was added to collect the events needed by the microarchitecture analysis and optimization. Then the collected events were sent to a result analyzer, where the factors affecting the performance were obtained. By a dopting the method, we analyzed what affects the performance in running SPEC CPU2000 benchmarks on FPGA ( fieldprogrammable gate array) prototyping, and optimized the micro-architecture of processor core according to the analysis results. The performance of the optimized processor core is improved obviously.
作者 孙彩霞 隋兵才 王蕾 王永文 黄立波 李文哲 王俊辉 SUN Caixia;SUI Bingcai;WANG Lei;WANG Yongwen;HUANG Libo;LI Wenzhe;WANG Junhui(College of Computer, National University of Defense Technology, Changsha 410073 , China)
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2016年第5期14-19,共6页 Journal of National University of Defense Technology
基金 国家自然科学基金资助项目(61103011 61170045 61402501)
关键词 性能分析 计数器 处理器核 微体系结构 performance analysis counters processor core micro-architecture
  • 相关文献

参考文献3

二级参考文献22

  • 1邓正宏,康慕宁,罗旻.超标量微处理器研究与应用[J].微电子学与计算机,2004,21(9):59-63. 被引量:4
  • 2Wei-WuHu Fu-XinZhang Zu-SongLi.Microarchitecture of the Godson-2 Processor[J].Journal of Computer Science & Technology,2005,20(2):243-249. 被引量:52
  • 3[1]Ghosh S,et al.Cache Miss Equations: A Compiler Framework for Analyzing and Tuning Memory Behavior.In ACM Transactions on Programming Languages and Systems,1999,21(4):702~745
  • 4[2]http://www.cs.wisc.edu/~mscalar/simplescalar.html
  • 5[3]Merten M C,et al.An Architectural Framework for Run-Time Optimization.IEEE Transactions on Computers,2001,50(6):567~589
  • 6[4]Lambert, et al.Profiling I/O Interrupts in Modern Architectures.In:8th Intl.Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems,San Francisco, California,2000
  • 7[5]Hirzel M, et al.Bursty Tracing: A Framework for Low-Overhead Temporal Profiling.In:4th Workshop on Feedback-Directed and Dynamic Optimization (FDDO), Dec.2001
  • 8[6]http://icl.cs.utk.edu/projects/papi/
  • 9[7]http://www.gz-juelich.de/zam/PCL/
  • 10[8]http://research.compaq.com/SRC/dcpi/

共引文献25

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部