期刊文献+

面向多核结构的自适应选择性指令主动推送技术 被引量:1

An Adaptive and Selective Instruction Active Push Technique for Multi-core Architecture
下载PDF
导出
摘要 正确有效的指令预取策略是避免指令缺失的关键技术,程序流程改变时指令预取方向正确率不高、指令预取准确度和存储器带宽有效利用率较低是导致指令缺失的主要因素.本文提出基于置信度评估的自适应选择性指令主动推送技术ASIAP,一方面减少无效指令预取的数量,进行精确指令预取,在避免Cache污染的同时提升指令预取的有效性;另一方面采用指令主动推送部件自适应选择性地完成部分非顺序指令预取请求,减少了取入错误路径上无用指令的可能.通过与Next_Line、Target_Line、Wrong_Path、BTA、Markov和CFGP等策略的对比,在2-16内核配置下,ASIAP策略相对于其它策略准确性平均提升3.7%-28.71%;L1 I-cache缺失率平均下降3.3%-14.39%. Correct and effective instruction prefetching strategy is key technique to avoid instruction miss.Unfortunately,correctness and accuracy of instruction prefetching is not very precise,and the utilization ratio of memory bandwidth is relative low,which lead to instruction miss.This paper proposes an adaptive and selective instruction active push technique for multi-core architecture,called ASIAP.On one hand,in order to avoid the cache pollution and increase the prefetching validity,we perform instruction prefetching precisely as far as possible;on the other hand,part of non-sequential type prefetching are responded preferentially by a specific instruction active push unit adaptively and selectively,which decreases the possibility of fetching the useless instructions in the wrong path.We evaluate ASIAP with other prefetching strategies such as Next_Line、Target_Line、Wrong_Path、BTA、Markov and CFGP.Simulation result indicates that,under the configuration of 2 cores to 16 cores,relative to other strategies,ASIAP improves prefetching accuracy by average 3.7%-28.71%,and reduces of L1 I-Cache miss rate by average 3.3%-14.39%.
出处 《小型微型计算机系统》 CSCD 北大核心 2013年第3期636-643,共8页 Journal of Chinese Computer Systems
基金 国家"核高基"科技重大专项课题项目(2009ZX01039-003-001-03 2009ZX01023-004)资助 国家自然科学(60905007)资助
关键词 片上多核处理器 存储系统 置信度评估 自适应 指令预取 主动推送 chip multi-core memory system confidence estimation adaptation instruction prefetching active push
  • 相关文献

参考文献2

二级参考文献15

  • 1R Colwell, R Nix, J O' Donnell, et al. A VLIW Architecture for a Trace Scheduling Compiler[ A ]. Proc of the 2nd Int' 1 Conf on Architectural Support for Programming Languages and Operating Systems[C]. 1987.180 - 192.
  • 2W Hwu, S Mahlke, W Chen, et al. The Superblock: An Effective Technique for VLIW and Superscalar Compilation[ J]. The Journal of Supercomputing, 1993,7:229 - 248.
  • 3C Xia, J Torrellas. Instruction Prefetching of Systems Codes with Layout Optimized for Reduced Cache Misses[ A] .23rd Annual Int' l Symp on Computer Architecture[ C]. 1996.
  • 4N Jouppi. Improving Direct-Mapped Cache Performance by the Addition of a Small Fully Associative Cache and Prefetch Buffers[ A].Proc of the 17th Annual Int'l Symp on Computer Architecture[ C].1990.
  • 5J Pieroe, T Mudge. Wrong-Path Instruction Prefetching[ A ]. 29th Int'l Symp on Microarchitecture[ C]. 1996.165 - 175.
  • 6D Joseph, D Grunwald. Prefetching Using Markov Predictors[ A ].24th Annual Int'l Symp on Computer Architecture[C]. 1990.
  • 7G Reinman, B Calder,T Austin. Fetch directed instruction prefetching[A] .In Proceedings of the 32nd Annual International Symposium on Microarchitecture [ C]. Haifa: IEEF. Computer Society Press, 1999.16- 27.
  • 8D Krfft. Lockup-free instruction fetch/prefetch cache organization.[A] .In 8th Annual lntermational Symposium of Computer Arehiteeture[C]. Minneapolis: IEEE, Congsater Society Press, 1981.81 - 87.
  • 9K Faxkas, N Jouppi. Complexity/performance trdeoffs with non-blocking loads [A]. In 21st Annual International Symposium on Computer Architecture [ C ]. Chicago: IEEE Computer Society Press, 1994.211 -222.
  • 10J Pierce, T Mudge. Wrong-path instruction prefetching [ A ]. The 29th International Symposium on Microarchitecture [ C]. Paris:IEEE. Computer Society Press, 1996.165 - 175.

共引文献9

同被引文献14

引证文献1

二级引证文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部