
Non-polluting Cache Access Control Technique

Non-polluting cache accessing technique
Abstract Rapid progress in semiconductor fabrication has opened up ample room for integrated circuit design, but design capability has advanced more slowly, which makes it difficult to use on-chip resources efficiently. In current high-performance processors the on-chip cache typically occupies more than half of the total die area, so using that cache space efficiently and intelligently to build a high-performance memory system is an important topic in processor microarchitecture research. This paper analyzes the impact of cache data pollution and speculative execution on processor performance and, on that basis, proposes Pease, a non-polluting cache access control technique based on splitting the valid bit of the data tag. The single valid bit in the D-Cache tag is extended into two bits, a read valid bit (RVB) and a write valid bit (WVB), and read and write accesses are controlled according to the combination of RVB and WVB values. Pease thereby preserves the data prefetching effect of speculative execution while making polluting data transparent: when no empty cache line is available, new data can be written into the D-Cache directly, without a replacement operation on the polluting data, which eliminates the effect of polluting data on cache efficiency. Simulation results show that, relative to the baseline architecture, Pease improves IPC by 1.05% to 8.40% (4.04% on average) and reduces the L1 D-Cache miss rate by 19.05% to 48.16% (29.66% on average).
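The split valid bit idea can be illustrated with a small model of one cache set. This is only a sketch under stated assumptions, not the paper's design: the abstract does not say how the RVB/WVB combinations are encoded or when a line changes state. The sketch assumes that a wrong-path (speculative) fill sets RVB=1 and WVB=0, that a later correct-path read hit promotes the line to WVB=1, and that LRU is the baseline replacement policy. The names `Line`, `SplitValidSet`, `read`, and `fill` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Line:
    tag: Optional[int] = None
    rvb: bool = False   # read valid bit: the line may serve read hits
    wvb: bool = False   # write valid bit: the line counts as occupied for fills
    last_use: int = 0   # timestamp used for LRU


class SplitValidSet:
    """One set of a set-associative cache with split valid bits.

    Assumed encoding (not given in the abstract):
      RVB=0, WVB=0  empty line
      RVB=1, WVB=0  speculative data: readable, but overwritable for free
      RVB=1, WVB=1  ordinary valid line
    """

    def __init__(self, ways: int) -> None:
        self.lines = [Line() for _ in range(ways)]
        self.clock = 0
        self.evictions = 0  # replacements of ordinary valid lines

    def read(self, tag: int) -> bool:
        """Correct-path read. Returns True on a hit."""
        self.clock += 1
        for line in self.lines:
            if line.rvb and line.tag == tag:
                line.last_use = self.clock
                line.wvb = True  # assumption: a useful line is promoted
                return True
        return False

    def fill(self, tag: int, speculative: bool) -> None:
        """Install a line after a miss."""
        self.clock += 1
        free = [line for line in self.lines if not line.wvb]
        if free:
            # Prefer a truly empty line (RVB=0) over speculative data (RVB=1).
            victim = min(free, key=lambda line: line.rvb)
        else:
            # Every line is an ordinary valid line: fall back to LRU.
            victim = min(self.lines, key=lambda line: line.last_use)
            self.evictions += 1
        victim.tag = tag
        victim.rvb = True
        victim.wvb = not speculative
        victim.last_use = self.clock


if __name__ == "__main__":
    s = SplitValidSet(ways=2)
    s.fill(0xA, speculative=False)
    s.fill(0xB, speculative=True)   # wrong-path data, kept readable
    s.fill(0xC, speculative=False)  # overwrites 0xB without an eviction
    print(s.read(0xA), s.read(0xB), s.read(0xC), s.evictions)
```

In this model, speculative data never displaces an ordinary valid line's priority: it occupies a slot that remains free from the point of view of later fills, yet it can still produce a read hit if the correct path turns out to need it.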
Source Computer Engineering and Applications (《计算机工程与应用》), CSCD, 2013, No. 10, pp. 5-9 (5 pages)
Funding Central Universities Research Fund Project (No. CHD2011JC055); Central Universities Research Fund Project (No. CHD2011JC089)
Keywords valid bit; splitting; cache; pollution
