无污染Cache访问控制技术

Non-polluting cache accessing technique

下载PDF

导出

摘要制造工艺的快速进步给集成电路设计提供了广阔的空间,而发展较慢的设计能力导致难以对片上资源高效利用。目前,高性能处理器片上Cache普遍占到芯片总面积的一半以上,而如何高效、智能地利用片上Cache空间,构建高性能存储系统是处理器微体系结构研究的重要内容。分析了Cache数据污染和猜测执行对处理器性能的影响,并在此基础上提出一种基于数据Tag有效位分裂的无污染Cache访问控制技术—Pease,将原先D-CacheTag中的一位数据有效位扩展为读数据有效位（RVB）和写数据有效位（WVB）两位,根据RVB和WVB值的不同组合对数据读写访问进行控制。不但充分保留了猜测执行的数据预取性,使污染数据透明化,写入数据时无需对污染数据进行替换操作,消除了污染数据对Cache效率的影响。Pease技术相对于baseline结构来说,IPC的提升幅度为1.05%~8.40%,平均提升4.04%;L1 D-Cache缺失率降低幅度为19.05%~48.16%,平均降低29.66%。 Rapid progress of semiconductor fabrication provid-es capacious space for IC designs, but unfortunately, the slow development of design ability makes it difficult to utilize the on-chip resource efficiently. At present, more than half of die area of modern microprocessor is inhabited by cache. So, how to make use of cache space smartly and efficiently, and construct high performance memory system has become one of the most important content in processor architecture design. This paper analyses the impacts of cache data pollution and speculative execution to processor performance, and proposes a non-polluting cache accessing technique based on data tag valid-bit splitting, which is called Pease. The valid-bit in D-Cache tag is splited into two bits, read data valid bi（tRVB）and write data valid bi（tWVB）. According the different RVB and WVB combinations, corresponding accessing strategies to D-Cache are applied. As a result, Pease technique not only preserves the prefetch ability of speculative execution, but also makes the cache polluting data transparent, which means that, in no empty cache line situation, consequent data can be written into D-Cache directly, but without need to perform cache replacement operation. In other word, Pease technique makes polluting data totally harmless to D-cache. Simulation result indicates that, relative to the baseline architecture, Pease technique improves IPC from 1.05% to 8.40% , averagely 4.04% , and reduces miss rate of D-Cache from 19.05% to 48.16% averagely 29.66%.

作者刘松鹤宋焕生亓淑敏李文敏

机构地区长安大学信息工程学院

出处《计算机工程与应用》 CSCD 2013年第10期5-9,共5页 Computer Engineering and Applications

基金中央高校科研基金项目(No.CHD2011JC055) 中央高校科研基金项目(No.CHD2011JC089)

关键词有效位分裂 CACHE 污染 valid bit splitting Cache pollution

分类号 TP303 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献14

1International technology roadmap for semiconductors[EB/OL]. [2012-09-07].http ://www.itrs.net.
2Jacobsen E,Rotenberg E, Smith J E.Assigning confidence to conditional branch predictions[C]//29th Annual International Symposium on Microarchitecture.Paris, France: ACM, IEEE, 1996: 142-152.
3Grunwald D, Klauser A, Manne S, et al.Confidence esimation for speculation control[C]//25th International Symposium on Computer Architecture.Barcelona, Spain: ACM, IEEE, 1998 : 122-131.
4Reungsang P, Park S K, Jeong S W, et al.Reducing cache pollution of prefetching in a small data cache[C]//Proceedings of the International Conference on Computer Design:VLSI in Computers & Processors.Washington, DC, USA: IEEE Computer Society,2001 : 530-533.
5Sendag R,Lilja D J,Kunkel S R.Exploiting the prefetching effect provided by executing mispredicted load instructions[C]// Proceedings of the 8th International Euro-Par Conference on Parallel Processing.London, UK: Springer-Verlag, 2002 : 468-480.
6Chen Y, Sendag R,Lilja D J.Using incorrect speculation to prefetch data in a concurrent multithreaded processor[C//Pro- ceedings of the International Parallel and Distributed Pro- cessing Symposium.Washington, DC, USA: IEEE Computer Society, 2003.
7Armstrong D N, Kim H, Mutlu O, Patt Y N.Wrong path events: exploiting unusual and illegal program behavior for early mispredietion detection and recovery[C]//Proceedings of the 37th annual IEEE/ACM International Symposium on Micro- architecture.Washington, DC, USA: IEEE Computer Society, 2004: 119-128.
8Sendag R, Yilmazer A, Yi J J, et al.The impact of wrong-path memory references in cache-coherent multiprocessor systems[J]. J Parallel Distrib Comput,2007(67) : 1256-1269.
9李瑛.同时多线程结构指令流特性及取指技术研究[D].西安:西北工业大学,2004.
10Henessy J L,Patterson D A.Computer architecture-a quanti- tative approach[M].4th ed.[S.1.]: Morgan Kaufmann Publishers Inc, 2007.

二级参考文献21

1R Colwell, R Nix, J O' Donnell, et al. A VLIW Architecture for a Trace Scheduling Compiler[ A ]. Proc of the 2nd Int' 1 Conf on Architectural Support for Programming Languages and Operating Systems[C]. 1987.180 - 192.
2W Hwu, S Mahlke, W Chen, et al. The Superblock: An Effective Technique for VLIW and Superscalar Compilation[ J]. The Journal of Supercomputing, 1993,7:229 - 248.
3C Xia, J Torrellas. Instruction Prefetching of Systems Codes with Layout Optimized for Reduced Cache Misses[ A] .23rd Annual Int' l Symp on Computer Architecture[ C]. 1996.
4N Jouppi. Improving Direct-Mapped Cache Performance by the Addition of a Small Fully Associative Cache and Prefetch Buffers[ A].Proc of the 17th Annual Int'l Symp on Computer Architecture[ C].1990.
5J Pieroe, T Mudge. Wrong-Path Instruction Prefetching[ A ]. 29th Int'l Symp on Microarchitecture[ C]. 1996.165 - 175.
6D Joseph, D Grunwald. Prefetching Using Markov Predictors[ A ].24th Annual Int'l Symp on Computer Architecture[C]. 1990.
7G Reinman, B Calder,T Austin. Fetch directed instruction prefetching[A] .In Proceedings of the 32nd Annual International Symposium on Microarchitecture [ C]. Haifa: IEEF. Computer Society Press, 1999.16- 27.
8D Krfft. Lockup-free instruction fetch/prefetch cache organization.[A] .In 8th Annual lntermational Symposium of Computer Arehiteeture[C]. Minneapolis: IEEE, Congsater Society Press, 1981.81 - 87.
9K Faxkas, N Jouppi. Complexity/performance trdeoffs with non-blocking loads [A]. In 21st Annual International Symposium on Computer Architecture [ C ]. Chicago: IEEE Computer Society Press, 1994.211 -222.
10J Pierce, T Mudge. Wrong-path instruction prefetching [ A ]. The 29th International Symposium on Microarchitecture [ C]. Paris:IEEE. Computer Society Press, 1996.165 - 175.

共引文献17

1黄彩霞,杨凤年.“计算机系统结构”实验课程改革和实践[J].计算机教育,2008(14):156-159. 被引量：1
2时磊,逄珺,杨磊,张铁军,王东辉.M5模拟器的内核分析及应用[J].微计算机应用,2009,30(4):30-34. 被引量：3
3扈啸,陈书明.面向指令Cache周期预取的代码排布方法[J].计算机研究与发展,2009,46(5):747-755.
4郭建军,戴葵,王志英.同步数据触发体系结构中指令预取技术研究[J].计算机工程与科学,2009,31(8):77-81.
5叶新栋,唐志强,涂时亮.Perceptron-Based分支预测SimpleScalar中的实现[J].计算机系统应用,2010,19(1):51-54.
6喻明艳,张祥建,杨兵.基于跳跃访问控制的低功耗分支目标缓冲器设计[J].计算机辅助设计与图形学学报,2010,22(4):695-702. 被引量：4
7杨惠,孙永节.高性能低功耗FT-XDSP的指令缓存队列[J].小型微型计算机系统,2010,31(7):1350-1354.
8吴俊杰,杨学军.面向非一致Cache的任意步长预提升技术[J].计算机科学与探索,2010,4(7):577-588. 被引量：4
9肖泽强.动态分支预测技术分析与量化研究[J].信息技术,2011,35(3):80-82. 被引量：1
10李慧敏,匡旺秋,马英英,谭琰.基于SimpleScalar的多媒体应用程序特征分析[J].计算机系统应用,2011,20(10):239-242.

1李琳.无线传感器网络路由算法仿真模型的研究[J].电脑开发与应用,2014,27(4):27-29. 被引量：1
2杨波,姜建国,胡波,刘坤颖.试析云取证面临的挑战[J].保密科学技术,2014(7):46-49.
3孙宇清.JAVA多线程编程中线程的实现与控制[J].内蒙古电大学刊,2005(12):40-43.
4文武.数据仓库如何消除数据污染?[J].中国计算机用户,1996(3):16-17. 被引量：2
5范才元.刍议视频监控系统的发展[J].网络安全技术与应用,2014(8):231-231.
6高帧率=低延迟？[J].A&S（安防工程商）,2014(4):12-12.
7激淡.主板中的怪兽——技嘉Z77X-UP7主板评测[J].微电脑世界,2012(12):36-36.
8LIKE.Word编辑技巧[J].电子制作．电脑维护与应用,2004(7):54-54.
9曹永冬.驯服“替换”通配符[J].电脑时空,2010(5):105-105.
10白学清,曹军梅,杨东风.正确理解VisualFoxPro数据的比较与替换[J].农业网络信息,2008(8):136-137.

计算机工程与应用

2013年第10期

浏览历史

内容加载中请稍等...

无污染Cache访问控制技术

参考文献14

二级参考文献21

共引文献17

相关作者

相关机构

相关主题

浏览历史