Low power instruction cache based on adjacent line linking access (基于邻行链接访问的低功耗指令高速缓存)

Cited by: 1

Abstract: By analyzing the locality of cache accesses, this paper proposes a low-power instruction cache access method that links the currently accessed cache line with several of its immediately adjacent lines. When a relative jump (a direct jump between cache lines) occurs, the link information between adjacent lines is reused to obtain the exact way of the target line, so accesses to the tag array are reduced and dynamic power consumption falls. When a cache line is replaced, only the link information of the few cache lines adjacent to the replaced line needs to be checked and cleared, which keeps the links correct. Experimental results show that, compared with a way-memoization instruction cache, a cache using this method reduces dynamic power consumption by 6% on average.
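The mechanism described in the abstract can be illustrated with a small software model. The following is only a sketch under stated assumptions, not the authors' hardware design: the class and field names (`LinkedICache`, `Line`, `reach`), the round-robin replacement policy, and the choice to record links only from the previously fetched line to the current one are all hypothetical. It assumes `reach` is smaller than the number of sets, and it counts tag lookups rather than modelling power.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Line:
    tag: Optional[int] = None                            # None marks an invalid line
    links: Dict[int, int] = field(default_factory=dict)  # offset d -> way holding line (this + d)


class LinkedICache:
    """Toy set-associative I-cache whose lines remember the ways of nearby lines."""

    def __init__(self, n_sets: int = 8, n_ways: int = 2, reach: int = 2) -> None:
        self.n_sets, self.n_ways, self.reach = n_sets, n_ways, reach
        self.lines: List[List[Line]] = [
            [Line() for _ in range(n_ways)] for _ in range(n_sets)
        ]
        self.victim = [0] * n_sets                   # round-robin replacement (assumed policy)
        self.prev: Optional[Tuple[int, int]] = None  # (line address, way) of the last fetch
        self.tag_lookups = 0
        self.link_hits = 0

    def _fill(self, set_idx: int, tag: int) -> int:
        way = self.victim[set_idx]
        self.victim[set_idx] = (way + 1) % self.n_ways
        # Replacement: only lines within `reach` of the victim can link to it,
        # so only those sets are checked and their stale links cleared.
        for d in range(-self.reach, self.reach + 1):
            if d == 0:
                continue
            for neighbour in self.lines[(set_idx - d) % self.n_sets]:
                if neighbour.links.get(d) == way:
                    del neighbour.links[d]
        self.lines[set_idx][way] = Line(tag=tag)     # the new line starts with no links
        return way

    def fetch(self, line_addr: int) -> int:
        """Return the way that holds `line_addr`, filling the cache on a miss."""
        set_idx, tag = line_addr % self.n_sets, line_addr // self.n_sets
        way: Optional[int] = None
        source: Optional[Line] = None
        d = 0
        if self.prev is not None:
            prev_addr, prev_way = self.prev
            d = line_addr - prev_addr
            if d == 0:
                way = prev_way                       # same line: the way is already known
            elif abs(d) <= self.reach:
                source = self.lines[prev_addr % self.n_sets][prev_way]
                way = source.links.get(d)            # reuse the link if one was recorded
        if way is None:
            self.tag_lookups += 1                    # fall back to a normal tag comparison
            ways = self.lines[set_idx]
            way = next((w for w, ln in enumerate(ways) if ln.tag == tag), None)
            if way is None:
                way = self._fill(set_idx, tag)
            if source is not None:
                source.links[d] = way                # remember the way for the next jump
        elif d != 0:
            self.link_hits += 1
        self.prev = (line_addr, way)
        return way
```

In this toy model, a loop over line addresses 0, 1, 2 repeated ten times performs 4 tag lookups for 30 fetches; the remaining 26 fetches take their way from a link. Clearing links in `_fill` is what keeps a later fetch from following a link to a way that now holds a different line.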

Authors: Xiang Xiaoyan (项晓燕), Chen Zhijian (陈志坚), Meng Jianyi (孟建熠), Yan Xiaolang (严晓浪)

Affiliation: Institute of VLSI Design, Zhejiang University (浙江大学超大规模集成电路设计研究所)

Source: Journal of Zhejiang University: Engineering Science (《浙江大学学报（工学版）》), 2013, No. 7, pp. 1213-1217 (5 pages). Indexed in: EI, CAS, CSCD, PKU Core (北大核心).

Funding: Fundamental Research Funds for the Central Universities (2012QNA5004)

Keywords: instruction cache; low power; adjacent cache line linking access

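Classification: TN332 [Electronics and Telecommunications: Physical Electronics]; TN47 [Electronics and Telecommunications: Microelectronics and Solid-State Electronics]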
