期刊文献+

三维众核片上处理器存储架构研究 被引量:3

The study of memory architectures for 3D chip multi-processors
下载PDF
导出
摘要 三维众核片上处理器的研究近年来逐渐引起了学术界的广泛关注.三维集成电路技术可以支持将不同工艺的存储器层集成到一颗芯片上,三维众核片上处理器可以集成更大的片上缓存以及主存储器.研究三维众核片上处理器存储架构,探索了集成SRAM L2cache层,DRAM主存储器层等,对三维众核片上处理器性能的影响.从仿真结果可知,相比集成1层L2cache,集成2层L2cache的三维众核片上处理器性能最大提高了55%,平均提高34%.将DRAM主存储器集成到片上最大可以提高三维众核片上处理器80%的系统性能,平均改善34.2%. The purpose of this paper is to study the performance improvement of memory architectures for three-di-mensional chip multi-processors(3D CMPs).As CMPs integrate more and more cores,a great deal of data access pressure is placed on the memory subsystem.Designers face the challenges of feeding enough data to a massive number of on-die cores for CMPs.Three-dimensional integrated circuits(3D ICs)can stack memories of different process technologies into the same chip.The stacking memory bandwidth can be enlarged by using fine-pitch through-silicon vias(TSVs),which can mitigate the pressure on the I/O infrastructure for CMPs.In this paper,we start with studying the potential benefit of 3D integration and the recent advantages on the research of memory ar-chitectures for 3D CMPs.Bothe large caches and main memories can be stacked in 3D CMPs.Hence,we focus on the memory architectures for 3D CMPs in two aspects,stacking cache architecture and stacking main memory architecture.3D CMPs can integrate much larger L2 caches compared to their 2D counterparts in the same area footprint.Meanwhile,the L2 caches can be several layers.We firstly explore the performance improvements of stacking SRAM L2 cache layers atop processor layers for 3D CMPs.The experimental results show that the 3D CMPs with 2 L2 cache layers can improve the performance up to 55% and 34% on average compared to that of 3D CMPs with 1 L2 cache layer.3D CMPs provide opportunities for composing future systems by integrating disparate technologies memories.The off-chip DRAM main memories can be stacked on the processor layers.We secondly study the performance benefit of integrating DRAM main memories into 3D CMPs.The experiment results show that stacking DRAM main memories can provide up to 80% and on average 34.2% performance improvement for 3D CMPs compared to the 2D CMPs with off-chip DRAM main memory.Our analysis and experimental results give a guideline to design efficient 3D CMPs with stacking SRAM L2 caches and DRAM main memories.
出处 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2014年第3期330-335,共6页 Journal of Nanjing University(Natural Science)
基金 国家自然科学基金面上项目(61176024) 高等学校博士学科点专项科研基金(20120091110029)
  • 相关文献

参考文献16

  • 1Wulf W, McKee S. Hitting the memory wall: Implications of the obvious. ACM SIGARCH Computer Architecture News, 1995,23 (1) : 20 - 24.
  • 2Topol A W, La Tulipe D C, Shi Jr L, et al. Three-di- mensional integrated circuits. IBM Journal of Research and Development, 2006,50 (4, 5 ) : 491- 506.
  • 3Davis W R,Wilson J, Mick S, et al. Demystifying 3D ICS: The pros and cons of going vertical. IEEE Design and Test of Computers, 2005, 22 (6) :498-510.
  • 4Ieong M,Guarini K W,Chan V,et al. Three di- mensional CMOS devices and integrated circuits. In: Proceedings of the IEEE 2003 Custom Integrated Circuits Conference,2003:207-213.
  • 5Feero B S, Pande P P. Networks-on-chip in a three-dimensional environment: A performance evaluation. IEEE Transactions on Computers, 2009,58(1) : 32-45.
  • 6Van der Plas G, Limaye P, Loi I, et al. Design issues and considerations for low-cost 3-D TSV IC technology. IEEE Journal of Solid-State Circuits, 2011,46 ( 1 ) : 293 - 307.
  • 7Millberg M, Nilsson E,Thid R,et al. The nostrum backbone--A communication protocol stack for networks on chip. In: Proceedings of the 17th International Conference on VLSI Design, 2004: 693-696.
  • 8Axel Jantsch, Hannu Tenhunen. Networks on chip. US: Kluwer Academic Publishers, 2003,303.
  • 9Xu T, Liljeberg P, Tenhunen H. Exploring DRAM last level cache for 3D network-on-chip Architecture. Advanced Materials Research, 2010, 403:4009-4018.
  • 10Meng J, Kawakami K, Coskun A. Optimizing energy efficiency of 3-D multicore systems with stacked DRAM under power and thermal constraints. In: Proceedings of the 49th ACM/EDAC/IEEE Design Automation Conference, 2012 : 648 - 655.

同被引文献21

  • 1申抒含,金炜东.多进制概率角复合位编码量子进化算法[J].模式识别与人工智能,2005,18(6):657-663. 被引量:9
  • 2宋晓峰,亢金龙,王宏.进化算法的发展与应用[J].现代电子技术,2006,29(20):66-68. 被引量:4
  • 3R Egawa, J Tada, H Kobayashi, et al. Evaluation of fine grain 3D integrated arithmetic units. In IEEE In- ternational 3D System Integration Conference. San Francisco, CA : IEEE, 2009 : 1-8.
  • 4Xie Y. Processor architecture design using 3D integra tion technology[C]//VLSI Design, 2010. VLSID'10 23rd International Conference on. India: IEEE, 2010 :446-451.
  • 5Puttaswamy K, Loh G H. Implementing caches in a 3D technology for high performanee processors[C]// Computer Design: VLSI in Computers and Processors, 2005. ICCD 2005. Proceedings. 2005 IEEE Interna- tional Conference on. South Korea: IEEE, 2005: 525-532.
  • 6Hsieh H C, Sun Y F, Yeh J C, et al. 3-D stacked memory system architecture exploration by esl virtual platform and reconfigurable stacking memory architec- ture in 3D-DSP SoC system[C]/// Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE Interna- tional Conference on. Italy:IEEE, 2014:5012-5016.
  • 7Azarkhish E, Loi I, Benini L. A high-performance multiported L2 memory IP for scalable three-dimen- sional integration[C]// 3D Systems Integration Con- ference (3DIC), 2013 IEEE International. San Fran- cisco,California: IEEE, 2013: 1-8.
  • 8Lefter M, Voicu G R, Cotofana S D. A shared poly- hedral cache for 3D wide-I/O multi-core computing platforms[C]// Circuits and Systems (ISCAS), 2015 IEEE International Symposium on. Portuqal: IEEE, 2015 :425-428.
  • 9Guthmuller E, Miro-Panades I, Greiner A. Architectural exploration of a fine-grained 3D cache for high performance in a manycore comext[C]// Very Large Scale Integration (VLSI-SoC), 2013 IFIP/IEEE 21st International Confer- ence on. Istanbul:IEEE, 20131302-307.
  • 10Xie Y. Future memory and interconnect technologies [C]// Proceedings of the Conference on Design, Auto- mation and Test in Europe EDA Consortium. San Jose: IEEE, 2013:964-969.

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部