期刊文献+

The Value of a Small Microkernel for Dreamy Memory and the RAMpage Memory Hierarchy

The Value of a Small Microkernel for Dreamy Memory and the RAMpage Memory Hierarchy
原文传递
导出
摘要 This paper explores potential for the RAMpage memory hierarchy to use a microkernel with a small memory footprint, in a specialized cache-speed static RAM (tightly-coupled memory, TCM). Dreamy memory is DRAM kept in low-power mode, unless referenced. Simulations show that a small microkernel suits RAMpage well, in that it achieves significantly better speed and energy gains than a standard hierarchy from adding TCM. RAMpage, in its best 128KB L2 case, gained 11% speed using TCM, and reduced energy 14%. Equivalent conventional hierarchy gains were under 1%. While 1MB L2 was significantly faster against lower-energy cases for the smaller L2, the larger SRAM's energy does not justify the speed gain. Using a 128KB L2 cache in a conventional architecture resulted in a best-case overall run time of 2.58s, compared with the best dreamy mode run time (RAMpage without context switches on misses) of 3.34s, a speed penalty of 29%. Energy in the fastest 128KB L2 case was 2.18J vs. 1.50J, a reduction of 31%. The same RAMpage configuration without dreamy mode took 2.83s as simulated, and used 2.393, an acceptable trade-off (penalty under 10%) for being able to switch easily to a lower-energy mode. This paper explores potential for the RAMpage memory hierarchy to use a microkernel with a small memory footprint, in a specialized cache-speed static RAM (tightly-coupled memory, TCM). Dreamy memory is DRAM kept in low-power mode, unless referenced. Simulations show that a small microkernel suits RAMpage well, in that it achieves significantly better speed and energy gains than a standard hierarchy from adding TCM. RAMpage, in its best 128KB L2 case, gained 11% speed using TCM, and reduced energy 14%. Equivalent conventional hierarchy gains were under 1%. While 1MB L2 was significantly faster against lower-energy cases for the smaller L2, the larger SRAM's energy does not justify the speed gain. Using a 128KB L2 cache in a conventional architecture resulted in a best-case overall run time of 2.58s, compared with the best dreamy mode run time (RAMpage without context switches on misses) of 3.34s, a speed penalty of 29%. Energy in the fastest 128KB L2 case was 2.18J vs. 1.50J, a reduction of 31%. The same RAMpage configuration without dreamy mode took 2.83s as simulated, and used 2.393, an acceptable trade-off (penalty under 10%) for being able to switch easily to a lower-energy mode.
机构地区 School of ITEE
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2005年第5期586-595,共10页 计算机科学技术学报(英文版)
基金 昆士兰州大学校科研和教改项目
关键词 low-power design main memory virtual memory cache memories microkernels low-power design, main memory, virtual memory, cache memories, microkernels
  • 相关文献

参考文献29

  • 1Machanick P, Salverda P, Pompe L. Hardware-software tradeoffs in a Direct Rambus implementation of the RAMpage memory hierarchy. In Proc. 8th Int. Conf. Architectural Support for Programming Languages and Operating Systems(ASPLOS-VIII), San Jose, CA, October 1998, pp.105-114.
  • 2Machanick P. Scalability of the RAMpage memory hierarchy.South African Computer Journal, August 2000, (25): 68-73.
  • 3Machanick P. Initialexperiences with dreamy memory and the RAMpage memory hierarchy. In Proc. Ninth Asia-Pacific Computer Systems Architecture Conf., Beijing, September 2004, pp.146-159.
  • 4ARM. The ARM11 Microprocessor and ARM PrimeXsys Platform. ARM, October 2002. http://www.arm.com/pdfs/ARM11%20Core%20& %20Plat form%20Whitepaper.pdf.
  • 5Micron Technology. 256MB: ×4, ×8, ×16 DDR SDRAM,December 2003, Data Sheet. http://download.micron.com/pd f/datasheets/dr am/ddr/256M ×4 ×8 ×16DDR.pdf.
  • 6Machanick P. The case for SRAM main memory. Computer Architecture News, December 1996, 24(5): 23-30.
  • 7Wulf W A, McKee S A. Hitting tile memory wall: Implications of the obvious. Computer Architecture News, March 1995, 23(1): 20-24.
  • 8Johnson E E. Graffiti on the memory wall. Computer Architecture News, September 1995, 23(4): 7-8.
  • 9Jochen Liedtke. Toward real microl~ernels. Commun. ACM,1996, 39(9): 70-77.
  • 10Uwe Dannowski, Kevin Elphinstone, Jochen Liedtke et al.The L4Ka vision. Technical report, University of Karlsruhe,System Architecture Group, April 2001. http://i30www.ira. uka.de/research/documents/14ka/L4Ka.pdf.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部