期刊文献+

分片式流处理器上存储系统的设计与实现 被引量:1

Design and Implementation of Memory System for Tiled Stream Processor
下载PDF
导出
摘要 针对"存储墙"问题,从提高片外带宽使用率的角度出发,为分片式流处理器设计实现数据并行存储系统。该存储系统通过多级调度能有效减少片外访存的次数,降低片外带宽的需求。软件模拟和仿真验证的结果表明,在不同工作负载特征下,通过设计参数的优化选择,该设计能够充分挖掘存储访问的行局部性和体间并行性,从而提高带宽的使用效率。 Aiming at the consideration of improving the utilizing efficiency of off-chip bandwidth and resolving "Memory Wall" problem, this paper designs a Data-Parallel Memory System(DPMS) for the tiled stream processor in current project. This memory system can reduce the time costs of off-chip memory access and meet the needs of off-chip bandwidth. Results of software simulation and emulation verification indicate that for different workloads, the design can fully capture the row-locality and bank-parallelism of memory access, by optimizing the configuration parameters and further boost the utilizing efficiency of DRAM bandwidth.
出处 《计算机工程》 CAS CSCD 北大核心 2010年第11期217-220,共4页 Computer Engineering
基金 国家自然科学基金资助重点项目(60633040 60736012) 国家"973"计划基金资助项目(2005CB321601) 国家"863"计划基金资助项目(2006AA01A102 2009AA01Z106) 教育部-英特尔信息技术专项科研基金资助项目(MOE-INTEL-08-07)
关键词 分片式流处理器 数据并行存储系统 片外带宽 tiled stream processor Data-Parallel Memory System(DPMS) off-chip bandwidth
  • 相关文献

参考文献4

  • 1Rixner S, Dally W J, Kapasi U J, et al. A Bandwidth Efficient Architecture for Media Processing[C]//Proc. of the 31st Annual IEEE/ACM International Symposium on Micro-architecture. Dallas, USA: [s. n.], 1998: 3-13.
  • 2Ahn J H, Erez M, Dally W J. The Design Space of Data-parallel Memory Systems[C]//Proc. of ACM/IEEE Conference on Supercomputing. Tampa, Florida, USA: [s. n.], 2006.
  • 3Micron Technology Inc. Synchronous DRAM[EB/OL]. (2003-02- 06). http://download.micron.com/pdf/datasheets/dram/sdram/64 MSDRAMx32.pdf.
  • 4Altera Corporation. SDRAM Controller White Paper[EB/OL]. (2002-04-06). http://www.altera.eom.en/technology/memory/dram/ sdr/mem-sdr sdram.html.

同被引文献4

  • 1Zhou Ping, Zhao Bo, Yang Jun, et al. A Durable and Energy Efficient Main Memory Using Phase Change Memory Technology[C] //Proc. of the 36th International Symposium on Computer Architecture. Austin, Texas, USA: [s. n.] , 2009.
  • 2Dong Xiangyu, Jouppi N P, Xie Yuan, et al. PCRAMsim: System- level Performance, Energy, and Area Modeling for Phase-change RAM[C] //Proc. of the International Conference on Computer- Aided Design. San Jose, USA: [s. n.] , 2009.
  • 3Process Integration, Devices, and Structures[Z]. ITRS, 2007.
  • 4Rixner S, Dally W J, Kapasi U J, et al. Memory Access Sche- duling[C] //Proc. of the 27th Annual International Symposium on Computer Architecture. Vancouver, Canada: [s. n.] , 2000.

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部