Performance-Optimal Shared Cache Partitioning for Dual-Core Processors (cited by: 4)

A Performance-oriented Runtime Mechanism for Shared Cache Partition of Dual-core Processor
Abstract: Using a dual-core processor simulator, this paper evaluates the effect of IPC-CP on the throughput, weighted speedup, and fairness of 12 multi-programmed workloads. The results show that cache partitioning that targets optimal IPC outperforms cache partitioning that targets the lowest miss rate on all three metrics. In throughput, IPC-CP outperforms miss-rate-optimal cache partitioning (UCP) by up to 54% and by 27% on average.
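Authors: Suo Guang (所光), Yang Xuejun (杨学军)
Source: Microelectronics & Computer (《微电子学与计算机》), indexed in CSCD and the Peking University Core Journals list, 2008, No. 9, pp. 28-30, 33 (4 pages)
Funding: National Natural Science Foundation of China (60621003, 60603081); National "863" Program of China (2007AA12Z147, 2007AA01Z102)
Keywords: chip multi-processor; shared Cache partition; IPC-CP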
References (7)

  • 1 Sinharoy B, Kalla R N, Tendler J M, et al. Power5 system microarchitecture[J]. IBM J. Res. Dev., 2005, 49(4/5): 505-521.
  • 2 Kongetira P, Aingaran K, Olukotun K. Niagara: a 32-way multithreaded Sparc processor[J]. IEEE Micro, 2005, 25(2): 21-29.
  • 3 Stone H S, Turek J, Wolf J L. Optimal partitioning of cache memory[J]. IEEE Trans. Comput., 1992, 41(9): 1054-1068.
  • 4 Guang S, Xuejun Y, Guanghui L, et al. IPC-based cache partitioning: an IPC-oriented dynamic shared cache partitioning mechanism[C]//International Conference on Convergence and Hybrid Information Technology (ICHIT 2008). Busan, Korea, 2008.
  • 5 Suh G E, Rudolph L, Devadas S. Dynamic partitioning of shared cache memory[J]. J. Supercomput., 2004, 28(1): 7-26.
  • 6 Dybdahl H, Stenström P, Natvig L. A cache-partitioning aware replacement policy for chip multiprocessors[J]. High Performance Computing - HiPC 2006, 2006(12): 22-34.
  • 7 Qureshi M K, Patt Y N. Utility-based cache partitioning: a low-overhead, high-performance, runtime mechanism to partition shared caches[C]//IEEE Computer Society. Austin: Texas University, 2006: 423-432.

Citing documents: 4

Second-level citing documents: 4
