A last-write-touch prediction scheme used to reduce remote Cache miss latency

Cited by: 1
Abstract: To reduce remote Cache access latency and improve the performance of shared-memory systems, a new last-write-touch prediction scheme that exploits the inherent write-burst behavior of programs is proposed, and an existing directory protocol is adapted to support it. By predicting the last write touch to a Cache block and self-downgrading the block in advance, a processor that later reads the block can obtain the data directly from main memory, which saves one network hop per remote Cache access. Compared with the existing instruction-based prediction technique, the scheme greatly reduces storage overhead. Results on benchmark programs show an average prediction accuracy of 83.1% and an average improvement of 8.57% in application execution performance. Relative to the instruction-based scheme, the storage overhead of the history table falls by 69% and that of the signature table by 36%.
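The abstract does not spell out the predictor's internals, so the following is only a minimal sketch of the general idea, under stated assumptions: a block's write bursts are assumed to repeat with the same length, a burst is assumed to end when another processor reads the block, and a correct prediction is counted as one saved network hop. The names `BurstPredictor`, `on_write`, `on_remote_read`, and `count_saved_hops` are hypothetical, and the two dictionaries are simple stand-ins, not the paper's history or signature tables.

```python
from dataclasses import dataclass, field


@dataclass
class BurstPredictor:
    """Hypothetical per-block predictor (an assumption, not the paper's design).

    It remembers how many writes the previous burst to a block contained and
    predicts that the current burst will end after the same number of writes.
    """
    history: dict = field(default_factory=dict)  # block -> length of last finished burst
    current: dict = field(default_factory=dict)  # block -> writes seen in ongoing burst

    def on_write(self, block):
        """Record a local write; return True if it is predicted to be the last one."""
        n = self.current.get(block, 0) + 1
        self.current[block] = n
        return self.history.get(block) == n

    def on_remote_read(self, block):
        """A remote read ends the burst; store the observed burst length."""
        n = self.current.pop(block, 0)
        if n:
            self.history[block] = n


def count_saved_hops(trace):
    """Replay (op, block) events, where op is "W" (local write) or "R" (remote read).

    Returns how many remote reads found the block already self-downgraded,
    i.e. could be served from memory without the extra hop to the owner's Cache.
    """
    predictor = BurstPredictor()
    downgraded = set()
    saved = 0
    for op, block in trace:
        if op == "W":
            if predictor.on_write(block):
                downgraded.add(block)       # predicted last write: self-downgrade now
            else:
                downgraded.discard(block)   # block is dirty again, no longer downgraded
        elif op == "R":
            if block in downgraded:
                saved += 1
            predictor.on_remote_read(block)
            downgraded.discard(block)
    return saved


# Three identical bursts of three writes, each followed by a remote read.
trace = ([("W", "A")] * 3 + [("R", "A")]) * 3
print(count_saved_hops(trace))
```

In this toy trace the first burst only trains the predictor, so the remote reads after the second and third bursts are the ones that benefit. A real implementation would also have to handle mispredictions inside the coherence protocol, which this sketch ignores.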
Source: Journal of National University of Defense Technology, 2015, Issue 1, pp. 14-20 (7 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61202119); National 863 Program of China (2013AA014301).
Keywords: Cache coherence protocol; remote Cache miss; write burst; last-write-touch; self downgrade