期刊文献+

Virtual and physical address translation mechanism of interconnect network

Virtual and physical address translation mechanism of interconnect network
下载PDF
导出
摘要 Most of users are accustomed to utilizing virtual address in their parallel programs running at the scalable high-performance parallel computing systems.Therefore a virtual and physical address translation mechanism is necessary and crucial to bridge the hardware interface and software application.In this paper,a new virtual and physical translation mechanism is proposed,which includes an address validity checker,an address translation cache(ATC),a complete refresh scheme and many reliability designs.The ATC employs a large capacity embedded dynamic random access memory(eDRAM)to meet the high hit ratio requirement.It also can switch the cache and buffer mode to avoid the high latency of accessing the main memory outside.Many tests have been conducted on the real chip,which implements the address translation mechanism.The results show that the ATC has a high hit ratio while running the well-known benchmarks,and additionally demonstrates that the new high-performance mechanism is well designed. Most of users are accustomed to utilizing virtual address in their parallel programs running at the scalable high-performance parallel computing systems.Therefore a virtual and physical address translation mechanism is necessary and crucial to bridge the hardware interface and software application.In this paper,a new virtual and physical translation mechanism is proposed,which includes an address validity checker,an address translation cache(ATC),a complete refresh scheme and many reliability designs.The ATC employs a large capacity embedded dynamic random access memory(eDRAM)to meet the high hit ratio requirement.It also can switch the cache and buffer mode to avoid the high latency of accessing the main memory outside.Many tests have been conducted on the real chip,which implements the address translation mechanism.The results show that the ATC has a high hit ratio while running the well-known benchmarks,and additionally demonstrates that the new high-performance mechanism is well designed.
作者 黎铁军 张建民 马柯帆 肖立权 李思昆 LI Tie-jun ZHANG Jian-min MA Ke-fan XIAO Li-quan LI Si-kun(School of Computer, National University of Defense Technology, Changsha 410073, China)
机构地区 School of Computer
出处 《Journal of Beijing Institute of Technology》 EI CAS 2016年第3期365-374,共10页 北京理工大学学报(英文版)
基金 Supported by the National Natural Science Foundation of China(61103083,61133007) National High Technology Research and Development Program of China(863Program)(2012AA01A301,2015AA01A301)
关键词 interconnect network virtual address physical address CACHE interconnect network virtual address physical address cache
  • 相关文献

参考文献1

二级参考文献26

  • 1Top500, http://www.top500.org, 2013.
  • 2Liao K X, Xiao Q L, Yang Q C, Lu T Y. MilkyWay-2 supercomputer system and application. Submitted to Frontiers of Computer Science, 2013.
  • 3Pritchard H, Gorodetsky I, Buntinas D. A ugni-based mpich2 nemesis network module for the cray xe. In: Proceedings of the 18th European MPI Users' Group Conference on Recent Advances in the Message Passing Interface. 2011, 110--119.
  • 4Xie M, Lu Y, Liu L, Cao H, Yang X. Implementation and evaluation of network interface and message passing services for Tianhe-la supercomputer. In: Proceedings of the 19th IEEE Annual Symposium on High Performance Interconnects. 2011,78-86.
  • 5Chun B N, Mainwaring A, Culler D E. Virtual network transport protocols for myrinet. IEEE Micro, 1998, 18(1): 53--63.
  • 6Araki S, Bilas A, Dubnicki C, Edler J, Konishi K, Philbin J. Userspace communication: a quantitative study. In: Proceedings of the 1998 ACMjlEEE Conference on Supercomputing (CDROM). 1998, 1-16.
  • 7Bhoedjang R A, Ruhl T, Bal H E. User-level network interface protocols. Computer, 1998, 31 (11): 53--60.
  • 8Schoinas I, Hill M D. Address translation mechanisms in network interfaces. In: Proceedings of the 4th International Symposium on HighPerformance Computer Architecture. 1998,219-230.
  • 9InfiniBand Architecture Specification: Release 1.0. InfiniBand Trade Association, 2000.
  • 10Graham R L, Poole S, Shamis P, Bloch G, Bloch N, Chapman H, Kagan M, Shahar A, Rabinovitz I, Shainer G. Overlapping computation and communication: Barrier algorithms and connectx-2 core-direct capabilities. In: Proceedings of the 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum. 2010, 1-8.

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部