Design of the RDMA Reliable Communication Protocol Based on Dynamic Connection
Abstract: Future 100-petaflops-class and exascale high-performance computing systems will place higher demands on the transmission reliability, performance balance, and scalability of their interconnection networks. The RDMA transport model proposed in this paper achieves end-to-end reliable data transmission by configuring a small amount of resources and using them through dynamic connections. Compared with conventional reliable communication protocols such as InfiniBand, the scheme has three advantages: (1) it supports automatic rerouting, so messages can bypass failed regions of the network and still be delivered reliably; (2) it supports out-of-order packet arrival and multipath transmission between source and destination, and provides message flow control, which together balance overall network performance, reduce hot spots, and relieve congestion; (3) the reliability data structures are implemented in the communication interface hardware, so no main memory is consumed to establish connections, which gives the system very high scalability. Preliminary test results show that, with the optimizations applied, the protocol does not increase the transmission latency of messages smaller than 4 KB.
Source: Computer Engineering & Science (《计算机工程与科学》), indexed in CSCD and the Peking University Core Journals list, 2012, No. 8, pp. 184-190 (7 pages)
Funding: National 863 Program of China (2012AA01A301); National Natural Science Foundation of China (61003301)
Keywords: reliable communication protocol; RDMA; network interface; InfiniBand; dynamic connection
