期刊文献+

基于Myrinet/GM的多通道通信 被引量:2

Multi-Networking Communication Based on Myrinet/GM
下载PDF
导出
摘要 通信子系统对并行系统的计算效率有重要影响,大规模应用对并行平台的通信性能和可用性提出了挑战性的要求.多通道通信技术通过并行采用多路网络链路互连来提高并行系统通信性能和可用性.首先分析了多进程复用网络对通信性能的影响,然后以Myrinet/GM网络平台为基础,提出了基于网络接口层的通信链路动态选择与分配策略,设计和实现了支持多路Myrinet网络并行通信的协议层MNC.MNC支持通信进程平等,充分地利用多路Myrinet网络链路资源.在使用2路Myrinet互连的PC机群平台上,MNC进程间通信带宽相对于单链路提高了约34%,有效地提高了应用层通信性能. Communication subsystem is crucial for cluster computing, which affects its efficiency, adaptability and scalability. Large-Scale applications require challenging communication performance and availability from cluster systems. Multi-Networking communication is a novel approach to improve the communication performance and availability by using multiple network links in parallel. In this paper, the effect of multi-process multiplexing one network link is analyzed, a dynamic link dispatch scheme is proposed, and the design and implementation of a multi-networking communication layer, MNC is introduced, which extends GM messaging layer, and supports multi-Myrinet parallel communication. MNC provides multi-process effectively exploiting the raw performance of multi-Myrinet, and improves the communication performance of application layer significantly. Compared with one-way Myrinet/GM environment, the communication bandwidth between MNC processes has increased by 34% on the PC cluster interconnected with 2-way Myrinet.
出处 《软件学报》 EI CSCD 北大核心 2003年第2期278-284,共7页 Journal of Software
基金 国家自然科学基金 国家重点基础研究发展规划(973)~~
关键词 多通道通信 计算机网络 Myrinet/GM 网络平台 网络链路 Bandwidth Multiplexing Network protocols Performance
  • 相关文献

参考文献7

  • 1[1]Boden NJ, Cohen D, Felderman RE, Kulawik AE, Seitz CL, Seizovic JN, Su WK. Myrinet: a gigabit-per-second local area network. IEEE Micro, 1995,15(1):29~36.
  • 2[2]Petrini F, Feng WC, Hoisie A, Coll S, Frachtenberg E. The quadrics network (QsNet): high-performance clustering technology. In: Proceedings of the 9th IEEE Hot Interconnects (HotI 2001). IEEE Computer Society Press, 2001.125~133
  • 3[3]von Eicken T, Culler DE, Goldstein SC, Schauser KE. Active messages: a mechanism for integrated communication and computation. In: Abramson D, Gaudiot JL, eds. Proceedings of the 19th ISCA. Cold Coast: ACM Press, 1992. 256~266.
  • 4[4]Prylli KE, Tourancheau B. BIP: a new protocol designed for high-performance networking on Myrinet. In: Proceedings of the International Parallel Processing Symposium 1998. Orlando: IEEE Computer Society Press, 1998. 472~485.
  • 5[5]Myricom. The gm API. 1998. http://www.myri.com/GM/doc/gm_toc.html.
  • 6[6]Coll S, Frachtenberg E, Petrini F, Hoisie A, Gurvits L. Using multirail networks in high-performance clusters. In:IEEE Cluster 2001. Newpert Beach: IEEE Computer Society Press, 2001. 15~26.
  • 7[7]Bruning U, Schalicke L. ATOLL: a high-performance communication device for parallel systems. In: IEEE, ed. Proceedings of the 1997 Conference on Advances in Parallel and Distributed Computing. Shanghai: IEEE Computer Society Press, 1997. 228~234.

同被引文献11

  • 1陈建萍.江西华信神威气象数值预报业务系统简介[J].江西气象科技,2005,28(1):18-21. 被引量:4
  • 2黎健,陈建萍,单九生.江西省引进中尺度数值模式系统的技术分析[J].江西气象科技,2004,27(4):12-15. 被引量:6
  • 3YOOK J K,TILBURY D M,SOPARKAR N R.Trading computation for bandwidth: reducing communication in distributed control systems using state estimators[J].IEEE Transactions on Control Systems Technology,2002,10(4):503-518.
  • 4AMAMIYA M,TANIGUCHI H,MATSUZAKI T.An architecture of fusing communication and execution for global distributed processing[J].Parallel Processing Letters,2001,11(1):7-24.
  • 5ANDREWS D,AUSTIN P,COSTELLO P,et al.Interprocess communications in the AN/BSY-2 distributed computer system: a case study[J].Journal of Systems and Software,2002,61(3):233-242.
  • 6SEYEDI A,SAULNIER G J.A distributed algorithm for dynamic sub-channel assignment in a multi-user OFDM communication system[A].In:IEEE Workshop on Statistical Signal Processing Proceedings[C].[s.l.],2001.
  • 7KIM J,LILJA D J.Performance-based path determination for interprocessor communication in distributed computing systems[J].IEEE Transactions on Parallel and Distributed Systems,1999,10(3):316-327.
  • 8BEGEL A,BUONADONNA P,CULLER D E,et al.An analysis of VI architecture primitives in support of parallel and distributed communication[J].Concurrency Computation Practice and Experience,2002,14(1):55-76.
  • 9周桂林,戈弋,李三立,黄震春,马群生.一种适用于机群系统的用户层消息传递机制[J].软件学报,2001,12(5):689-697. 被引量:4
  • 10都志辉,麦联叨,朱子玉,刘昊飞,李三立.克服机群系统通信瓶颈的软件方法[J].小型微型计算机系统,2002,23(1):32-35. 被引量:7

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部