期刊文献+

一种高性能的全序组播算法 被引量:1

A High Performance Total Order Broadcast Algorithm
下载PDF
导出
摘要 全序组播是构建分布式应用程序的一种重要组通信原语,它能够保证一个通信组中的所有成员都按照同样的顺序接收消息.目前的全序组播算法不能同时获取低延迟和高吞吐量,并且缺乏对应用程序通信模式的适应性,因此不适用于高性能计算环境.在分析已有算法排序机制基础上,指出影响全序组播算法性能的关键因素,并提出一种基于leader/followers模式和阻塞检测机制的新算法.算法工作原理如下:每一个组成员都可以在任意时刻发送消息,但只能提交来自当前leader成员的消息;一旦leader成员进入不活跃状态,则通过特殊的命令来指定某个活跃的follower成员为新的leader成员.模拟实验结果表明,该算法在延迟时间和吞吐量等性能指标方面都优于已有算法,同时在突发消息模式下能够大幅度提升性能. Total order broadcast is an important group communication primitive for building fault-tolerant distributed applications, and it guarantees that all members in a communication group receive messages in the same order even if some members are faulty. The existing total order broadcast algorithms can not achieve both low latency and high throughput at the same time, and lack adaptability for the communication patterns of applications, and thus they are not suitable for high performance computing environments. In analyzed in this paper are the ordering mechanisms in some existing typical algorithms, and the key factors that affect the performance of total order broadcast algorithms are pointed out. Then a novel algorithm is proposed, which builds the total order using the leader/followers pattern and is driven by block detection mechanism. It works as follows: each group member can send messages at any time, but only messages from the current leader are delivered, and if the leader remains inactive, it will issue a special request to change the leadership to one of the active follower members. Simulation experiments are performed and the results show that the new algorithm achieves good performance in terms of both latency and throughput, and is much more efficient under bursty message arrival pattern.
出处 《计算机研究与发展》 EI CSCD 北大核心 2007年第9期1449-1455,共7页 Journal of Computer Research and Development
基金 国家自然科学基金重大项目(90412011) 国家"九七三"重点基础研究发展规划基金项目(2005CB321804) 国家"八六三"高技术研究发展计划基金项目(2004AA112020 2005AA112030)
关键词 分布式算法 全序组播 原子组播 组通信 性能评估 distributed algorithm total order broadcast atomic broadcast group communication performance evaluation
  • 相关文献

参考文献13

  • 1Matthias Wiesmann,A Schiper.Comparison of database replication techniques based on total order broadcast[J].IEEE Trans on Knowledge and Data Engineering,2005,17(4):551-566
  • 2F Pedone,R Guerraoui,A Schiper.The database state machine approach[J].Distributed and Parallel Databases,2003,14(1):71-98
  • 3X Defago,A Schiper,P Urban.Comparative performance analysis of ordering strategies in atomic broadcast protocols[J].IEICE Trans on Information and Systems,2003,E86-D(12):2698-2709
  • 4X Defago,A Schiper,P Urban.Total order broadcast and multicast protocols:Taxonomy and survey[J].ACM Computing Surveys,2004,36(4):372-421
  • 5Y Amir,L E Moser,Melliar-Smith.The totem single-ring ordering and membership protocol[J].ACM Trans on Computer System,1995,3(4):311-342
  • 6K Birman,A Schiper,P Stephenson.Lightweight causal and atomic group multicast[J].ACM Trans on Computer Systems,1991,9(3):272-314
  • 7M K Aguilera,R E Strom.Efficient atomic broadcast using deterministic merge[C].In:Proc of the 19th Annual ACM Int'l Symp on Principles of Distributed Computing.New York:ACM Press,2000.209-218
  • 8F Cristian,S Mishra,G Alvarez.High performance asynchronous atomic broadcast[J].Distributed System Engineering Journal,1997,4(2):109-128
  • 9P Urban,X Defago,A Schiper.Neko:A single environment to simulate and prototype distributed protocols[J].Journal of Information Science and Engineering,2002,18(6):981-997
  • 10T D Chandra,S Toueg.Unreliable failure detectors for reliable distributed systems[J].Journal of ACM,1996,43(2):225-267

二级参考文献1

共引文献4

同被引文献9

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部