Nova-BFT: A Replicated State Machine Protocol Supporting Multiple Fault Models

Cited by: 4

Abstract: Cloud computing has greatly simplified how clients access resources, but at the cost of increasing complexity in developing and deploying the supporting systems. Byzantine faults caused by software bugs and by deployment and management mistakes have become a major factor affecting system reliability. For systems that satisfy the benign fault model during most of their lifecycle, the overhead of Byzantine fault tolerance (BFT) protocols in communication complexity and security, together with their weak performance robustness under attack, limits their use in practice. How to meet the needs of practical systems for multiple fault models has therefore become an important problem in system design. To address this, we design Nova-BFT, a replicated state machine protocol that effectively supports multiple fault models. Nova-BFT achieves the performance robustness required of a BFT protocol under attack by sacrificing some peak throughput in the fault-free scenario, and it adaptively meets the performance needs of the benign fault model through configuration parameters. Experiments show that the Nova-BFT prototype achieves a throughput of 4-5 kop/s under the Byzantine fault model, and that its support for the benign fault model can meet the needs of most practical applications.

Authors: Wang Yongjian (王永剑), Pei Xiang (裴翔), Li Tao (李涛), Luan Zhongzhi (栾钟治), Qian Depei (钱德沛)

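Affiliations: Sino-German Joint Software Institute, Beihang University; Beijing Key Laboratory of Network Technology, Beihang University; School of Computer Science and Engineering, Beihang University; Key Laboratory of Information Network Security of the Ministry of Public Security, Third Research Institute of the Ministry of Public Security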

Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2011, No. 7, pp. 1134-1145 (12 pages)

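Funding: National High Technology Research and Development Program of China ("863" Program) (2006AA01A124, 2009AA01Z144, 2009AA01A131, 2010AA012404); Natural Science Foundation project (90812001)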

Keywords: cloud computing; replicated state machine; Byzantine fault tolerance; benign fault tolerance; robustness

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
