A Protocol of Fault Diagnosis Agreement Based on Invalid Link

Abstract: Fault diagnosis agreement (FDA) is an important guarantee of the performance and integrity of highly reliable, fault-tolerant distributed systems. Most existing FDA protocols, however, consider only simple networks with a single type of faulty component, whereas for real distributed applications it is more meaningful to assume that faulty nodes and faulty links coexist. Under that assumption, the diagnosis of malicious (Byzantine) faulty components cannot satisfy FDA, because their behavior is arbitrary. This paper therefore first proposes an invalid link fault model, which describes more accurately how the behavior of malicious components affects a network with both kinds of faulty component, and which improves fault diagnosis coverage. Based on this model, an evidence-based fault diagnosis protocol, PLFDA, is presented. PLFDA collects the messages accumulated during a Byzantine agreement protocol as evidence and diagnoses the set of faulty components by examining that evidence. It detects and locates faulty nodes and faulty links simultaneously, and it satisfies the FDA requirements in a synchronous, fully connected network in which the number of faulty components is at most [n/2] - 1, of which at most [(n - 1)/3] are faulty nodes. A proof of correctness, a complexity analysis of PLFDA, and experimental results are also given.
Source: Journal of Computer Research and Development (《计算机研究与发展》; indexed in EI, CSCD, and the Peking University Core Journals list), 2007, No. 6, pp. 914-923 (10 pages)
Funding: National Natural Science Foundation of China (60503015)
Keywords: distributed system; fault diagnosis agreement; malicious node; malicious link; invalid link
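
The fault bounds stated in the abstract can be illustrated with a small check. This is only a sketch under two assumptions that the abstract does not confirm: that the square brackets denote the floor function, and that "faulty components" counts faulty nodes plus faulty links. The function names `max_tolerated` and `within_bounds` are hypothetical and are not part of PLFDA.

```python
from typing import Tuple


def max_tolerated(n: int) -> Tuple[int, int]:
    """Return (max faulty components, max faulty nodes) for an n-node network.

    Assumption: the brackets in the abstract's bounds denote floor.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    max_components = n // 2 - 1   # [n/2] - 1, read as floor(n/2) - 1
    max_nodes = (n - 1) // 3      # [(n - 1)/3], read as floor((n - 1)/3)
    return max_components, max_nodes


def within_bounds(n: int, faulty_nodes: int, faulty_links: int) -> bool:
    """Check a fault configuration against the bounds quoted in the abstract.

    Assumption: the component total is faulty nodes plus faulty links.
    """
    if faulty_nodes < 0 or faulty_links < 0:
        raise ValueError("fault counts must be non-negative")
    max_components, max_nodes = max_tolerated(n)
    total = faulty_nodes + faulty_links
    return faulty_nodes <= max_nodes and total <= max_components
```

Under these assumptions, a 10-node network would admit at most 4 faulty components, of which at most 3 may be nodes, so `within_bounds(10, 3, 1)` holds while `within_bounds(10, 3, 2)` does not.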