容错COTS服务器的Fail-Silent进程设计研究
被引量:1
摘要
首先提出了基于Fail-Silent进程的一种容错COTS服务器通用设计策略.接着通过比较Fail—Silent进程设计的两种策略,分析了基于监控的Fail—Silent进程设计策略的重要性和优势.然后作者通过分析指出现有的基于监控的Fail—Silent进程设计策略具有的三点局限性.最后针对这些局限性重点给出了改进设计并对其有效性进行了分析.改进设计应用于作者所承担的容错COTS服务器项目中,取得了良好的效果.
出处
《武汉大学学报(理学版)》
CAS
CSCD
北大核心
2004年第A01期19-22,共4页
Journal of Wuhan University:Natural Science Edition
同被引文献10
-
1TSAI T K,LYER R K,JEWITT D.An Approach towards Benchmarking of Fault-Tolerant Commercial Systems[A].Proceedings of 26^th IEEE Iht Symp on Fault Tolerant Computing[C].Sendai,1996.314-323.
-
2COSTA D,CARREIRA J,SILVA J G.WinFT:Using Off-the-shelf Computer on Industrial Environments[A].Proceedings of the 6th IEEE International Conference on Emerging Technologies and Factory Automation[C].California:Angeles,1997.39 -44.
-
3RUSSINOVICH M,SEGALL Z.Fault-Tolerance for Off-the-shelf Applications and Hardware[A].Proceedings of the 25th International Symposium on Fault-Tolerant Computing[C].California:Pasadena,1995.67-71.
-
4ALSBERG P A,DAY J D A.A principle for resilient sharing of distributed resources,International Conference on Software Engineering[A].Proceedings of the 2nd international conference on Software engineering[C].San Francisco,California,1976.562 -570.
-
5HU K,MEHROTRA S,KAPLAN S.Failure Handling in an Optimized Two-Safe Approach to Maintaining Primary-Backup Systems[A].Proceedings of the The 17thIEEE Symposium on Reliable Distributed Systems[C].West Lafayette,1998.161-168.
-
6POLYZOIS C A,GARCIA -MOLINA H.Evaluation of remote backup algorithms for transaction processing systems[J].ACM transactions on Database System.1994,19(3):423-449.
-
7OLIVEIRA R,PEREIRA J,SCHIPER A.Primary -backup replication:From a time-free protocol to a time-based implementation[A].Proceedings of the 20th IEEE Symposium on Reliable Distributed Systems[C].New Orleans,2001.14-23.
-
8ALSBERG P A,DAY J D A.A principle for resilient sharing of distributed resources[A].Proceedings of the 2nd international conference on Software engineering[C].San Francisco,California,1976.562-570.
-
9BUDHIRAJA N,MARZULLO K,etc.Optimal Primary-Backup Protocols[A],Proceedings of Sixth intl.Workshop on distributed algorithms[C].Haifa,Israel,1992.362-387.
-
10杨朝红,宫云战,桑伟前,刘海燕,李庆艳.基于主从异步复制技术的容灾实时系统研究与实现[J].计算机研究与发展,2003,40(7):1104-1109. 被引量:20
-
1莫毓昌,崔刚,曲峰.面向容错COTS服务器的PB机制研究[J].哈尔滨工业大学学报,2006,38(10):1617-1621. 被引量:1
-
2邹候文.密码机制和容错系统中的协同一致[J].广州大学学报(自然科学版),2003,2(5):442-445.
-
3左德承,高巍,杨孝宗.DRD──基于诊断的高可靠分布式计算机系统的设计[J].计算机应用研究,2001,18(4):23-25.
-
4寇欣宇,王仲,叶声华.基于多进程的测控系统软件设计及其数据通信[J].计算机工程与应用,2000,36(6):101-103. 被引量:5
-
5莫毓昌,崔刚.基于泛洪的可靠广播算法分析[J].哈尔滨工业大学学报,2006,38(3):331-333. 被引量:1
-
6王新杰,谌向华.嵌入式智能家居安防监控系统[J].硅谷,2012,5(8):40-40. 被引量:2
-
7姜守旭,王继隆.Peer-to-Peer系统信息匿名更新的研究与实现[J].哈尔滨理工大学学报,2005,10(4):103-108.
-
8李晓汀,丁凡,熊华钢.基于OPNET的CAN网络建模与仿真[J].北京航空航天大学学报,2009,35(3):284-287. 被引量:12
-
9赵建超.基于Linux下智能卡登陆操作系统的实现方法[J].西安航空技术高等专科学校学报,2006,24(5):27-29. 被引量:2
-
10吴兆芝.X86平台多任务实验演示系统设计与实现[J].通化师范学院学报,2011,32(8):17-19.