Research of an Adaptive Failure Detector on iSCSI

Cited by: 1
Abstract: A failure detector is a fundamental building block of a dependable iSCSI storage system. This paper implements iAFD (adaptive failure detector for iSCSI), an adaptive failure detector for iSCSI systems. Building on the heartbeat strategy, it designs an adaptive heartbeat mechanism. The detector estimates the expected arrival time of the next heartbeat to derive a detection time, and it dynamically estimates the heartbeat timeout and the transmission delay of the system. This lets it adapt to changes in system state and reduce false detections. Experimental results show that, compared with other failure detection methods, iAFD produces fewer false detections and has a shorter detection time. It adapts to changes in the state of a highly reliable computing system and strikes a good balance between the timeliness and the correctness of detection.
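Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2008, No. 6, pp. 90-94 (5 pages)
Funding: Supported by the National Natural Science Foundation of China project "Research on Content-Aware IP Storage Network Systems" (60673001)
Keywords: failure detection, iSCSI system, heartbeat algorithm, adaptability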
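
The abstract describes the mechanism only at a high level: estimate when the next heartbeat should arrive, then derive a timeout from that estimate. The record does not give iAFD's actual formulas, so the following is only a minimal sketch of the general idea, assuming a sliding-window mean of arrival offsets plus a fixed safety margin (an estimator in the style of Chen, Toueg and Aguilera). The class name, parameters, and sample timings are all hypothetical and are not taken from the paper.

```python
from collections import deque


class AdaptiveHeartbeatDetector:
    """Illustrative adaptive heartbeat detector; NOT the paper's iAFD algorithm."""

    def __init__(self, interval, safety_margin, window=100):
        self.interval = interval              # assumed sender heartbeat period (seconds)
        self.safety_margin = safety_margin    # assumed fixed slack added to the estimate
        self.offsets = deque(maxlen=window)   # arrival time minus nominal send time
        self.last_seq = None                  # highest sequence number seen so far

    def heartbeat(self, seq, arrival_time):
        """Record heartbeat number `seq` received at `arrival_time`."""
        self.offsets.append(arrival_time - seq * self.interval)
        if self.last_seq is None or seq > self.last_seq:
            self.last_seq = seq

    def expected_arrival(self):
        """Estimate the arrival time of the next heartbeat."""
        if not self.offsets:
            raise ValueError("no heartbeat received yet")
        mean_offset = sum(self.offsets) / len(self.offsets)
        return mean_offset + (self.last_seq + 1) * self.interval

    def deadline(self):
        """Time after which the monitored node is suspected."""
        return self.expected_arrival() + self.safety_margin

    def suspects(self, now):
        """Return True if the next heartbeat is overdue at time `now`."""
        return now > self.deadline()


# Hypothetical trace: heartbeats sent every 1.0 s, arriving with varying delay.
detector = AdaptiveHeartbeatDetector(interval=1.0, safety_margin=0.5)
for seq, arrival in [(1, 1.10), (2, 2.30), (3, 3.20)]:
    detector.heartbeat(seq, arrival)

print(detector.suspects(4.5))  # False: the deadline is about 4.7
print(detector.suspects(5.0))  # True: heartbeat 4 is overdue
```

Because the deadline follows the recent arrival offsets, a slower network pushes the timeout out instead of triggering a false detection. A larger safety margin trades longer detection time for fewer false detections, which is the balance the abstract refers to.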