SFT: A Consistent Checkpointing Algorithm with Short Freezing Time

Abstract: This paper presents SFT, a consistent checkpointing algorithm with short freezing time that supports fault tolerance in distributed systems. The algorithm offers shorter freezing time, lower overhead, and simple recovery. To shorten checkpoint time, a special control message (Munblock) ensures that a process can respond to a checkpoint event quickly at any time. In addition, a main-memory algorithm is used to improve the concurrency of checkpointing. With SFT, the freezing time caused by checkpointing is less than 0.03 s, and the number of control messages is only O(n).
Source: Journal of Computer Science & Technology (计算机科学技术学报, English edition), 2000, No. 2, pp. 169-175 (7 pages). Indexed in SCIE, EI, CSCD.
Funding: National Natural Science Foundation of China, grant No. 69673012
Keywords: checkpointing, fault-tolerance, distributed system, freezing time
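
The abstract does not say how SFT's main-memory step is implemented. The sketch below illustrates only the general idea of main-memory checkpointing under stated assumptions: the process is frozen just long enough to copy its state into memory, and the slow disk write then runs concurrently with normal execution. The names `Process`, `checkpoint`, and `_write_snapshot` are hypothetical, and the sketch does not model the Munblock control message or any coordination between processes.

```python
import copy
import json
import os
import threading
import time


class Process:
    """Toy application process whose state is a plain dict (hypothetical)."""

    def __init__(self, state):
        self.state = state
        self.lock = threading.Lock()  # held only while the process is "frozen"

    def checkpoint(self, path):
        """Freeze for the in-memory copy only, then write in the background.

        Returns (freeze_seconds, writer_thread).
        """
        start = time.perf_counter()
        with self.lock:  # freezing starts: no state changes allowed
            snapshot = copy.deepcopy(self.state)
        freeze = time.perf_counter() - start  # freezing ends here

        # The disk write overlaps with normal execution of the process.
        writer = threading.Thread(target=_write_snapshot, args=(snapshot, path))
        writer.start()
        return freeze, writer


def _write_snapshot(snapshot, path):
    # Write to a temporary file and rename it, so that a crash during the
    # write never leaves a partially written checkpoint at `path`.
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump(snapshot, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)
```

In this sketch the measured freezing time covers only the in-memory copy, so it depends on the size of the state rather than on disk speed. The process may modify its state as soon as `checkpoint` returns, and the file on disk still reflects the state at the moment of the copy.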