期刊文献+

Improving Rollback-Recovery Efficiency by Tuning Pessimism Grain

Improving Rollback-Recovery Efficiency by Tuning Pessimism Grain
原文传递
导出
摘要 Wide-area systems are becoming a popular infrastructure for long-running applications. Rollback- recovery, as a common technology for fault tolerance and load balance, must meet the challenges of scal- ability and inherent variability in such applications. Most of the rollback-recovery protocols, however, are poor in scalability. Although pessimistic message logging protocols have no such problem, their fault-free overhead sometimes is prohibitive. Aiming at good scalability and acceptable overhead, this paper intro- duces the concept of pessimism grain and presents a coarse-grained pessimistic message-logging scheme. The paper also evaluates the impact of pessimism grain on the performance of the recovery scheme. Ex- perimental results show that pessimism grain is one of the key configuration parameters to reach a desired performance level. In practice, the proper pessimism grain should be selected based on the characteristics of the applications. Wide-area systems are becoming a popular infrastructure for long-running applications. Rollback- recovery, as a common technology for fault tolerance and load balance, must meet the challenges of scal- ability and inherent variability in such applications. Most of the rollback-recovery protocols, however, are poor in scalability. Although pessimistic message logging protocols have no such problem, their fault-free overhead sometimes is prohibitive. Aiming at good scalability and acceptable overhead, this paper intro- duces the concept of pessimism grain and presents a coarse-grained pessimistic message-logging scheme. The paper also evaluates the impact of pessimism grain on the performance of the recovery scheme. Ex- perimental results show that pessimism grain is one of the key configuration parameters to reach a desired performance level. In practice, the proper pessimism grain should be selected based on the characteristics of the applications.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2007年第S1期8-13,共6页 清华大学学报(自然科学版(英文版)
基金 the National Natural Science Foundation of China (Nos. 60473031, 60673155) the Natural Science Foundation of Hunan (No. 05JJ30116)
关键词 rollback recovery SCALABILITY performance evaluation protocol optimization rollback recovery scalability performance evaluation protocol optimization
  • 相关文献

参考文献8

  • 1Cao G,Singhal M.Checkpointing with mutable check- points. Theoretical Computer Science . 2003
  • 2Elnozahy E N,Plank J S.Checkpointing for peta-scale systems: A look into the future of practical rollback- recovery. IEEE Transactions on Dependable and Secure Computing . 2004
  • 3Plank J S,Thomason M G.Processor allocation and checkpoint interval selection in cluster computing systems. Journal of Parallel and Distributed Computing . 2001
  • 4Alvisi L,Marzullo K.Message logging: Pessimistic, opti- mistic, causal and optimal. IEEE Transactions on Software Engineering . 1998
  • 5Vaidya N H.Impact of checkpoint latency on overhead ratio of a checkpointing scheme. IEEE Transactions on Computers . 1997
  • 6Vaidya N H.A case for two-level recovery schemes. IEEE Transaction on Computers . 1998
  • 7Elnozahy E,Alvisi L,Wang Y,et al.A survey of rollbackrecovery protocols in message passing systems. ACM Computing Surveys . 2002
  • 8Bhatia K,Marzullo K,Alvisi L.Scalable causal message logging for wide-area environment. Concurrency and Computation: Practice and Experience . 2003

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部