期刊文献+

基于PVM的准同步检查点设置方法

Method for PVM-based quasi-synchronous checkpointing
下载PDF
导出
摘要 检查点是并行系统中实现容错的重要手段,同步检查点方法已广泛应用在工作站机群系统中。PVM所提供的消息传递机制支持高效的异构网络计算,但不支持容错功能。为了降低同步检查点设置的时间开销,提出了一种基于PVM的准同步检查点设置方法,它吸取了同步检查点方法的优点,又通过消息记录方式实现各节点间独立进行状态保存,大大降低了检查点的同步开销,提高了检查点操作效率,该方法在PVM环境下得以实现,实验结果表明所提出的方法具有较好的容错性能。 Checkpoint is an important means to implement fault-tolerance in parallel system. Synchronous checkpointing method has been widely used in network of workstation system. Message-passing mechanism, provided by PVM, has high efficiency in heterogeneous network computing, while lacks of supporting fault-tolerance. In order to reduce time overhead, a method for PVM-based quasi-synchronous checkpointing was given. This method adopted the advantages of synchronous checkpointing method, and enabled each node to save status independently by recording message. Thereby, overhead of synchronization of checkpointing was reduced greatly, and operation efficiency of checkpoint was enhanced. This method was implemented in PVM environment. Results of experiments showed that the method had better performance of fault-tolerance.
作者 张宇 张玉芳
出处 《计算机工程与设计》 CSCD 北大核心 2006年第3期494-496,共3页 Computer Engineering and Design
关键词 检查点 准同步 消息 checkpoint quasi-synchronization message
  • 相关文献

参考文献9

二级参考文献11

  • 1鞠九滨,计算机学报,1997年,20卷,10期,873页
  • 2鞠九滨,计算机学报,1997年,20卷,10期,873页
  • 3Elnozahy E N,CMU Technical Report CMU-CS-99-148,1999年
  • 4Tannenbaum T,Litzkow M.The condor distributed processing system[J].Dr.Dobbs Journal,1995,(2):40~48
  • 5Wang Dong-sheng,Zheng Wei-min,Wang Ding-xing,Shen Meiming.Checkpointing and rollback recovery for network of workstations[J].Science in China Series E-Technological Sciences APR,1999 42(2) 207~214
  • 6Pei Dan,Wang Dong-sheng Shen Mei-ming.Quasi-asynchronous migration: a novel migration protocol for PVM Tasks[J].Operating Systems Review.April 1999,33(2):5~14
  • 7Stellner G,Pruyne J.Resource management and checkpointing for PVM[C].In: Proceedings of the 2nd European PVM Users Group Meeting.Lyon,France: Edition Hermes ,1995.131~136.
  • 8Casas J,Clark D L et al.MPVM: a migration transparent version of PVM[J].Computing Systems,1995,8(2):171~216
  • 9鞠九滨,魏晓辉,徐高潮,尹玉.DPVM:支持任务迁移和排队的PVM[J].计算机学报,1997,20(10):872-877. 被引量:12
  • 10魏晓辉,鞠九滨.分布式系统中的检查点算法[J].计算机学报,1998,21(4):367-375. 被引量:12

共引文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部