2 articles found
Checkpointing and rollback recovery for network of workstations (cited by: 1)
1
Authors: Wang Dongsheng (汪东升), Zheng Weimin (郑纬民), Wang Dingxing (王鼎兴), Shen Meiming (沈美明). Science China (Technological Sciences), SCIE EI CAS, 1999, No. 2, pp. 207-214 (8 pages)
Networks of workstations (NOW) have become one of the main trends in parallel computing. For long-running scientific programs, however, a NOW needs effective fault tolerance because of its changing nature. Checkpointing and rollback recovery is a solution to this problem. The main problems of rollback recovery are discussed first and the different checkpointing techniques for NOW are analyzed; the design and implementation of the ChaRM (checkpoint-based rollback recovery and process migration) system are then described. A comparison of three coordinated checkpointing systems is given.
Keywords: checkpointing; rollback recovery; network of workstations (NOW); domino effect; coordinated checkpointing
SCR Algorithm: Saving/Restoring States of File Systems
2
Authors: Wei Xiaohui (魏晓辉), Ju Jiubin (鞠九滨). Journal of Computer Science & Technology, SCIE EI CSCD, 2000, No. 4, pp. 393-400 (8 pages)
Fault tolerance is very important in cluster computing and has been implemented in many well-known cluster-computing systems using checkpoint/restart mechanisms. However, existing checkpointing algorithms cannot restore the state of a file system when a running program is rolled back, so existing fault-tolerance systems place many restrictions on file access. This paper presents the SCR algorithm, an algorithm based on atomic operations and a consistent schedule that can restore the state of file systems. In the SCR algorithm, system calls on file systems are classified into idempotent and non-idempotent operations: a non-idempotent operation modifies the file system's state, while an idempotent operation does not. The SCR algorithm tracks changes to the file system state. It logs to disk each non-idempotent operation issued by user programs, together with the information needed to reverse that operation. When the program is rolled back to a checkpoint, the SCR algorithm reverts the file system state to that of the last checkpoint. With the SCR algorithm, users may use any file operation in their programs.
Keywords: fault tolerance; checkpointing; atomic operation; recoverability of file systems
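The logging idea in this abstract can be illustrated with a small sketch. It is not the SCR algorithm itself, whose details the abstract does not give: the class `UndoLog`, its methods, and `demo` are hypothetical names, the wrapper works on whole files at the Python level rather than on system calls, and the log is held in memory here for brevity, whereas the abstract says the log is kept on disk.

```python
import os
import tempfile


class UndoLog:
    """Hypothetical undo log: records how to reverse each state-modifying
    file operation performed since the last checkpoint."""

    def __init__(self):
        # Assumption: an in-memory list stands in for the on-disk log.
        self._entries = []

    def checkpoint(self):
        # A new checkpoint makes earlier undo information unnecessary.
        self._entries.clear()

    def read_file(self, path):
        # Does not modify file system state, so nothing is logged.
        with open(path, "rb") as f:
            return f.read()

    def _remember(self, path):
        # Save the previous contents, or None if the file did not exist.
        old = None
        if os.path.exists(path):
            with open(path, "rb") as f:
                old = f.read()
        self._entries.append((path, old))

    def write_file(self, path, data):
        # Modifies state: log the undo information before overwriting.
        self._remember(path)
        with open(path, "wb") as f:
            f.write(data)

    def remove_file(self, path):
        # Modifies state: log the contents so the file can be recreated.
        self._remember(path)
        os.remove(path)

    def rollback(self):
        # Reverse the logged operations in last-in, first-out order.
        while self._entries:
            path, old = self._entries.pop()
            if old is None:
                if os.path.exists(path):
                    os.remove(path)
            else:
                with open(path, "wb") as f:
                    f.write(old)


def demo():
    with tempfile.TemporaryDirectory() as d:
        a = os.path.join(d, "a.txt")
        b = os.path.join(d, "b.txt")
        log = UndoLog()
        log.write_file(a, b"v1")
        log.checkpoint()            # state at checkpoint: a.txt == b"v1"
        log.write_file(a, b"v2")
        log.write_file(b, b"new")
        log.remove_file(a)
        log.rollback()              # revert to the checkpointed state
        return log.read_file(a), os.path.exists(b)
```

Calling `demo()` modifies, creates, and deletes files after a checkpoint and then rolls back, so `a.txt` holds its checkpointed contents again and `b.txt` no longer exists. Undoing in reverse order matters because later operations may depend on the effects of earlier ones.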