Abstract
PVM (Parallel Virtual Machine), a mainstream environment for parallel computing on clusters, tolerates host and network failures and provides polling and failure-notification primitives for building fault-tolerant applications. Because PVM's original fault-detection mechanism cannot meet real-time requirements, this paper proposes and implements two methods that improve the real-time performance of that mechanism, reducing the time needed to detect a host failure by a factor of more than 10,000.
Source
Application Research of Computers (《计算机应用研究》)
CSCD
PKU Core Journal (北大核心)
2006, No. 11, pp. 183-185, 188 (4 pages)
Funding
National Defense Pre-Research Foundation project (41306050103)
Shipbuilding Foundation project (41901080101)
Keywords
PVM (Parallel Virtual Machine)
Cluster
Fault detection