Abstract
Software reliability in distributed systems is an important concern for both the vendors and the users of application software. For large-scale distributed applications, including e-government, e-commerce, multimedia services, and end-to-end automation solutions, a variety of models have been developed to evaluate or predict reliability, yet the reliability problems of these systems persist. Ensuring the reliability of a distributed system instead requires checking the reliability of every individual component or factor associated with an enterprise distributed application before predicting or evaluating the reliability of the whole system, and implementing transparent fault detection and recovery mechanisms so that users experience seamless interaction. From the perspective of examining the reliability of individual components, this article therefore presents the problems and challenges of reliability for application software running on distributed systems.
Source
Computer and Information Technology (《电脑与信息技术》)
2014, No. 6, pp. 29-32 (4 pages)
Keywords
distributed system
reliability prediction
assessment
fault tolerance