期刊文献+

网格中基于OGSA的自适应故障检测策略(英文)

Adaptive Fault Detection Strategy in GRID Based on OGSA
下载PDF
导出
摘要 开放网格服务体系(OGSA)需要可扩展和可伸缩容错机制,以解决不同需求,比如多样故障处理策略及故障处理策略和应用代码的分离.本文提出了一种基于OGSA的错误检测框架,根据用户需求,检测算法可以向应用提供错误检测服务.最后,本文进行了性能分析和试验评估,结果表明本文提出的方法能够满足多样和动态的网格应用. Open Grid Services Architecture (OGSA) requires scalable and flexible fault tolerant mecha- nisms to address different requirements such as support for diverse failure handling strategies and separating failure handling strategies from application codes. In this paper, we propose a failure detection framework based on the OGSA. An adaptive failure detection algorithm is presented, and according to users' demands,it can provide application with failure detection service. At last, we put forward performance analysis and experiment evaluation, and the result shows that the method proposed in this paper satisfies application in grid with diversity and dynamic.
出处 《北京交通大学学报》 EI CAS CSCD 北大核心 2008年第6期102-105,110,共5页 JOURNAL OF BEIJING JIAOTONG UNIVERSITY
关键词 网格计算 可靠性 容错 自适应 故障检测 grid computing reliability fauh-tolerant adaptive fault detection
  • 相关文献

参考文献6

  • 1Foster I, Kesselman C. The Globus Project: A State Report[J]. Future Generation Computer Systems, 1999, 15 (5) : 607 - 621.
  • 2Hwang S, Kesselman C. Grid Workflow: a Flexible Framework for Fault Tolerance in the Grid[ C]//Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing, 2003: 126- 137.
  • 3Stelling P, Foster I, Kesselman C, et al. A Fault Detection Service for Wide Area Distributed Computations[ C]// In Proceedings of the 7th IEEE Symposium on High Performance Distributed Computing, Los Alamitos, IEEE Computer Society Press, 1998: 268- 278.
  • 4Abawajy J H. Fault Detection Service Architecture for Grid Computing Systems [ C ]//Proceedings of ICCSA 2004, Lecture Note in Computer Science 3044, Berlin, Springer, 2004: 107- 115.
  • 5田东,陈蜀宇,陈峰.一种网格环境下的动态故障检测算法[J].计算机研究与发展,2006,43(11):1870-1875. 被引量:9
  • 6Xuanhua Shi, Hai Jin, Zongfen Han, et al. ALTER: Adaptive Failure Detection Services for Grids[ C]//Proceedings of the 2005 IEEE Int' l Conf. on Services Computing (SCC' 05), Los Alamitos, CA, IEEE Computer Society Press, 2005: 355-358.

二级参考文献11

  • 1I Foster.The Grid:A new infrastructure for 21st century science[J].Physics Today,2002,55(22):42-47
  • 2R Medeiros,W Cirne,F Brasileiro.Faults in grids:Why are they so bad and what can be done about it[C].In:Proc of the 4th Int'l Workshop on Grid Computing.Los Alamitos,CA:IEEE Computer Society Press,2003.18-24
  • 3S Hwang,C Kesselman.A flexible framework for fault tolerance in the grid[J].Journal of Grid Computing,2003,1(3):251-272
  • 4P Stelling,C Dematteis,I Foster,et al.A fault detection service for wide area distributed computations[J].Cluster Computing,1999,(2):117-128
  • 5J H Abawajy.Fault detection service architecture for grid computing systems[G].In:Proc of ICCSA 2004,Lecture Note in Computer Science 3044.Berlin:Springer,2004.107-115
  • 6A Jain,R K Shyamasundar.Failure detection and membership in grid environments[C].In:Proc of the 5th IEEE/ACM Int'l Workshop on Grid Computing (GRID'04).Los Alamitos,CA:IEEE Computer Society Press,2004.44-52
  • 7T D Chandra,S Toueg.Unreliable failure detectors for reliable distributed systems[J].Journal of ACM,1996,43(2):225-267
  • 8W Chen,S Toueg,M K Aguilera.On the quality of service of failure detectors[J].IEEE Trans on Computers,2002,51(2):13-32
  • 9M Bertier,O Marin,P Sens.Implementation and performance evaluation of an adaptable failure detector[C].In:Proc of IEEE Int'l Conf on Dependable Systems and Networks (DSN'02).Los Alamitos,CA:IEEE Computer Society Press,2002.354-363
  • 10N Hayashibara,X Défago,R Yared,et al.The φ accrual failure detector[C].In:Proc of the 23rd IEEE Int'l Symp on Reliable Distributed Systems (SRDS'04).Los Alamitos,CA:IEEE Computer Society Press,2004.66-78

共引文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部