Optimal Redundant Scheduling of Grid Tasks Based on Fault Recovery

Original title: 失效恢复机制下的网格任务冗余调度优化 (cited by: 1)
Abstract: Grid technology is an important tool in both academia and industry for solving computation-intensive problems. Because grid systems are complex, grids still face many open reliability problems. To address the low reliability of grid services, a local fault recovery mechanism is introduced, and resource owners are allowed to set the lifetime of grid tasks and the number of recoveries performed, which yields a more realistic grid service reliability model. Redundant scheduling of grid tasks is adopted in the model to further improve grid service reliability. Based on this reliability model with fault recovery, a cost-constrained optimization model for redundant resource scheduling is formulated to find the task scheduling strategy that maximizes grid service reliability. To solve this NP problem, a genetic algorithm is developed, and dedicated repair operators are designed to correct infeasible individuals so that the algorithm runs properly. A numerical example verifies the effectiveness of the algorithm.
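Source: Journal of Mechanical Engineering (机械工程学报), 2010, No. 23, pp. 154-160 (7 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (Grant No. 70828001)
Keywords: grid; service reliability; fault recovery; redundant scheduling; genetic algorithm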