
A Real-Time Task Availability Improving Fault-Tolerant Scheduling Algorithm on Heterogeneous Platform
Cited by: 3
Abstract: With the rapid development of Internet Plus, cloud computing, and big data, heterogeneous platforms have become an important target for deploying key applications such as scientific computing, industrial control, and cloud storage. Because the processors within a platform differ in performance and in software/hardware architecture, heterogeneous platforms offer good scalability and a high cost-performance ratio. However, as platforms grow larger and applications become more complex, the schedulability of real-time tasks degrades and system availability decreases. To address this problem, we propose an availability improving fault-tolerant scheduling algorithm (AIFSAL) for real-time tasks on heterogeneous platforms. The overall scheduling framework and the processor, task, and scheduling models are designed around processor utilization and availability cost. The algorithm combines availability cost with the primary/backup copy (PB) method to achieve fault tolerance. Depending on processor utilization, a task's backup copy executes in either passive or overlapping mode, with passive mode preferred, which reduces redundancy overhead and improves schedulability. For both primary and backup copies, processors with lower availability cost are chosen preferentially in order to improve system availability. A theoretical analysis of task allocation and schedulability demonstrates the feasibility of AIFSAL. Simulation experiments and comparison with the availability approached task scheduling algorithm (AATSAL), the task partition based fault-tolerant rate-monotonic algorithm (TPFTRM), and the earliest completion time algorithm (MinMin) show that AIFSAL improves system availability without reducing schedulability, lowers the overall system overhead, and significantly improves overall performance.
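The selection rule described in the abstract (prefer the processor with the lowest availability cost for both copies, and pick a backup mode per task) can be illustrated with a minimal Python sketch. This is not the AIFSAL algorithm itself, since the abstract does not define it in detail. The names `Processor`, `Task`, `pick`, and `schedule`, the scalar `availability_cost`, the utilization `bound` of 1.0, and the rule used to choose between passive and overlapping backups are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Processor:
    name: str
    availability_cost: float  # assumed scalar metric: lower is preferred
    utilization: float = 0.0  # fraction of capacity already reserved


@dataclass
class Task:
    name: str
    period: float
    exec_time: dict  # processor name -> execution time (heterogeneous platform)


def pick(processors, task, exclude=None, bound=1.0):
    """Return the feasible processor with the lowest availability cost, or None."""
    feasible = [
        p for p in processors
        if p.name != exclude
        and p.name in task.exec_time
        and p.utilization + task.exec_time[p.name] / task.period <= bound
    ]
    return min(feasible, key=lambda p: p.availability_cost, default=None)


def schedule(tasks, processors, bound=1.0):
    """Greedily place a primary and a backup copy of every task.

    Returns {task name: (primary, backup, mode)} or None if some copy
    cannot be placed. Processor utilizations are updated in place.
    """
    plan = {}
    for task in tasks:
        primary = pick(processors, task, bound=bound)
        if primary is None:
            return None
        primary.utilization += task.exec_time[primary.name] / task.period

        # The backup must live on a different processor to tolerate a failure.
        backup = pick(processors, task, exclude=primary.name, bound=bound)
        if backup is None:
            return None
        backup.utilization += task.exec_time[backup.name] / task.period

        # Assumed rule: a passive backup starts only after the primary fails,
        # so both executions must fit in one period; otherwise overlap them.
        total = task.exec_time[primary.name] + task.exec_time[backup.name]
        mode = "passive" if total <= task.period else "overlapping"
        plan[task.name] = (primary.name, backup.name, mode)
    return plan


if __name__ == "__main__":
    procs = [Processor("P1", 0.1), Processor("P2", 0.3), Processor("P3", 0.2)]
    tasks = [
        Task("t1", 10, {"P1": 2, "P2": 3, "P3": 3}),
        Task("t2", 10, {"P1": 5, "P2": 6, "P3": 6}),
    ]
    print(schedule(tasks, procs))
```

The sketch reserves the backup's full utilization in both modes, which is a conservative simplification. A real scheduler would account for the reduced overhead of passive copies, which is the saving the abstract attributes to AIFSAL.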
Source: Journal of Computer Research and Development (EI, CSCD, PKU Core), 2015, No. 12, pp. 2669-2683 (15 pages)
Funding: National High Technology Research and Development Program of China (863 Program) (2008AA01A202); National Natural Science Foundation of China (61202041)
Keywords: heterogeneous platform; task scheduling; availability; fault-tolerant; primary/backup copy (PB)

