异构分布式系统性能得到大幅度提升的同时,却造成故障率大增,以有向无环图(Directed Acyclic Graph,DAG)任务模型研究异构分布式系统的容错调度成为当前的研究热点.广泛采用的基于任务复制的容错算法存在以下问题:(1)DAG任务可靠性需求...异构分布式系统性能得到大幅度提升的同时,却造成故障率大增,以有向无环图(Directed Acyclic Graph,DAG)任务模型研究异构分布式系统的容错调度成为当前的研究热点.广泛采用的基于任务复制的容错算法存在以下问题:(1)DAG任务可靠性需求与DAG可靠性需求的约束存在缺陷且缺乏严谨的理论证明;(2)每个任务仅有一个副版任务,不足以应对任务潜在的多次发生的故障;(3)盲目地使每个任务拥有ε+1个副版来容忍可能的ε个故障,虽然提高了系统的可靠性但易造成系统冗余度过高,并付出昂贵的计算资源.文中首先分析DAG图中任务依赖关系,确定DAG任务的可靠性概率模型,并建立DAG可靠性模型;接着提出满足可靠性目标的任务复制下限值算法、经济的任务复制策略算法和贪婪的任务复制策略算法,精确量化各个任务需要复制的次数,最后在上述算法的基础上提出可选策略的DAG容错算法OPDFT(Optional Policy on DAG Fault-Tolerant).实验表明,OPDFT算法的经济复制策略和贪婪复制策略的可靠性代价分别是盲目策略算法可靠性代价的60%和70%左右.展开更多
传统的异构分布式实时调度算法基本没有考虑任务的动态特性。提出一种非周期不可抢占式异构分布式的动态容错模型,在该模型上基于不同调度需求给出两种不同容错调度算法:DRFSA(Dynamic and Reliability-driven of hybrid with Fault-tol...传统的异构分布式实时调度算法基本没有考虑任务的动态特性。提出一种非周期不可抢占式异构分布式的动态容错模型,在该模型上基于不同调度需求给出两种不同容错调度算法:DRFSA(Dynamic and Reliability-driven of hybrid with Fault-tolerant Scheduling Algorithm)算法与DSFSA(Dynamic and Schedulability-driven of hybrid with Fault-tolerant Scheduling Algorithm)算法。DRFSA算法以提高可靠性代价为调度目标,通过合理调度提高系统可靠性。DSFSA算法以可调度性为调度目标,通过减少任务执行时间来增加系统可调度性。算法能够在异构系统中调度动态的实时任务,且能够尽可能响应任务需求。展开更多
In order to help manage the complexity, heterogeneity and dynamic inherent in distributed real-time systems, middleware of next generation should deal with changing environments and different client requirements. it i...In order to help manage the complexity, heterogeneity and dynamic inherent in distributed real-time systems, middleware of next generation should deal with changing environments and different client requirements. it is very necessary to resolve interoperability between heterogeneous schedulers. Therefore, this promotes the research of a heterogeneous distributed real-time scheduling system based on reflective middleware. The features of reflection technology and reflective middleware are described in detail first. As a result, proposed reflective middleware is used to resolve interoperability between heterogeneous schedulers in distributed real time system. The high performance,adaptability and feasibility of this middleware platform used to resolve interoperability between heterogeneous schedulers are proved in details by a concrete example. Future trends of research in this field are listed at the end.展开更多
文摘异构分布式系统性能得到大幅度提升的同时,却造成故障率大增,以有向无环图(Directed Acyclic Graph,DAG)任务模型研究异构分布式系统的容错调度成为当前的研究热点.广泛采用的基于任务复制的容错算法存在以下问题:(1)DAG任务可靠性需求与DAG可靠性需求的约束存在缺陷且缺乏严谨的理论证明;(2)每个任务仅有一个副版任务,不足以应对任务潜在的多次发生的故障;(3)盲目地使每个任务拥有ε+1个副版来容忍可能的ε个故障,虽然提高了系统的可靠性但易造成系统冗余度过高,并付出昂贵的计算资源.文中首先分析DAG图中任务依赖关系,确定DAG任务的可靠性概率模型,并建立DAG可靠性模型;接着提出满足可靠性目标的任务复制下限值算法、经济的任务复制策略算法和贪婪的任务复制策略算法,精确量化各个任务需要复制的次数,最后在上述算法的基础上提出可选策略的DAG容错算法OPDFT(Optional Policy on DAG Fault-Tolerant).实验表明,OPDFT算法的经济复制策略和贪婪复制策略的可靠性代价分别是盲目策略算法可靠性代价的60%和70%左右.
文摘传统的异构分布式实时调度算法基本没有考虑任务的动态特性。提出一种非周期不可抢占式异构分布式的动态容错模型,在该模型上基于不同调度需求给出两种不同容错调度算法:DRFSA(Dynamic and Reliability-driven of hybrid with Fault-tolerant Scheduling Algorithm)算法与DSFSA(Dynamic and Schedulability-driven of hybrid with Fault-tolerant Scheduling Algorithm)算法。DRFSA算法以提高可靠性代价为调度目标,通过合理调度提高系统可靠性。DSFSA算法以可调度性为调度目标,通过减少任务执行时间来增加系统可调度性。算法能够在异构系统中调度动态的实时任务,且能够尽可能响应任务需求。
文摘In order to help manage the complexity, heterogeneity and dynamic inherent in distributed real-time systems, middleware of next generation should deal with changing environments and different client requirements. it is very necessary to resolve interoperability between heterogeneous schedulers. Therefore, this promotes the research of a heterogeneous distributed real-time scheduling system based on reflective middleware. The features of reflection technology and reflective middleware are described in detail first. As a result, proposed reflective middleware is used to resolve interoperability between heterogeneous schedulers in distributed real time system. The high performance,adaptability and feasibility of this middleware platform used to resolve interoperability between heterogeneous schedulers are proved in details by a concrete example. Future trends of research in this field are listed at the end.