期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
面向容错COTS服务器的PB机制研究 被引量:1
1
作者 莫毓昌 崔刚 曲峰 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2006年第10期1617-1621,共5页
采用自顶向下的思想对利用PB机制构造容错COTS服务器进行了系统性的研究.首先基于实际研究开发工作给出了容错COTS服务器的系统模型,然后在该系统模型基础上设计了的通用PB机制,最后设计和分析了PB机制的四个核心算法:分布式故障诊断算... 采用自顶向下的思想对利用PB机制构造容错COTS服务器进行了系统性的研究.首先基于实际研究开发工作给出了容错COTS服务器的系统模型,然后在该系统模型基础上设计了的通用PB机制,最后设计和分析了PB机制的四个核心算法:分布式故障诊断算法、分布式选举算法、分布式一致性控制算法和客户端/服务器协同算法. 展开更多
关键词 容错COTS服务器 primary-backups机制 算法分析和设计
下载PDF
A novel fault-tolerant scheduling approach for collaborative workflows in an edge-IoT environment
2
作者 Tingyan Long Yong Ma +6 位作者 Lei Wu Yunni Xia Ning Jiang Jianqi Li Xiaodong Fu Xiangmi You Bo Zhang 《Digital Communications and Networks》 SCIE CSCD 2022年第6期911-922,共12页
As a newly emerging computing paradigm, edge computing shows great capability in supporting and boosting 5G and Internet-of-Things (IoT) oriented applications, e.g., scientific workflows with low-latency, elastic, and... As a newly emerging computing paradigm, edge computing shows great capability in supporting and boosting 5G and Internet-of-Things (IoT) oriented applications, e.g., scientific workflows with low-latency, elastic, and on-demand provisioning of computational resources. However, the geographically distributed IoT resources are usually interconnected with each other through unreliable communications and ever-changing contexts, which brings in strong heterogeneity, potential vulnerability, and instability of computing infrastructures at different levels. It thus remains a challenge to enforce high fault-tolerance of edge-IoT scientific computing task flows, especially when the supporting computing infrastructures are deployed in a collaborative, distributed, and dynamic environment that is prone to faults and failures. This work proposes a novel fault-tolerant scheduling approach for edge-IoT collaborative workflows. The proposed approach first conducts a dependency-based task allocation analysis, then leverages a Primary-Backup (PB) strategy for tolerating task failures that occur at edge nodes, and finally designs a deep Q-learning algorithm for identifying the near-optimal workflow task scheduling scheme. We conduct extensive simulative case studies on multiple randomly-generated workflow and real-world edge-IoT server position datasets. Results clearly suggest that our proposed method outperforms the state-of-the-art competitors in terms of task completion ratio, server active time, and resource utilization. 展开更多
关键词 Edge computing Fault tolerance DQN algorithm primary-backup model
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部