A Double-Timescale Reinforcement Learning Based Cloud-Edge Collaborative Framework for Decomposable Intelligent Services in Industrial Internet of Things

下载PDF

导出

摘要 With the proportion of intelligent services in the industrial internet of things(IIoT)rising rapidly,its data dependency and decomposability increase the difficulty of scheduling computing resources.In this paper,we propose an intelligent service computing framework.In the framework,we take the long-term rewards of its important participants,edge service providers,as the optimization goal,which is related to service delay and computing cost.Considering the different update frequencies of data deployment and service offloading,double-timescale reinforcement learning is utilized in the framework.In the small-scale strategy,the frequent concurrency of services and the difference in service time lead to the fuzzy relationship between reward and action.To solve the fuzzy reward problem,a reward mapping-based reinforcement learning(RMRL)algorithm is proposed,which enables the agent to learn the relationship between reward and action more clearly.The large time scale strategy adopts the improved Monte Carlo tree search(MCTS)algorithm to improve the learning speed.The simulation results show that the strategy is superior to popular reinforcement learning algorithms such as double Q-learning(DDQN)and dueling Q-learning(dueling-DQN)in learning speed,and the reward is also increased by 14%.

作者 Zhang Qiuyang Wang Ying Wang Xue

机构地区 School of Information and Communication Engineering State Key Laboratory of Networking and Switching Technology(Beijing University of Posts and Telecommunications)

出处《China Communications》 SCIE CSCD 2024年第10期181-199,共19页 中国通信（英文版）

基金 supported by the National Natural Science Foundation of China(No.62171051)。

关键词 computing service edge intelligence industrial internet of things(IIoT) reinforcement learning(RL)

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1Rong Ma,Zhen Zhang,Yide Ma,Xiping Hu,Edith C.H.Ngai,Victor C.M.Leung.An improved pulse coupled neural networks model for semantic IoT[J].Digital Communications and Networks,2024,10(3):557-567.
2杨子强,卢家辉,周子安,朱佳妮,陆贝妮,张彬.传统农业电商数字化赋能转型平台的设计与研究[J].电子商务评论,2024,13(3):8921-8931.
3徐启阳.大数据和人工智能如何改变中国的数字鸿沟——以ChatGPT为例[J].现代管理,2024,14(9):2251-2259.
4苏靖雅.生成式人工智能浪潮下图书馆发展的策略研究[J].现代管理,2024,14(9):2193-2198.
5梁明霄,樊世达,许璧雯,王超.缆绳破断条件下FPSO码头系泊系统安全性分析[J].船舶标准化工程师,2024,57(S01):55-61.
6王心雨.数字经济对农业上市企业经营绩效的影响研究[J].电子商务评论,2024,13(3):5442-5451.
7Fangmin Wang,Wenlin Li,Hongfei Dai,Chunyi Li,Jianhua Zhou,Shenhui Xue,Bo Wang.A real-time performance improvement method for composite time scale[J].Chinese Physics B,2024,33(9):350-357.
8Hongzhao Xie,Zihang Gao,Guanglu Jia,Shingo Shimoda,Qing Shi.Learning Rat-Like Behavioral Interaction Using a Small-Scale Robotic Rat[J].Cyborg and Bionic Systems,2023(1):225-232.
9Maria Gabriela Garcia CAMPOS,Paul van BEURDEN.Potential of Applying Artificial Intelligence to Hot Metal Logistics Management[J].China's Refractories,2024,33(3):37-41.
10Yongbo Pan,Junzhi Cui,Zhenhao Xu.Multiscale method for identifying and marking the multiform fractures from visible-light rock-mass images[J].Underground Space,2024(3):279-300.

China Communications

2024年第10期

浏览历史

内容加载中请稍等...

A Double-Timescale Reinforcement Learning Based Cloud-Edge Collaborative Framework for Decomposable Intelligent Services in Industrial Internet of Things

相关作者

相关机构

相关主题

浏览历史