Abstract
In harsh outdoor environments, unmanned aerial vehicles (UAVs), with their flexibility and convenience, can be used to carry user tasks to edge servers via wireless data transmission. However, UAV flight platforms struggle to provide long-duration task offloading services, which greatly limits their application prospects. This paper studies how to effectively integrate UAV task offloading and charging scheduling in a mobile edge computing environment. First, a new application model is constructed that jointly handles the UAV's task offloading scheduling and its own charging needs, introducing several wireless charging platforms into the UAV-assisted task offloading scenario. Second, the value of user tasks and the UAV's charging needs are taken into account to optimize the benefit of UAV-assisted task offloading for user devices under delay-sensitive and energy-constrained conditions. Finally, a deep reinforcement learning approach is adopted: the deep Q-network (DQN) is tuned to form the Fixed DQN algorithm, which effectively handles the large-scale state-action search space of the model. Under the premise that the UAV serves only as a task carrier and that its autonomous charging needs are considered, the feasibility of the Fixed DQN algorithm is verified in a region with a radius of 3000 m containing 11 nodes, and its performance is evaluated against the ant colony algorithm, the genetic algorithm, and the DQN algorithm under different numbers of user nodes, numbers of charging nodes, and service times. Experimental results show that the proposed Fixed DQN algorithm significantly outperforms the ant colony, genetic, and DQN algorithms under all test conditions, especially in scenarios with more nodes and longer service times; moreover, the performance gain of Fixed DQN over DQN highlights the effectiveness of deep reinforcement learning in parameter tuning. These findings confirm the efficiency of the Fixed DQN algorithm and the importance of its parameter tuning strategy in solving the UAV task offloading and charging scheduling problem.
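The benefit objective described above is not given in closed form in this record; as a rough notational sketch only (all symbols are assumptions, not taken from the paper: $v_i$ is the value of user task $i$, $x_i$ indicates whether that task is offloaded, $\tau_i$ is its deadline, and $E^{\mathrm{res}}(t)$ is the UAV's residual energy, replenished at charging nodes over the service time $T$), the problem can be read as a constrained value maximization:

```latex
% Hedged sketch only: symbols and constraints are assumed, not the paper's model.
\[
\begin{aligned}
\max \quad & \sum_{i \in \mathcal{U}} v_i\, x_i
  && \text{(total value of served user tasks)}\\
\text{s.t.} \quad & t_i^{\mathrm{finish}} \le \tau_i \quad \forall\, i:\ x_i = 1
  && \text{(delay sensitivity: task deadlines)}\\
& E^{\mathrm{res}}(t) \ge E_{\min} \quad \forall\, t \in [0, T]
  && \text{(energy never depleted; recharging at wireless charging nodes)}\\
& x_i \in \{0, 1\} \quad \forall\, i \in \mathcal{U}
  && \text{(each user task is either served or skipped)}
\end{aligned}
\]
```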
In applications in harsh outdoor environments, unmanned aerial vehicles (UAVs), known for their flexibility and convenience, were utilized to carry user tasks to edge servers through wireless data transmission. However, UAV flight platforms struggled to provide long-duration task offloading services, which significantly limited their application prospects. This study investigated how to effectively integrate UAV task offloading and charging scheduling in a mobile edge computing environment.

Firstly, a new application model was constructed that cohesively managed UAV task offloading scheduling together with the UAV's own charging needs, incorporating several wireless charging platforms into the UAV-assisted task offloading scenario. These platforms enabled UAVs to recharge autonomously during task execution, providing automated charging services without human intervention. A UAV independently decided whether to proceed to the nearest charging node for power replenishment based on its current power level and upcoming task offloading plan. However, opting to recharge not only incurred additional time and energy consumption for the descent from cruising altitude to the charging station, but also required accounting for the time spent charging and its impact on the overall task schedule.

Secondly, the value of user tasks and the UAV's charging needs were jointly considered to optimize the benefit of UAV-assisted task offloading for user devices under delay-sensitive and energy-constrained conditions. This involved optimizing not only the UAV's flight path and task allocation but also its charging schedule, ensuring sufficient charging and efficient operation while executing tasks. Such a cooperative scheduling strategy enabled the UAV to maximize the processing of user tasks while maintaining the necessary operational energy, thereby enhancing the performance of the entire mobile edge computing system.

Finally, a deep reinforcement learning algorithm was employed: the deep Q-network (DQN) was fine-tuned to form the Fixed DQN algorithm, which effectively addressed the large-scale state-action search space of the model. This approach handled the complex decision-making problem and enabled effective learning and optimization across a wide state space; with a deep learning framework, the algorithm processed high-dimensional input data and made accurate offloading and charging decisions in varying dynamic environments, which was essential for improving the efficiency and effectiveness of UAV task offloading and charging scheduling. The algorithm design comprehensively considered the following key aspects. Initially, the state space and action space were defined, ensuring that the agent could accurately perceive the environment and make effective decisions. Subsequently, the composition of the reward function was detailed to guide the agent toward the desired goal during training. It was found that guiding the agent solely by maximizing task offloading benefits prevented it from satisfying the condition of serving each user at least once; therefore, a method of minor learning goal constraints was proposed, in which the task offloading rewards accumulated before the minor learning goals were completed were not awarded directly, preventing the agent from deviating from the path to achieving these goals. Afterwards, an experience replay mechanism was introduced, which improved learning efficiency and reduced correlations between samples by storing and reusing past experiences, and two asynchronously updated neural networks were employed to stabilize the learning process. On this basis, the hyperparameters of the Fixed DQN algorithm were meticulously optimized to further enhance its performance.

Most current research has been based on the assumption that UAVs possess certain task processing capabilities. However, a different assumption was adopted in this paper: the UAV's primary role was only to carry tasks, not to participate directly in task processing, and its autonomous charging needs were also considered. This assumption is closer to actual application scenarios, where UAVs are primarily used for data collection and transmission rather than data processing, and it takes into account the limited endurance of UAVs and their need to recharge during task execution.

In the study, 11 nodes were set up within a circular area with a radius of 3000 m as a test environment to verify the feasibility of the Fixed DQN algorithm. To comprehensively evaluate its performance, extensive experiments were then conducted under various conditions, including different numbers of user nodes, different numbers of charging nodes, and varying service times, with comparisons against the ant colony algorithm, the genetic algorithm, and the DQN algorithm. In this way, the effectiveness of the Fixed DQN algorithm in different scenarios, especially in complex and dynamically changing environments, was thoroughly examined.

The experimental results showed that the Fixed DQN algorithm significantly outperformed the ant colony algorithm, the genetic algorithm, and the DQN algorithm under all test conditions, particularly in scenarios with more nodes and longer service times. Furthermore, the performance improvement of Fixed DQN over DQN highlighted the effectiveness of deep reinforcement learning with careful parameter tuning. These findings confirm the efficiency of the Fixed DQN algorithm and the importance of parameter tuning strategies in addressing UAV task offloading and charging scheduling problems.
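The design steps listed above (defining the state and action spaces, shaping the reward with minor learning goal constraints, experience replay, and two asynchronously updated networks) correspond to the standard fixed-target DQN recipe. A minimal sketch of how these pieces might fit together is given below; the network sizes, state/action encodings, hyperparameters, and the reward-gating rule are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of the Fixed DQN ingredients described in the abstract:
# an online network and an asynchronously updated target network, uniform
# experience replay, and reward gating that withholds offloading rewards
# until the minor learning goal (serving every user at least once) is met.
# All sizes and hyperparameters below are illustrative assumptions.
import random
from collections import deque

import torch
import torch.nn as nn
import torch.optim as optim


class QNet(nn.Module):
    """Small MLP mapping a state vector to Q-values over the discrete actions
    (e.g., visit a user node, visit a charging node, or return)."""

    def __init__(self, state_dim: int, n_actions: int):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )

    def forward(self, x):
        return self.layers(x)


class FixedDQNAgent:
    def __init__(self, state_dim, n_actions, gamma=0.95, lr=1e-3,
                 buffer_size=10_000, batch_size=64, target_sync=200):
        self.online = QNet(state_dim, n_actions)
        self.target = QNet(state_dim, n_actions)
        self.target.load_state_dict(self.online.state_dict())
        self.optimizer = optim.Adam(self.online.parameters(), lr=lr)
        self.replay = deque(maxlen=buffer_size)   # experience replay buffer
        self.gamma, self.batch_size = gamma, batch_size
        self.target_sync, self.step_count = target_sync, 0
        self.n_actions = n_actions

    def act(self, state, epsilon):
        """Epsilon-greedy action selection over the discrete node-visit actions."""
        if random.random() < epsilon:
            return random.randrange(self.n_actions)
        with torch.no_grad():
            q = self.online(torch.as_tensor(state, dtype=torch.float32))
        return int(q.argmax().item())

    def shape_reward(self, raw_offload_reward, all_users_served):
        """Minor-learning-goal gating: offloading rewards are withheld until
        every user has been served at least once."""
        return raw_offload_reward if all_users_served else 0.0

    def store(self, s, a, r, s_next, done):
        self.replay.append((s, a, r, s_next, done))

    def learn(self):
        if len(self.replay) < self.batch_size:
            return
        batch = random.sample(list(self.replay), self.batch_size)
        s, a, r, s2, d = map(list, zip(*batch))
        s = torch.as_tensor(s, dtype=torch.float32)
        a = torch.as_tensor(a, dtype=torch.int64).unsqueeze(1)
        r = torch.as_tensor(r, dtype=torch.float32)
        s2 = torch.as_tensor(s2, dtype=torch.float32)
        d = torch.as_tensor(d, dtype=torch.float32)

        q_sa = self.online(s).gather(1, a).squeeze(1)
        with torch.no_grad():
            # Bootstrapped target from the slowly (asynchronously) updated network.
            q_next = self.target(s2).max(dim=1).values
            target = r + self.gamma * (1.0 - d) * q_next

        loss = nn.functional.smooth_l1_loss(q_sa, target)
        self.optimizer.zero_grad()
        loss.backward()
        self.optimizer.step()

        self.step_count += 1
        if self.step_count % self.target_sync == 0:
            self.target.load_state_dict(self.online.state_dict())
```

In this sketch the target network is copied from the online network only every `target_sync` learning steps, which is the usual role of the "two asynchronously updated neural networks", and offloading rewards are zeroed until every user has been served at least once, mirroring the minor-learning-goal constraint described above.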
Authors
何涵
刘鹏
赵亮
王青山
HE Han; LIU Peng; ZHAO Liang; WANG Qingshan (School of Computer Sci. and Technol., Hangzhou Dianzi Univ., Hangzhou 310018, China; School of Computer Sci., Shenyang Aerospace Univ., Shenyang 110136, China; School of Mathematics, Hefei Univ. of Technol., Hefei 230009, China)
Source
《工程科学与技术》
EI
CAS
CSCD
北大核心
2024, No. 1, pp. 99-109 (11 pages)
Advanced Engineering Sciences
Funding
General Program of the National Natural Science Foundation of China (62172134).
Keywords
edge computing
UAV
task offloading
reinforcement learning
charging scheduling