基于深度强化学习的无人机路径规划与无线电测绘

UAV Path Planning and Radio Mapping Based on Deep Reinforcement Learning

下载PDF

导出

摘要针对传统无人机轨迹优化设计方法在构建通信模型上具有局限性的问题,本文面向蜂窝连接无人机通信方式,引入一种基于深度强化学习的无人机路径规划与无线电测绘方法。该方法利用扩展后的双深Q网络模型,结合无线电预测网络,生成无人机轨迹并预测由于动作选择而累计的奖励值。此外,基于Dyna框架将实际飞行和模拟飞行相结合,进一步训练双深Q网络模型,从而大大提高学习效率。仿真结果表明,与Direct-RL算法相比,该方法能更有效地利用学习到的覆盖区域概率图,使无人机避开弱覆盖区域,减小飞行时间和预期中断时间的加权和。 To address the limitations of traditional UAV trajectory optimization design methods in building communication models,this paper presents a deep reinforcement learning-based UAV path planning and radio mapping in cellular-connected UAV communication systems.The proposed method utilizes an extended double-deep Q-network(DDQN)model combined with a radio prediction network to generate UAV trajectories and predict the reward values accumulated due to action selection.Furthermore,the method trains the DDQN model by combining actual and simulated flights based on Dyna framework,which greatly improves the learning efficiency.Simulation results show that the proposed method utilizes the learned coverage area probability map more effectively compared to the Direct-RL algorithm,enabling the UAV to avoid weak coverage areas and reducing the weighted sum of flight time and expected interruption time.

作者王鑫仲伟志王俊智肖丽君朱秋明 WANG Xin;ZHONG Weizhi;WANG Junzhi;XIAO Lijun;ZHU Qiuming(College of Astronautics,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,Jiangsu,China;College of Electronic and Information Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,Jiangsu,China)

机构地区南京航空航天大学航天学院南京航空航天大学电子信息工程学院

出处《应用科学学报》 CAS CSCD 北大核心 2024年第2期200-210,共11页 Journal of Applied Sciences

关键词无人机蜂窝通信路径规划深度强化学习无线电测绘 UAV cellular communication path planning deep reinforcement learning radio mapping

分类号 TN929.5 [电子电信—通信与信息系统]

引文网络
相关文献

1曾祥燕,张博,潘仕彬.基于OBE教育理念的“无人机模拟飞行”课程教学设计与实施研究[J].科教导刊,2024(6):114-116.
2李云,张剑鑫,姚枝秀,夏士超.邻域感知的分布式智能边缘计算卸载和资源分配算法[J].中国科学：信息科学,2024,54(2):413-429.
3Ziying Zhang,Xian Li,Yuhua Wang.Research on Path Planning of Mobile Robots Based on Dyna-RQ[J].国际计算机前沿大会会议论文集,2023(1):49-59.
4徐晓雅,王章琼,李雷烈,赵歧林,周意,蔡永辉.山岭隧道地上地下一体化三维建模方法[J].科学技术与工程,2024,24(8):3373-3380.
5Haokun Yuan,Ruiqin Fang,Chi Fu,Shuo Wang,Xiaoqin Tong,Deyi Feng,Xiaoqing Wei,Xirong Hu,Yuan Wang.ATIP/ATIP1 regulates prostate cancer metastasis through mitochondrial dynamic-dependent signaling[J].Acta Biochimica et Biophysica Sinica,2024,56(2):304-314.
6陈洋,周锐.通信受限条件下多无人机协同环境覆盖路径规划[J].中国惯性技术学报,2024,32(3):273-281.
7王建红,宋倩宇.新媒体背景下思想政治教学信息干扰:模型、类型及管控[J].徐州工程学院学报（社会科学版）,2024,39(1):93-101.
8庄陵,刘宇航.密集城市群中基于智能反射面的传输方案[J].华南理工大学学报（自然科学版）,2024,52(3):112-118.

应用科学学报

2024年第2期

浏览历史

内容加载中请稍等...

基于深度强化学习的无人机路径规划与无线电测绘

相关作者

相关机构

相关主题

浏览历史