期刊文献+

基于深度强化学习的无人机航路规划算法研究

Research on UAV Path Planning Method Based on Deep Reinforcement Learning
下载PDF
导出
摘要 航路规划是无人机在复杂战场环境中完成作战任务的关键技术之一。本文提出了一种基于PER-D3QN的无人机航路规划算法,通过网络模型设计、状态空间设计、动作空间设计和收益函数设计实现无人机在战场环境下的航路规划。PER-D3QN算法将目标网络模型、竞争网络模型和优先级经验重现策略进行结合,有效地解决了深度强化学习方法存在的过拟合问题和网络优化不稳定问题。最后,通过仿真试验验证了所提算法相较于Double DQN和DQN算法具有更好的收敛性、稳定性和适用性,相较于A*算法具有较好的实时性,可高效实现无人机在复杂战场环境下的航路规划,有效帮助无人机遂行作战任务。 Path planning is one of the key technologies for UAVs to accomplish operation missions in complex battlefield environments.In this paper,we propose a UAV path planning method based on PER-D3QN,which realizes the path planning for UAVs in the battlefield environment through network model design,state space design,action space design and reward function design.The PER-D3QN algorithm combines the target network,dueling network and prioritized experience replay,which effectively solves the overfitting problem and unstable problem in deep reinforcement learning.In the end,it is verified through simulation experiments that the proposed method achieved better convergence,stability and applicability compared with double DQN and DQN algorithms,and better real-time performance compared with A*algorithm,which can efficiently realize the path planning of UAVs in the complex battlefield environment,and effectively help the UAVs to attempt the operational mission.
作者 毕文豪 段晓波 Bi Wenhao;Duan Xiaobo(Northwestern Polytechnical University,Xi’an 710072,China)
机构地区 西北工业大学
出处 《航空科学技术》 2023年第12期118-124,共7页 Aeronautical Science & Technology
基金 航空科学基金(201905053001) 国家自然科学基金(62073267,61903305)。
关键词 无人机 航路规划 深度强化学习 战场环境建模 PER-D3QN UAV path planning deep reinforcement learning battlefield environment modeling PER-D3QN
  • 相关文献

参考文献9

二级参考文献67

  • 1李宪强,马戎,张伸,侯砚泽,裴毅飞.蚁群算法的改进设计及在航迹规划中的应用[J].航空学报,2020(S02):213-219. 被引量:31
  • 2吴坤,谭劭昌.基于改进鲸鱼优化算法的无人机航路规划[J].航空学报,2020(S02):107-114. 被引量:23
  • 3Zhang X Y,Wu M, Peng J, et al. A rescue robot path planning based on ant colony optimization algorithm [ C ] //International Conference on Information Technology and Computer Science 2009.Piscataway, N J: IEEE Press ,2009 : 180-183.
  • 4Khanmohammadi S,Zarrin R S. intelligent path planning for res- cue robot[ J]. World Academy of Science, Engineering and Tech- nology,2011,5 ( 7 ) :607-612.
  • 5Norouzi M, Bruijn F D, Miro J V. Planning stable paths for urban search and rescue robots [ J ]. Computer Science, 2012,7416 : 90-101.
  • 6Pang T,Ruan X G, Wang E S, et al. Search and rescue robot path planning in unknown environment [ J ]. Applied Mechanics and Materials ,2013,241 : 1682-1687.
  • 7Pang T,Ruan X G,Wang E S,et al. Based on A * and Q-learn- ing search and rescue robot navigation [ J ]. Telkomnika- Indone- sian Journal of Electrical Engineering, 2012,10 ( 7 ) : 1889-1896.
  • 8Sun H L, Yue L Y, Yao S Y. Study on selection of emergency rescue based on GIS [ J ]. Advanced Materials Research, 2014, 864 : 2804-2807.
  • 9Sullivan T A, Van J D. Multi-objective, multi-domain genetic op- timization of a hydraulic rescue spreader [ J ]. Mechanism and Machine Theory,2014,80:35-51.
  • 10Liu T L,Wu C D, Li B, et al. The adaptive path planning re-search for a shape-shifting robot using particle swarm optimiza- tion [ C ] // International Conference on Natural Computation 2009. Piscataway, NJ : IEEE Press ,2009:324-328.

共引文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部