基于深度强化学习的AUV路径规划研究

Research on AUV Path Planning Based on Deep Reinforcement Learning

下载PDF

导出

摘要针对三维海洋环境水下自主航行器(AUV)路径规划问题,传统的路径规划算法在三维空间中搜索时间长,对环境的依赖性强,且环境发生改变时,需要重新规划路径,不满足实时性要求。为了使AUV能够自主学习场景并做出决策,提出一种改进的Dueling DQN算法,更改了传统的网络结构以适应AUV路径规划场景。此外,针对路径规划在三维空间中搜寻目标点困难的问题,在原有的优先经验回放池基础上提出了经验蒸馏回放池,使智能体学习失败经验从而提高模型前期的收敛速度和稳定性。仿真实验结果表明:所提出的算法比传统路径规划算法具有更高的实时性,规划路径更短,在收敛速度和稳定性方面都优于标准的DQN算法。 Traditional path planning algorithms for autonomous underwater vehicles(AUV)in 3D marine environments suffer from long search times,strong dependence on environment,and the need for re-planning when environment changes,which fails to meetreal-timerequirements.To enable AUVs to autonomously learn scenes and make decisions,an improved Dueling DeepQ-Network(DQN)algorithm was proposed,in which the traditional network structure was modified to adapt to AUV path planning scenarios.Additionally,addressing the difficulty of searching for target points in 3D space,an experience distillation replay pool was introduced based on the existing prioritized experience replay pool.This allowed the agent to learn from failure experiences and improved the convergence speed and stability of the model in the early stages.Simulation experimental results demonstrate that the proposed algorithm outperforms traditional path planning algorithms in terms of real-time performance and shorter planned paths.It also surpasses the standard DQN algorithm in terms of convergence speed and stability.

作者房鹏程周焕银董玫君 FANG Pengcheng;ZHOU Huanyin;DONG Meijun(School of Mechanical and Electronic Engineering,East China University of Technology,Nanchang Jiangxi 330000,China)

机构地区东华理工大学机械与电子工程学院

出处《机床与液压》北大核心 2024年第9期134-141,共8页 Machine Tool & Hydraulics

基金国家自然科学基金项目(62063001) 江西省自然科学基金项目(20224ACB204022)。

关键词自主水下航行器(AUV) 三维路径规划深度强化学习 Dueling DQN算法 autonomous underwater vehicles(AUV) 3D path planning deep reinforcement learning Dueling DQN algorithm

分类号 U675.73 [交通运输工程—船舶及航道工程]

引文网络
相关文献

参考文献2

1朱蟋蟋,孙兵,朱大奇.基于改进D^(*)算法的AUV三维动态路径规划[J].控制工程,2021,28(4):736-743. 被引量：19
2Madhusmita Panda,Bikramaditya Das,Bidyadhar Subudhi,Bibhuti Bhusan Pati.A Comprehensive Review of Path Planning Algorithms for Autonomous Underwater Vehicles[J].International Journal of Automation and computing,2020,17(3):321-352. 被引量：17

二级参考文献13

1方忆湘,刘文学.基于几何特性的三次均匀B样条曲线构造描述[J].工程图学学报,2006,27(2):96-102. 被引量：28
2张立川,刘明雍,徐德民,严卫生.多UUV协同导航与定位研究(英文)[J].系统仿真学报,2008,20(19):5342-5344. 被引量：16
3章国林,李平,韩波,郑巍.多雷达威胁环境下的无人机路径规划[J].计算机工程,2011,37(4):206-209. 被引量：15
4肖轶军,丁明跃,彭嘉雄.基于迭代最近点的B样条曲线拟合方法研究[J].中国图象图形学报（A辑）,2000,5(7):585-588. 被引量：33
5Mohammad Pourmahmood Aghababa Mohammad Hossein Amrollahi Mehdi Borjkhani.Application of GA, PSO, and ACO Algorithms to Path Planning of Autonomous Underwater Vehicles[J].Journal of Marine Science and Application,2012,11(3):378-386. 被引量：8
6饶盛,初磊,王珊.基于D＊算法方向指针的水下远程武器航路规划方法研究[J].舰船电子工程,2012,32(11):31-32. 被引量：2
7吴剑,张东豪.基于改进D*算法的无人机航路规划及光顺[J].航空科学技术,2013(6):69-71. 被引量：4
8占伟伟,王伟,陈能成,王超.一种利用改进A*算法的无人机航迹规划[J].武汉大学学报（信息科学版）,2015,40(3):315-320. 被引量：55
9张贺,胡越黎,王权,燕明.基于改进D*算法的移动机器人路径规划[J].工业控制计算机,2016,29(11):73-74. 被引量：14
10刘晓涛,蔡云飞,王田橙.基于SVM的受约束D*算法在无人车寻路中的应用[J].计算机与数字工程,2017,45(9):1748-1754. 被引量：8

共引文献34

1Mohammad Al-Fetyani,Mohammad Hayajneh,Adham Alsharkawi.Design of an Executable ANFIS-based Control System to Improve the Attitude and Altitude Performances of a Quadcopter Drone[J].International Journal of Automation and computing,2021,18(1):124-140. 被引量：3
2Wen-Jing Hong,Peng Yang,Ke Tang.Evolutionary Computation for Large-scale Multi-objective Optimization: A Decade of Progresses[J].International Journal of Automation and computing,2021,18(2):155-169. 被引量：6
3孙辉辉,胡春鹤,张军国.移动机器人运动规划中的深度强化学习方法[J].控制与决策,2021,36(6):1281-1292. 被引量：31
4Bin Xin,Junxi Zhang,Jie Chen,Qing Wang,Yun Qu.Overview of Research on Transformation of Multi-AUV Formations[J].Complex System Modeling and Simulation,2021,1(1):1-14. 被引量：3
5Nacer Hacene,Boubekeur Mendil.Behavior-based Autonomous Navigation and Formation Control of Mobile Robots in Unknown Cluttered Dynamic Environments with Dynamic Target Tracking[J].International Journal of Automation and computing,2021,18(5):766-786. 被引量：6
6M.R.Rahimi Khoygani,R.Ghasemi,P.Ghayoomi.Robust Observer-based Control of Nonlinear Multi-omnidirectional Wheeled Robot Systems via High Order Sliding-mode Consensus Protocol[J].International Journal of Automation and computing,2021,18(5):787-801. 被引量：1
7程建华,李鹏程,管行,葛靖宇.基于改进A^(*)算法的UUV冰下避障航迹规划算法[J].导航定位与授时,2021,8(6):13-18. 被引量：2
8Ru-Xiang Hua,Wei Zou,Guo-Dong Chen,Hong-Xuan Ma,Wei Zhang.A Model of Spray Tool and a Parameter Optimization Method for Spraying Path Planning[J].International Journal of Automation and computing,2021,18(6):1017-1031. 被引量：1
9王巍,邢朝洋,冯文帅.自主导航技术发展现状与趋势[J].航空学报,2021,42(11):11-29. 被引量：36
10潘绍飞.无人驾驶汽车路径规划算法研究综述[J].汽车实用技术,2022,47(4):162-165. 被引量：3

1付为刚,廖喆.车辆与无人机协同配送路径规划问题研究进展[J].内燃机与配件,2024(9):129-131. 被引量：1
2黄小红.探析大数据应用背景下内审人员应具备的能力[J].IT经理世界,2023(12):13-16.
3西工大仿蝠鲼柔体潜水器对南海开展珊瑚礁监测[J].陕西教育（高教版）,2024(6):9-9.
4王景楠,齐向东,刘丹.基于SSA-模糊PID的AUV姿态控制研究[J].计算机测量与控制,2024,32(5):144-150.
5王浩亮,柴亚星,王丹,刘陆,王安青,彭周华.基于事件触发机制的多自主水下航行器协同路径跟踪控制[J].自动化学报,2024,50(5):1024-1034. 被引量：1
6王怀民.群智范式:软件开发的范式变革[J].科学中国人,2024(4):30-31.
7徐啟蕾,任文杰,庞衍硕,张嘉琪.基于启发式映射法的未知三维环境路径规划[J].计算机应用与软件,2024,41(5):304-309.
8孔晓.基于物流管理的井下原煤运输路径规划与调度研究[J].内蒙古煤炭经济,2024(6):82-84.
9彭振春,王涛,刘含,朱耀辉,雷文静.移动边缘计算中无人机三维轨迹和计算卸载的联合优化策略研究[J].工业控制计算机,2024,37(5):93-95.
10岳旭生,李军,王耀弘.自动驾驶汽车路径规划研究综述[J].传感器世界,2024,30(3):1-8.

机床与液压

2024年第9期

浏览历史

内容加载中请稍等...

基于深度强化学习的AUV路径规划研究

参考文献2

二级参考文献13

共引文献34

相关作者

相关机构

相关主题

浏览历史