USV Path Planning Method under a Double-Layer Recurrent Neural Network Framework


Abstract: Under a fully connected neural network structure, the Actor-Critic algorithm trains slowly, converges with difficulty, and struggles to handle long action memory sequences in complex path planning environments. To address these shortcomings, this paper proposes a path planning algorithm for unmanned surface vessels (USVs) based on a double-layer recurrent neural network (DRNN). The input to the algorithm is not a single state but a sequence of a certain length (a macro action) composed of states, actions, and rewards. Architecturally, a recurrent neural network (RNN) retains historical information and uses it to influence the current input and output; the DRNN, which is built on the RNN structure, shares this property. Because the DRNN takes into account the history of environment interaction over a certain period, it helps the neural network recognize patterns in continuous action sequences (macro actions). In simulation experiments, the algorithm is compared with the conventional Actor-Critic algorithm on several maps. The results show that the proposed algorithm clearly improves on the Actor-Critic algorithm in average number of steps, success rate, and average reward.
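The data flow described in the abstract can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the class names (`RNNLayer`, `DRNNActorCritic`), the one-hot action encoding, the hidden size, the window length of 4, and the untrained random weights are all assumptions made for illustration, and training is omitted. It only shows a window of (state, action, reward) steps passing through two stacked recurrent layers into an actor head and a critic head.

```python
import math
import random
from collections import deque


def make_matrix(rows, cols, rng):
    """Small random weight matrix (rows x cols) as nested lists."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(m, v):
    """Matrix-vector product for nested-list matrices."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class RNNLayer:
    """Elman recurrent layer: h_t = tanh(W_x x_t + W_h h_{t-1})."""

    def __init__(self, in_dim, hid_dim, rng):
        self.w_x = make_matrix(hid_dim, in_dim, rng)
        self.w_h = make_matrix(hid_dim, hid_dim, rng)
        self.hid_dim = hid_dim

    def run(self, inputs):
        h = [0.0] * self.hid_dim  # hidden state starts at zero
        outputs = []
        for x in inputs:
            a = matvec(self.w_x, x)
            b = matvec(self.w_h, h)
            h = [math.tanh(p + q) for p, q in zip(a, b)]
            outputs.append(h)
        return outputs  # one hidden vector per time step


class DRNNActorCritic:
    """Hypothetical two-layer recurrent network with actor and critic heads."""

    def __init__(self, state_dim, n_actions, hid_dim=8, seed=0):
        rng = random.Random(seed)
        self.n_actions = n_actions
        step_dim = state_dim + n_actions + 1  # state + one-hot action + reward
        self.layer1 = RNNLayer(step_dim, hid_dim, rng)
        self.layer2 = RNNLayer(hid_dim, hid_dim, rng)
        self.actor = make_matrix(n_actions, hid_dim, rng)
        self.critic = make_matrix(1, hid_dim, rng)

    def encode(self, state, action, reward):
        """Flatten one (state, action, reward) step into a single vector."""
        one_hot = [1.0 if i == action else 0.0 for i in range(self.n_actions)]
        return list(state) + one_hot + [float(reward)]

    def forward(self, history):
        """Map a non-empty history window to (action probabilities, value)."""
        seq = [self.encode(s, a, r) for s, a, r in history]
        top = self.layer2.run(self.layer1.run(seq))[-1]  # last step, top layer
        logits = matvec(self.actor, top)
        m = max(logits)
        exps = [math.exp(z - m) for z in logits]  # numerically stable softmax
        total = sum(exps)
        probs = [e / total for e in exps]
        value = matvec(self.critic, top)[0]
        return probs, value


# Usage: keep only the most recent steps as the network input (assumed window of 4).
history = deque(maxlen=4)
for step in [((0.0, 0.0), 1, -1.0), ((0.0, 1.0), 1, -1.0), ((1.0, 1.0), 2, -1.0)]:
    history.append(step)

model = DRNNActorCritic(state_dim=2, n_actions=4)
probs, value = model.forward(history)
```

The `deque` with `maxlen` is one simple way to keep a sequence of bounded length; a real agent would sample an action from `probs` and update the weights from the critic's value estimate, neither of which is shown here.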

Authors: ZHANG Zhixin (张志鑫); GAO Jian (高健); ZHAO Dawei (赵大威) (Shenyang Bureau of the Naval Equipment Department, Harbin 150001, China; College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China)

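Affiliations: Shenyang Bureau of the Naval Equipment Department; College of Intelligent Systems Science and Engineering, Harbin Engineering University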

Source: Applied Science and Technology (《应用科技》), CAS, 2023, No. 3, pp. 100-107 (8 pages)

Keywords: fully connected neural network; path planning; recurrent neural network; memory sequences; macro action; double-layer network architecture; state historical information

Classification: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]

