期刊文献+

基于增强学习算法的AGV导航研究

Research on AGV navigation based on reinforcement learning algorithm
下载PDF
导出
摘要 AGV(Automated Guided Vehicle)导航问题是一个连续状态空间问题,直接应用传统表格型增强学习方法存在维数灾难和学习效率低等问题。近年来,近似强化学习和策略梯度算法等增强学习方法的提出和改进为有效解决维数灾难和泛化问题以及AGV反应导航问题提供新的方法和手段。文中首先分析了AGV的运动环境模型,并建立了马尔可夫决策过程模型。在此基础上结合神经网络和策略梯度下降法,提出了一种基于Actor-Critic算法框架的AGV导航算法,并利用Matlab仿真验证了算法的适用性。 AGV(Automated Guided Vehicle)navigation problem is a continuous state space problem.Direct application of traditional tabular reinforcement learning method has encountered problems such as dimension disaster,low learning efficiency etc.In recent years,the present and improvement of reinforcement learning methods such as approximate reinforcement learning and strategy gradient algorithm provide new methods and means to effectively solve dimension disaster and generalization problems,as well as AGV response navigation problems.Firstly,this paper analyzes the movement environment model of AGV and establishes the Markov decision processes model.Based on the above,combined with neural network and strategy gradient descent method,an AGV navigation algorithm based on Actor-Critic algorithm framework is put forward,and the applicability of the algorithm is verified by applying Matlab simulation.
作者 王翔 吴洪明 周星 Wang Xiang;Wu Hongming;Zhou Xing
机构地区 武汉理工大学
出处 《起重运输机械》 2019年第12期40-45,共6页 Hoisting and Conveying Machinery
关键词 增强学习算法 AGV导航 马尔可夫决策 MATLAB仿真 reinforcement learning algorithm AGV navigation Markov decision Matlab simulation
  • 相关文献

参考文献1

二级参考文献7

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部