期刊文献+

基于神经网络Q-learning算法的智能车路径规划 被引量:17

Intelligent Vehicle Path Planning Based on Neural Network Q-learning Algorithm
下载PDF
导出
摘要 针对智能小车行走过程中的全局路径规划和路障规避问题,提出了一种基于神经网络Q-learning强化学习算法,采用RBF(Radial Basis Function)网络对Q学习算法的动作值函数进行逼近,基于MATLAB环境开发了智能小车全局路径规划和路障规避仿真系统。与传统的以及基于势场的Q学习算法相比,所采用的算法能更加有效地完成智能小车在行驶环境中的路径规划和路障规避。仿真结果表明:算法具有更好的收敛速度,可增强智能小车的自导航能力。 A reinforcement learning algorithm based on neural network Q-learning is proposed to solve the problem of global path planning and obstacle avoidance.RBF(Radial Basis Function)network is used to approximate the action value function of Q learning algorithm.The global path planning and obstacle avoidance simulation system is developed by MATLAB.Compared with the traditional and potential field Q algorithm,the algorithm can be more effective to complete the path planning and obstacle avoidance of intelligent car in the driving environment.The simulation results show that the algorithm has better convergence speed and the ability of self navigation.
作者 卫玉梁 靳伍银 WEI Yu-liang;JIN Wu-yin(School of Mechno-Electronic Engineering,Lanzhou University of Technology,Lanzhou 730050,China)
出处 《火力与指挥控制》 CSCD 北大核心 2019年第2期46-49,共4页 Fire Control & Command Control
基金 国家自然科学基金资助项目(11372122)
关键词 路径规划 智能小车 Q-LEARNING 神经网络 仿真 path planning intelligent car Q-learning neural network simulation
  • 相关文献

参考文献6

二级参考文献35

  • 1石鸿雁,孙茂相,孙昌志.未知环境下移动机器人路径规划方法[J].沈阳工业大学学报,2005,27(1):63-69. 被引量:10
  • 2戴博,肖晓明,蔡自兴.移动机器人路径规划技术的研究现状与展望[J].控制工程,2005,12(3):198-202. 被引量:75
  • 3丁伟,孙华,曾建辉.基于多传感器信息融合的移动机器人导航综述[J].传感器与微系统,2006,25(7):1-3. 被引量:12
  • 4孙章军,田海晏,邓双成,孙晨.移动机器人路径规划仿真平台设计[J].北京石油化工学院学报,2006,14(3):16-19. 被引量:2
  • 5Khatib O. Real - time Obstacle Avoidance for Manipulators and Mobile Robot [J]. The International Journal of Robotic Research,1986,5(1) :90 - 98.
  • 6Gemeinder M,Gerke M. GA - based Path Planning for Robot System Employing an Aetive Search Algorithm [J]. Applied Soft Computing, 2003(3) : 149 - 158.
  • 7Sutton R S, Barto A G. Reinforcement Learning: An Introduction [M]. Cambridge, MA : MIT Press, 1998.
  • 8Miyazaki K, Yamamura M, Kobayashi S. On the Rationality of Profit Sharing in Reinforcement Learning [C]. Proc. of the 3rd International Conference on Fuzzy Logic Neural Net and Soft Computing, 1994 : 285 - 288.
  • 9Labb A M, Kavraki L E. Measure Theoretic Analysis of Probabilistic Path Planning [J]. Robotics and Automation, IEEE Transactions on, 2004,20 (2): 229 - 242.
  • 10Smith, Andrew James. Applications of the Self - organizing Map to Reinforcement Learning[J]. Neural Networks,2002 (15):1 107-1 124.

共引文献21

同被引文献197

引证文献17

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部