期刊文献+

未知环境下基于有先验知识的滚动Q学习机器人路径规划 被引量:11

Path planning of robot for unknown environment based on prior knowledge rolling Q-learning
原文传递
导出
摘要 提出一种未知环境下基于有先验知识的滚动Q学习机器人路径规划算法.该算法在对Q值初始化时加入对环境的先验知识作为搜索启发信息,以避免学习初期的盲目性,可以提高收敛速度.同时,以滚动学习的方法解决大规模环境下机器人视野域范围有限以及因Q学习的状态空间增大而产生的维数灾难等问题.仿真实验结果表明,应用该算法,机器人可在复杂的未知环境中快速地规划出一条从起点到终点的优化避障路径,效果令人满意. A path planning of rolling Q-learning algorithm based on the prior knowledge in the unknown environment is proposed. The prior knowledge about the environment is added as heuristic information of Q learning to initialize the Q value, so as to avoid the blindness of early-stage learning and improve rate of convergence. Besides, the method of rolling learning is used for solving the problems of limited visual domain of the robot as well as dimensionality disaster caused by the increase in state space of Q-learning in a large scale environment. The simulation results show that, the robot can not only avoid collision safely, but also find out an optimal path by using the algorithm in the unknown environment, and the results obtained are satisfactory.
作者 胡俊 朱庆保
出处 《控制与决策》 EI CSCD 北大核心 2010年第9期1364-1368,共5页 Control and Decision
基金 国家自然科学基金项目(60673102) 江苏省自然科学基金项目(BK2006218)
关键词 滚动路径规划 移动机器人 先验知识 Q学习 未知环境 Rolling path planning Mobile robot Prior knowledge Q-learning Unknown environment
  • 相关文献

参考文献13

  • 1Ahuh D J, Park J H. Path planning and navigation for autonomous mobile robot[C]. IEEE 28th the Annual Conf of the Industrial Electronics Society. Seville: IEEE Press, 2002: 1538-1542.
  • 2Cabin I, Land S. Adaptation of the A* algorithm for the computation of fastest paths in deterministic discrete- time dynamic networks[J]. IEEE Trans on Intelligent Transportation Systems, 2002, 3(1): 60-74.
  • 3Rimon E. Exact robot navigation using artificial potential functions[J]. IEEE Trans on Robotics and Automation, 1992, 8(5): 501-518.
  • 4Lavelle S M, Kuffner J. Randomized kino dynamic planning[J]. Int J of Robotics Research, 2001, 20(5): 378- 398.
  • 5张纯刚,席裕庚.基于局部探测信息的机器人滚动路径规划(英文)[J].自动化学报,2003,29(1):38-44. 被引量:14
  • 6席裕庚,张纯刚.一类动态不确定环境下机器人的滚动路径规划[J].自动化学报,2002,28(2):161-175. 被引量:93
  • 7Sutton R, Barto A G. Reinforcement learning: An introduction[M]. Cambridge: MIT Press, 1998.
  • 8Smart W D, Kaelbling L E Effective reinforcement learning for mobile robots[C]. Proc of the IEEE Int Conf on Robotics and Automation. Washington, 2002: 3404-3410.
  • 9Steven D W, Lin L J. Reinforcement learning of non- Markov decision processes[J]. Artificial Intelligent, 1995, 73: 271-306.
  • 10宋清昆,胡子婴.基于经验知识的Q-学习算法[J].自动化技术与应用,2006,25(11):10-12. 被引量:7

二级参考文献14

  • 1MitchellTM著 曾华军 张银奎译.机器学习[M].北京:机械工业出版社,2003..
  • 2Sankaranarayanan A, Vidyasagar M. Anew path planning algorithm for moving a point object amidst unknown obstacles in a plane.In: Proceedings of IEEE Conference on Robotics and Automation, France:Nice, 1990. 1930~1936
  • 3Borenstein J, Koren Y. Real time obstacle avoidance for fast mobile robots. IEEETransactions on Systems, Man and Cybernetics, 1989, 19(5):1179~1187
  • 4Tilove R B. Local obstacle avoidance for mobile robots based on the method ofartificial potentials. In: Proceedings of IEEE Conference on Robotics and Automation,France: Nice, 1990. 566~571
  • 5Lumelsky V J. Algorithm and complexity issues of robot motion in an uncertainenvironment. Journal of Complexity, 1987, 3(2):146~182
  • 6Iyengar S S, Jorgensen C C, Rao S V N, Weisbin C R. Learned navigation paths for arobot in unexplored terrain. In: Proceedings of 2nd Conference on Artificial IntelligenceApplications and Engineering of Knowledge Based Systems, USA:Miami Beach, Florida, 1985.11~13
  • 7Xi Yu-Geng. Predictive control. Beijing: National Defense Industry Press, 1993(inChinese)
  • 8Zhang Chun-Gang, Xi Yu-Geng. Robot path planning in globally unknown environmentsbased on rolling windows. Science in China(E), 2001, 44(2): 131~139(in Chinese)
  • 9C.J.C.H.WATKINS,"Learning from delayed rewards"[D],PhD Thesis of the King's College,University of Cambridge,England,1989.
  • 10席裕庚.动态不确定环境下广义控制问题的预测控制[J].控制理论与应用,2000,17(5):665-670. 被引量:71

共引文献110

同被引文献134

引证文献11

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部