期刊文献+

基于支持向量机和Q学习的移动机器人导航 被引量:2

Mobile robot navigation based on support vector machine and Q-learning
下载PDF
导出
摘要 基于神经网络的连续状态空间Q学习已应用在机器人导航领域。针对神经网络易陷入局部极小,提出了将支持向量机与Q学习相结合的移动机器人导航方法。首先以研制的CASIA-I移动机器人和它的工作环境为实验平台,确定出Q学习的回报函数;然后利用支持向量机对Q学习的状态——动作对的Q值进行在线估计,同时,为了提高估计速度,引入滚动时间窗机制;最后对所提方法进行了实验,实验结果表明所提方法能够使机器人无碰撞的到达目的地。 Continuous Q-learning algorithm based on neural has been used in robotic navigation domain for its simplicity and well-developed theory.Aiming at the neural easily falling into local minimum,a new mobile robot navigation method using Q-learning based on a Support Vector Machine(SVM) is proposed.According to the developed mobile robot CASIA-I and its working environment,an approach is proposed,used to determine the reward/penalty function of Q-learning.A SVM is used to estimate the Q-value of state-action pair on-line,at the same time,in order to decrease the on-time learning time of SVM,a sliding time-window is introduced.Experimental results are included to show that the action policy obtained through Q-learning based on SVM can make the mobile robot reach the destination without obstacle collision.
作者 侯艳丽
出处 《计算机工程与应用》 CSCD 北大核心 2011年第23期242-244,248,共4页 Computer Engineering and Applications
基金 河南省科学技术厅基础与前沿技术研究计划项目(No.102300410242) 河南省教育厅自然科学基金项目(No.2010A510009) 河南省科技厅科技发展计划项目(No.112300410210)
关键词 移动机器人 Q学习 支持向量机 导航 在线学习 mobile robot Q-learning support vector machine navigation on-line learning
  • 相关文献

参考文献9

  • 1王雪松,田西兰,程玉虎.基于支持向量机的连续状态空间Q学习[J].中国矿业大学学报,2008,37(1):93-98. 被引量:5
  • 2秦政,丁福光,边信黔.强化学习在移动机器人自主导航中的应用[J].计算机工程与应用,2007,43(18):215-217. 被引量:5
  • 3Preu P,Delepoulies S,Raqcheville J C.A generic architecture for adaptive agents based on reinforcement learning. Informa-tion Sciences . 2004
  • 4Cicirelli G,,D’’Orazio T,Distante A.Neural Q-learning control ar-chitectures for wall-following behavior. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems . 2003
  • 5Yang G S,Chen K A,Cheng W.Mobile robot navigation using neural Q-learning. IEEE Proceedings of International Confer-ence on Machine Learning and Cybernetics . 2004
  • 6Yang G S,Hou Z G,Liang Z Z.Distributed visual navigation based on neural Q-learning for a mobile robot. International Journal of Vehicle Autonomous Systems . 2006
  • 7Bo C M,Wang Z Q,Lu A J.Study and application on dynamic modeling method based on SVM and sliding time window tech-niques. Proceedings of the6th World Congress on Intelli-gent Control and Automation . 2006
  • 8Carreras M,Ridao P,EI-Fakdi A.Semi-online Neural Q-learning for Real-time Robot Learning. Proceedings of the IEEFALSJ International Conference on Intelligent Robots and Systems . 2003
  • 9T.Kondo,K.Ito."A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control". Robotics and Autonomous Systems . 2004

二级参考文献22

  • 1许建华,张学工,李衍达.支持向量机的新发展[J].控制与决策,2004,19(5):481-484. 被引量:132
  • 2柳晓菁,易建强,赵冬斌,王伟.基于最小二乘支持向量机的自适应逆扰动消除控制系统[J].控制与决策,2005,20(8):947-950. 被引量:13
  • 3阎平凡.再励学习——原理、算法及其在智能控制中的应用[J].信息与控制,1996,25(1):28-34. 被引量:30
  • 4王雪松,程玉虎,易建强.一种自适应模糊Actor-Critic学习[J].控制与决策,2006,21(9):1068-1072. 被引量:3
  • 5Krogh B H.A generalized potential field approach to obstacle avoidance control[C]//Proceeding of the International Robotics Research Conference,1984:1150-1156.
  • 6Koren Y,Borenstein J.Potential field methods and their inherent limitations for mobile robot navigation[C]//Proceedings of the IEEE Conference on Robotics and Automation,Sacramento,California,1991:1394-1404.
  • 7Kamon I,Rivlin E,Rimon E.A new range-sensor based globally convergent navigation algorithm for mobile robots[C]//Proceedings of the IEEE International Conference on Robotics and Automation,Minneapolis,MN,1996:429-435.
  • 8Carreras M,Ridao P,Battle J,et al.Learning reactive robot behaviors with Neural-Q_learning[C]//IEEE-TTTC International Conference on Automation,Quality and Testing,Robotics,2002.
  • 9Gaskett CQ-learning for robot control[D].Australian National University,2002:5-13.
  • 10Ge S S,Cui Y J.New potential functions for mobile robot path planning[J].IEEE Transactions on Robotics and Automation,2000,16(5).

共引文献8

同被引文献10

引证文献2

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部