期刊文献+

移动机器人模糊Q-学习沿墙导航 被引量:7

Wall-following control of a mobile robot with fuzzy Q-learning
下载PDF
导出
摘要 针对在基于行为的移动机器人沿墙导航控制器的设计中缺乏足够的先验知识的问题,采用Q-学习方法让机器人通过学习来自动构建导航控制器。将模糊神经网络和Q-学习相结合,用模糊神经网络直接逼近连续状态和动作空间中的Q值函数。利用对Q值函数的优化获得控制输出。模糊神经网络中的节点根据状态动作对的各个分量和时间差分的新颖性进行自适应地添加和构造,这样不仅能克服节点选择的困难还能使网络保持适度的规模。网络中的参数采用扩展卡尔曼滤波方法进行自适应调整。基于Khepera 2机器人的沿墙导航实验验证了该方法的有效性和优越性。 The Q-learning was introduced into navigation control of the wall-following task of mobile robots where there was no enough priori knowledge available.The Q-value function was approached directly u-sing Fuzzy Neural Network(FNN).The optimization method was used to search the greedy action with maximum Q-value.The nodes of FNN were created incrementally and adaptively according to every ele-ment of the current pair of state-action and Temporal Difference(TD),which overcame the difficulties of the choice of nodes and ensured an economic size of the network.Moreover the parameters of the FNN were updated using Extended Kalman Filter(EKF).The results of the wall-following task of Khepera 2 mobile robot demonstrate the superiority and validity of the proposed method.
出处 《电机与控制学报》 EI CSCD 北大核心 2010年第6期83-88,97,共7页 Electric Machines and Control
基金 国家自然科学基金(60703106)
关键词 Q-学习 模糊神经网络 沿墙导航 移动机器人 Q-learning fuzzy neural network wall-following navigation mobile robots
  • 相关文献

参考文献17

二级参考文献51

  • 1王红睿,赵黎明.基于增强学习规则的倒立摆模糊神经网络控制器[J].吉林大学学报(信息科学版),2006,24(5):561-566. 被引量:1
  • 2康怀祺,史彩成,何佩琨,李晓琼.Novel Sequential Neural Network Learning Algorithm for Function Approximation[J].Journal of Beijing Institute of Technology,2007,16(2):197-200. 被引量:1
  • 3SUTTON R S, BARTO A G. Reinforcement Learning: An Introductin [ M]. Cambridge, MA: MIT Press, 1998.
  • 4THURN S, MITCHEIL T M. Lifelong Robot Leaning [J]. Robotics and Autonomous System, 1995, 15 (1) : 25-46.
  • 5WATKINS C, DAYAN P. Q-Learning [J]. Machine Learning, 1992, 8 (3/4): 279-292.
  • 6WIDROW B, RUMELHART D E, LEHR M A. The Basic Ideas in Neural Networks [ J]. Communications of the ACM, 1994, 37 (3) : 87-92.
  • 7WANG Xue-song, CHENG Yu-hu, SUN Wei. Q Learning Based on Self-Organizing Fuzzy Radial Basis Function Network [ C] //Thrid International symposium on Neural Networks. Berlin Heidelberg: Springer Verlag, 2006: 607-615.
  • 8PARK J, SANDBERG I W. Universal Approximation Using Radial Basis Functions Networks [ J ]. Neural Computation, 1991, 3 (2): 246-257.
  • 9JUN L. Learning Reactive Behaviors with Constructive Neural Network in Mobile Robotics [ D]. [ S.l. ] : Orebro Studies in Technology, 2006.
  • 10STASTNY J, SKORPIL. Analysis of Algorithms for Radial Basis Function Neural Network [ C ] // IFIP International Federation for Information Processing. [ S. l. ] : Springer, 2007, 245 : 54-62.

共引文献31

同被引文献140

引证文献7

二级引证文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部