Neural Network-Based Q-Learning Applied to Obstacle Avoidance in Khepera II Mobile Robots


Abstract: To improve the autonomous learning ability of mobile robots, an intelligent control architecture was designed on the basis of the traditional behavior-based robot control architecture. A Q-learning module based on a neural network was introduced into this architecture, overcoming the limitation that the traditional algorithm can only be applied to discrete states. Obstacle-avoidance experiments show that the method enables a mobile robot to achieve autonomous obstacle avoidance through self-learning.
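The idea summarized in the abstract, replacing the Q-table with a neural network so that continuous sensor readings can be used directly as the state, can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not give the network structure, state encoding, action set, or reward function, so the one-hidden-layer network, the class name `QNetwork`, the helpers `q_learning_step` and `choose_action`, and all parameter values here are assumptions.

```python
import math
import random


class QNetwork:
    """One-hidden-layer network: continuous state in, one Q-value per action out.

    Hypothetical structure chosen for illustration; the paper's network may differ.
    """

    def __init__(self, n_inputs, n_hidden, n_actions, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.lr = lr
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_actions)]
        self.b2 = [0.0] * n_actions

    def forward(self, state):
        """Return (hidden activations, Q-values) for a continuous state vector."""
        hidden = [math.tanh(sum(w * x for w, x in zip(row, state)) + b)
                  for row, b in zip(self.w1, self.b1)]
        q = [sum(w * h for w, h in zip(row, hidden)) + b
             for row, b in zip(self.w2, self.b2)]
        return hidden, q

    def q_values(self, state):
        return self.forward(state)[1]

    def update(self, state, action, target):
        """One gradient step moving Q(state, action) toward target; returns the TD error."""
        hidden, q = self.forward(state)
        error = target - q[action]
        for j, h in enumerate(hidden):
            # Gradient of Q(state, action) w.r.t. hidden unit j's pre-activation,
            # computed with the output weight as it was before this step.
            grad_hidden = self.w2[action][j] * (1.0 - h * h)
            self.w2[action][j] += self.lr * error * h
            for i, x in enumerate(state):
                self.w1[j][i] += self.lr * error * grad_hidden * x
            self.b1[j] += self.lr * error * grad_hidden
        self.b2[action] += self.lr * error
        return error


def q_learning_step(net, state, action, reward, next_state, done, gamma=0.9):
    """Standard Q-learning target, with the network standing in for the Q-table."""
    if done:
        target = reward
    else:
        target = reward + gamma * max(net.q_values(next_state))
    return net.update(state, action, target)


def choose_action(net, state, epsilon, rng):
    """Epsilon-greedy selection over the network's Q-values."""
    if rng.random() < epsilon:
        return rng.randrange(len(net.b2))
    q = net.q_values(state)
    return max(range(len(q)), key=q.__getitem__)
```

In a robot setting, `state` would hold normalized proximity-sensor readings and each action index would map to a motor command. Both mappings are left open here because the record does not specify them.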

Authors: SHENG Weitao (盛维涛), ZHANG Wenjun (张文君), ZHANG Jianxing (张建兴)

Affiliation: College of Automation, Chongqing University (重庆大学自动化学院)

Source: World Sci-Tech R&D (《世界科技研究与发展》), CSCD, 2013, No. 3, pp. 374-376, 407 (4 pages)

Funding: Supported by the National Natural Science Foundation of China (61075096)

Keywords: mobile robot; neural network; Q-learning; obstacle avoidance

Classification: TP242 [Automation and Computer Technology: Detection Technology and Automation Devices]


