Reinforcement Learning with Application to Mobile Robots (再励学习及其在移动机器人行为规划中的应用)

Abstract: Reinforcement learning (RL) is an approach to machine intelligence that successfully combines dynamic programming with control problems. It brings dynamic programming and supervised learning together in a single machine-learning system and is typically used to solve two classes of problems: prediction and control. This paper proposes an evaluation function expressed in vector form. To realize multi-dimensional reinforcement learning, a dedicated neural network (the Q-net) implements the critic network, and its application to behavior planning for mobile robots is investigated.
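To make the idea of a vector-valued evaluation function concrete, the following is a minimal sketch under stated assumptions, not the paper's method. This record gives no equations, so the sketch substitutes a lookup table for the Q-net and assumes that the next action is chosen greedily by a weighted sum of the Q-vector components. The function name `vector_q_update` and the parameters `weights`, `alpha`, and `gamma` are hypothetical.

```python
def vector_q_update(q, state, action, rewards, next_state, actions, weights,
                    alpha=0.1, gamma=0.9):
    """One tabular Q-learning step where each Q-value is a vector.

    q        -- dict mapping (state, action) to a list, one entry per reward signal
    rewards  -- list of rewards observed for this transition, one per dimension
    weights  -- assumed scalarization weights used only to pick the greedy action
    """
    dims = len(weights)
    zeros = [0.0] * dims

    def scalar_value(s, a):
        # Weighted sum of the Q-vector (an assumption, not taken from the paper).
        return sum(w * v for w, v in zip(weights, q.get((s, a), zeros)))

    # Greedy action in the next state under the scalarized value.
    best_next = max(actions, key=lambda a: scalar_value(next_state, a))
    q_next = q.get((next_state, best_next), zeros)
    q_old = q.get((state, action), zeros)

    # Standard Q-learning update applied to each component independently.
    q[(state, action)] = [
        old + alpha * (r + gamma * nxt - old)
        for old, r, nxt in zip(q_old, rewards, q_next)
    ]
    return q[(state, action)]
```

Each component of the vector tracks one reward signal (for example, one for reaching a goal and one for avoiding obstacles), so the trade-off between objectives is deferred to the action-selection step rather than baked into a single scalar reward.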

Authors: 林雄 (Lin Xiong), 于洪 (Yu Hong), 孙志雄 (Sun Zhixiong), 韩建文 (Han Jianwen)

Affiliation: School of Information Science and Technology, Qiongzhou University

Source: Industrial Control Computer (《工业控制计算机》), 2009, No. 8, pp. 58-59 (2 pages)

Funding: Natural Science Foundation of the Hainan Provincial Department of Education (Hj2009-134)

Keywords: reinforcement learning, neural networks, intelligent robot, behavior planning, application

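Classification: TP242 [Automation and Computer Technology: Detection Technology and Automatic Equipment]; U491.51 [Transportation Engineering: Transportation Planning and Management]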

References (5)

1. D. J. White. A survey of applications of Markov decision processes. The Journal of the Operational Research Society, 44(11):1073-1096, 1993.
2. C. J. Watkins, P. Dayan. Q-learning. Machine Learning, 8:279-292, 1992.
3. K. Kiguchi, T. Nanayakkara, et al. Multi-Dimensional Reinforcement Learning Using a Vector Q-Net: Application to Mobile Robots. FIRA Robot Congress, 2002.
4. R. S. Sutton, A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
5. E. Uchibe, M. Asada. Multiple Reward Criterion for Cooperative Behavior Acquisition in a Multiagent Environment. Proc. of IEEE International Conf. on Systems, Man, and Cybernetics, pp. VI-710 to VI-715, 1999.

