Application of Reinforcement Learning in Half Field Offense of Robot Soccer (cited by: 1)
Abstract: This paper studies the application of reinforcement learning to half field offense in robot soccer. The robot soccer environment has a continuous state space, so the state space must be discretized before reinforcement learning can be applied; the paper describes the environment state with a given set of state variables. To overcome the drawback of each robot updating its value function (Q values) independently, the robots communicate with one another so that the value functions of all offensive robots are updated. Finally, the algorithm was tested in a 4v5 half field offense environment and achieved satisfactory results.
Source: 《微计算机信息》 (Control & Automation), 2011, No. 12, pp. 104-105, 84 (3 pages in total)
Keywords: Reinforcement Learning; Half Field Offense; Robot Communication
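
The two ideas in the abstract, discretizing a continuous state and sharing value-function updates through communication, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration only: the abstract does not list the state variables, bin edges, action set, reward, or exact update rule, so the names `DIST_BINS`, `ANGLE_BINS`, `ACTIONS`, `encode_state`, `Attacker`, and `broadcast_update` are hypothetical, and the update shown is standard one-step tabular Q-learning rather than the paper's confirmed method.

```python
from collections import defaultdict

# Hypothetical bin edges and actions; the paper's actual choices are not given.
DIST_BINS = (5.0, 10.0, 20.0)      # metres
ANGLE_BINS = (15.0, 45.0, 90.0)    # degrees
ACTIONS = ("hold", "pass", "dribble", "shoot")


def discretize(value, edges):
    """Return the index of the first bin whose upper edge exceeds value."""
    for i, edge in enumerate(edges):
        if value < edge:
            return i
    return len(edges)


def encode_state(dist_to_goal, dist_to_opponent, open_angle):
    """Map three continuous variables (assumed) to a discrete state tuple."""
    return (
        discretize(dist_to_goal, DIST_BINS),
        discretize(dist_to_opponent, DIST_BINS),
        discretize(open_angle, ANGLE_BINS),
    )


class Attacker:
    """One offensive robot with its own tabular Q function."""

    def __init__(self, alpha=0.1, gamma=0.9):
        self.alpha = alpha
        self.gamma = gamma
        self.q = defaultdict(float)  # (state, action) -> Q value, default 0.0

    def update(self, s, a, r, s_next):
        # Standard one-step Q-learning update (assumed, not from the paper).
        best_next = max(self.q[(s_next, b)] for b in ACTIONS)
        td_error = r + self.gamma * best_next - self.q[(s, a)]
        self.q[(s, a)] += self.alpha * td_error


def broadcast_update(team, s, a, r, s_next):
    """Simulated communication: every attacker applies the same experience."""
    for agent in team:
        agent.update(s, a, r, s_next)


team = [Attacker() for _ in range(4)]   # four attackers, as in the 4v5 setting
s = encode_state(12.0, 3.0, 30.0)
s_next = encode_state(6.0, 4.0, 50.0)
broadcast_update(team, s, "shoot", 1.0, s_next)
```

In this sketch the experience of one robot is applied to every teammate's table, so all attackers learn from each transition instead of only the robot that executed the action. A real system would have to decide what is transmitted (raw experience or Q values) and how to handle lost messages, which the abstract does not describe.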