期刊文献+

多智能体的增强学习及其在RoboCup中的应用

Reinforcement learning for Multi-Agents Systems and its application in RoboCup
下载PDF
导出
摘要 针对非确定马尔可夫环境下的多智能体系统,提出了多智能体Q学习模型和算法。算法中通过对联合动作的统计来学习其它智能体的行为策略,并利用智能体策略向量的全概率分布保证了对联合最优动作的选择。在实验中,成功实现了智能体的决策,提高了AFU队的整体的对抗能力,证明了算法的有效性和可行性。 Due to the presence of other agents,the environment of Multi-Agent Systems(MAS) cannot be simply treated as Markov Decision Processes (MDPs).The current reinforcement learning which are based on MDPs must be reformed before it can be applicable to MAS.Based on an agent's independent learning ability,this paper proposes a novel Q-learning algorithm for MAS-an agent learning other agents action policies through observing the joint action.The politicies of other agents are expressed as action probability distribution matrixes.A concise and yet useful updating method for the matrixes is proposed.The full joint probability of distribution matrixes guarantees the learning agent to choose its optimal action.In experiment,the implemention of the agent and the enhancement of AFU shows that the approach is valid and efficient.
出处 《计算机工程与应用》 CSCD 北大核心 2008年第23期46-48,共3页 Computer Engineering and Applications
关键词 多智能体 增强学习 机器人世界杯足球锦标赛 Multi-Agents Systems (MAS) reinforcement learning Robot World Cup (RoboCup)
  • 相关文献

参考文献1

二级参考文献12

  • 1Kaelbling L P,Littman M L,Moore A W.Reinforcement learning:A survey.Journal of Artificial Intelligence Research,1996,4(2):237~285
  • 2Sandip S C.Adaption,coevolution and learning in multiagent systems.In:Proceedings of AAAI Spring Symposium,AAAI Technical Report SS-96-01,AAAI,1996.57~62
  • 3Weiss G,Dillenbourg P.What is multi in multiagent learning? Collaborative Learning,Cognitive and Computational Approaches.Amsterdam,Holland:Pergamon Press,1998.64~80
  • 4Narendra P,Sandip S,Maria G.Shared memory based cooperative coevolution.In:Proceedings of IEEE International Conference on Evolutionary Computation,IEEE,1998.570~574
  • 5Littman M L.Markov games as a framework for multiagent reinforcement learning.In:Proceedings of the 11th Interna tional Conference on Machine learning,Morgan Kaufmann,1994.157163
  • 6Littman M L.Friend-or-foe:Q-learning in general-sum games.In:Proceedings of the 18th International Conference on Machine Learning,Morgan Kaufmann,2001.322~328
  • 7Hu J,Wellman M P.Nash Q-Learning for General-Sum stochastic games.Journal of Machine Learning,2003,4:1039~1069
  • 8Mitchell T M.Machine Learning.USA:McGraw-Hill Companics Inc.1997,367~387
  • 9Watkins C J C H,Dayan P.Technical note Q-learning.Journal of Machine Learning,1992,(8):279~292
  • 10Haussler D.Quantifying inductive bias:AI learning algorithms and valiant's learning framework.Artificial Intelligence,1988,36(2):177~221

共引文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部