期刊文献+

A MULTI-AGENT LOCAL-LEARNING ALGORITHM UNDER GROUP ENVIROMENT

A MULTI-AGENT LOCAL-LEARNING ALGORITHM UNDER GROUP ENVIROMENT
下载PDF
导出
摘要 In this paper,a local-learning algorithm for multi-agent is presented based on the fact that individual agent performs local perception and local interaction under group environment.As for in-dividual-learning,agent adopts greedy strategy to maximize its reward when interacting with envi-ronment.In group-learning,local interaction takes place between each two agents.A local-learning algorithm to choose and modify agents' actions is proposed to improve the traditional Q-learning algorithm,respectively in the situations of zero-sum games and general-sum games with unique equi-librium or multi-equilibrium.And this local-learning algorithm is proved to be convergent and the computation complexity is lower than the Nash-Q.Additionally,through grid-game test,it is indicated that by using this local-learning algorithm,the local behaviors of agents can spread to globe. In this paper,a local-learning algorithm for multi-agent is presented based on the fact that individual agent performs local perception and local interaction under group environment.As for in-dividual-learning,agent adopts greedy strategy to maximize its reward when interacting with envi-ronment.In group-learning,local interaction takes place between each two agents.A local-learning algorithm to choose and modify agents' actions is proposed to improve the traditional Q-learning algorithm,respectively in the situations of zero-sum games and general-sum games with unique equi-librium or multi-equilibrium.And this local-learning algorithm is proved to be convergent and the computation complexity is lower than the Nash-Q.Additionally,through grid-game test,it is indicated that by using this local-learning algorithm,the local behaviors of agents can spread to globe.
出处 《Journal of Electronics(China)》 2009年第2期229-236,共8页 电子科学学刊(英文版)
基金 Supported by NSFC(No.60503024, No.60374032)
关键词 Multi-agent learning Game theory Nash-Q Local-learning algorithm Q学习算法 多Agent 环境 计算复杂度 代理人 贪婪策略 相互作用
  • 相关文献

参考文献10

  • 1I.Kazunori,,I.Kazushi,,S.Hideaki.The asymp-totic equi-partition property in reinforcement learning and its relation to return maximization[].Neural Networks.2006
  • 2R.K.Jelle,,T.J.Matthijs,,V.Nikos.An approach to non-communicative multi-agent coordination in continuous domains[].Proceedings of the th Bel- gian-Dutch Conference on Machine Learning.2002
  • 3P.G.Harrison,T.T.Lee.Separable equilibrium state probabilities via time reversal in Markovian process algebra[].Theoretical Computer Science.2005
  • 4P.B.Arthur,F.R.Aluzio.Influence zones: A strategy to enhance reinforcement learning[].Neurocomputing.2006
  • 5M.Bowling.Learning to predict by the method of multi-agent reinforcement learning[].Proceedings of the th International Conference on Machine Learning.2000
  • 6A.Soltania,,D.Leeb,,X.J.Wang.Neural mechanism for stochastic behaviour during a com- petitive game[].Neural Networks.2006
  • 7D.Banerjee,S.Sen.Reaching pareto-optimality in prisoner’s dilemma using conditional joint action learning[].Auton Agent Multi-Agent System.2007
  • 8clustering.Proceedings of the 10th International Conference on Machine Learning[]..1993
  • 9Filar J,Vrieze K.Competitive Markov decision processes[]..1997
  • 10Bowling M,Veloso M.Multiagent learning using a variable learning rate[].Artificial Intelligence.2002

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部