Multiagent Reinforcement Learning Based on Negotiation in General-Sum Games
Abstract: In general-sum games, multiagent cooperation that considers only individual rationality has no global objective. Agents' learning is based on assumptions about opponents' policies, and the correctness of these assumptions cannot be guaranteed. To address this, a collective objective for agent cooperation is defined, and a multiagent reinforcement learning algorithm based on negotiation is proposed. Agents select a negotiated policy and punish any agent that deviates from it, which ensures that the negotiated policy is executed. Conditions for the convergence of learning are given with proof, and an example is analyzed.
Source: Journal of Shanghai Jiaotong University (indexed in EI, CAS, CSCD, PKU Core), 2005, Issue S1, pp. 108-112, 5 pages
Keywords: Markov games; reinforcement learning; multiagent coordination; negotiation
