期刊文献+

多Agent协作的强化学习模型和算法 被引量:6

Reinforcement Learning Model and Algorithm Based on Multi-agent Cooperation
下载PDF
导出
摘要 结合强化学习技术讨论了多Agent协作学习的过程,构造了一个新的多Agent协作学习模型。在这个模型的基础上,提出一个多Agent协作学习算法。算法充分考虑了多Agent共同学习的特点,使得Agent基于对动作长期利益的估计来预测其动作策略,并做出相应的决策,进而达成最优的联合动作策略。最后,通过对猎人-猎物追逐问题的仿真试验验证了该算法的收敛性,表明这种学习算法是一种高效、快速的学习方法。 The multi-agent cooperative learning process based on Reinforcement Learning is addressed and a new multiagent cooperative learning model is proposed. Based on this model, a cooperative learning algorithm is introduced. This algorithm pays fully attention to multl-agent cooperative learning together simultaneity, so it can make each agent predict its action policy based on the estimation on its action's long-time reward, At last relevant decisions to be the best associated action policy is made. We conduct a series of empirical evaluation of the algorithm on the hunter-prey problem to validate its astringency. The result shows this algorithm is an efficient and fast method for multi-agent learning.
出处 《计算机科学》 CSCD 北大核心 2006年第12期156-158,186,共4页 Computer Science
基金 国家自然科学基金项目资助(编号:60573169)。
关键词 协作学习 强化学习 多AGENT学习 学习模型 学习算法 Cooperative learning, Reinforcement learning, Multi-agent learning, Learning model, Learning algorithm
  • 相关文献

参考文献8

  • 1Sutton R, Barto AG. Reinforcement Learning: An Introduction.MIT Press, 1998
  • 2Tan Ming. Multi-agent reinforcement learning: independent vs,cooperative Agents. In:Proceedings of the 10^th International Conference on Machine Learning(ICML-93), 1993. 330-337
  • 3Watkins C J C H, Dayan P. Q-learning. Machine learning. 1992,8:272-292
  • 4蔡庆生,张波.一种基于Agent团队的强化学习模型与应用研究[J].计算机研究与发展,2000,37(9):1087-1093. 被引量:31
  • 5Irwig K, Wobeke W. Muhi-Agent Reinfoecement Learning with Vicarious Rewards. Electronic Transactions on Artificial Intelligence, 1999,3(B): 23-45
  • 6Mataric M J. Interaction and intelligent behavior: [Ph D Thesis].Department of Electrical Engineering and Computer Science,MIT, USA, 1994
  • 7Bowling M. Convergence problems of general-sum multi-agent reinforcement learning [A]. In: Langley P,ed. Proceedings of the Seventeenth International Conference on Machine Learning [C],San Francisco: Morgan Kaufmann Publishers,2000. 89-94
  • 8Benda M,Jagannathan V,Dodhiawalla R. On optimal cooperation of knowledge sources: [Technical Report]. BCS-G2010-28. Boeing AI Center, Boeing Computer Services, Bellevue, WA, August 1985

共引文献30

同被引文献57

引证文献6

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部