期刊文献+

基于Q-learning的一种多Agent系统结构模型 被引量:2

A Structure Model of Multi-agent System Based on Q-learning
下载PDF
导出
摘要 多Agent系统是近年来比较热门的一个研究领域,而Q-learning算法是强化学习算法中比较著名的算法,也是应用最广泛的一种强化学习算法。以单Agent强化学习Q-learning算法为基础,提出了一种新的学习协作算法,并根据此算法提出了一种新的多Agent系统体系结构模型,该结构的最大特点是提出了知识共享机制、团队结构思想和引入了服务商概念,最后通过仿真实验说明了该结构体系的优越性。 Multi-agent system(MAS) is a very popular research field in recent years, and Q-learning algorithm is a more famous algorithm, also is one of the most widely used reinforcement learning algorithms. In this paper, a new algorithm based on learning cooperation is proposed. Its basis is Q-learning, a single agent reinforcement learning algorithm. Fi nally, a novel structure model of MAS is proposed. The most significant characteristic of the model is to put forward the mechanism of knowledge sharing, the ideal of team structure and to introduce the concept of facilitator. In the end, it shows the advantage of the structure system by the simulation experiment.
作者 许培 薛伟
出处 《计算机与数字工程》 2011年第8期8-11,共4页 Computer & Digital Engineering
关键词 多AGENT系统 强化学习 Q学习 体系结构 知识共享 multi-agent system(MAS), reinforcement learning, Q-learning, structure system, knowledge sharing
  • 相关文献

参考文献9

  • 1MICHAEL Wooldridge. An Introduction To MultiAgent System[M]. New York: J. Wiley,2002.
  • 2WATKINS C, DAYAN P. QLearning[J]. Machine Learning, 1992,8(3) :279-292.
  • 3Bellman R E. A Markov Decision Process[J]. Journal of Mathematical Mechanics, 1957,6 (5) : 679 - 684.
  • 4吴元斌.单agent强化学习与多agent强化学习比较研究[J].电脑与信息技术,2009,17(1):8-11. 被引量:2
  • 5WATKINS C. Learning From Delayed Rewards[D]. Cambridge: University of Cambridge, 1989.
  • 6郑淑丽,韩江洪,骆祥峰,蒋建文.多Agent系统的协作及强化学习算法研究[J].模式识别与人工智能,2002,15(4):453-457. 被引量:2
  • 7王长缨,尹晓虎,鲍翊平,姚莉.一种共享经验元组的多agent协同强化学习算法[J].模式识别与人工智能,2005,18(2):234-239. 被引量:4
  • 8MICHAEL R. Genesereth, STEVEN P. Ketchpel. Soft Agents[J]. Communications of the ACM, 1994,37 (7) :48-53.
  • 9TAN M. Multi-agent Reinforcement Learning: independent vs. Cooperative Agents[C]//Proceedings of the Tenth International Conference on Machine Learning. [s. l.]..[s. n ],1993:330-337.

二级参考文献31

  • 1Busoniu,L. Babuska, R. De Schutter,B. A Comprehensive Survey of Multiagent Reinforcement Learning [J]. IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews,2008,38 (2):156-172.
  • 2M. L. Littman. Value-function reinforcement learning in Markov games [J]. Journal of Cognitive Systems Research ,2001, (2):55-66.
  • 3C. J. C. H. Watkins, P. Dayan. Technical note: Q-learning [J]. Machine Learning, 1992, ( 8):279-292.
  • 4Weiss G.Muhiagent Systems: A Modem Approach to Distributed Artificial Intelligence [M]. Cambridge, MA: MIT Press, 1999.
  • 5Peter Stone. Learning and Multiagent Reasoning for Autonomous Agents [A]. In 20th International Joint Conference on Artificial Intelligence(IJCAI-07 )[C], Hyderabad, India, 2007:13-30.
  • 6Russell S, Norvig P. Artificial Intelligence: A Modern Approach (Second Edition)[M].北京:清华大学出版社,2006.
  • 7R. S. Sutton, A. G. Barto. Reinforcement Learning: An Introduction [M]. Cambridge, MA: MIT Press, 1998.
  • 8L.P. Kalebling, M. L. Littman, A. W. Moore. Reinforcement learning:. A survey [J].Journal of Artificial Intelligence Research, 1996, (4):237-285.
  • 9Kaelbling L P, Littman M L, Moore A W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 1996, 4:237-285.
  • 10Watkins C, Dayan P. Q-Learning. Machine Learning, 1992, 8:279-292.

共引文献5

同被引文献17

引证文献2

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部