期刊文献+

基于过滤机制筛选信息的多智能体策略方法

Research on multi-agent strategy based on filtering mechanism to filter information
原文传递
导出
摘要 多智能体系统在进行协作或竞争时,会面临联合信息空间扩大、智能体间信息提取效率降低的问题.对此,采用增加过滤机制来筛选信息的多智能体强化学习策略方法(FMAC),以增强智能体间信息交流能力.该方法通过找到彼此相关联的智能体,根据相关性计算智能体的信息贡献,过滤掉无关智能体信息,从而实现在合作、竞争或者混合环境下智能体间有效的沟通.与此同时,采用集中训练分散执行的方式解决环境的非平稳性问题.通过对比算法进行实验,结果表明改进算法提高了策略迭代效率以及泛化能力,并且智能体数量增多时仍可保持稳定的效果,有助于将多智能体强化学习应用到更广泛的领域. When multi-agent systems cooperate or compete,the joint information space will be enlarged and the efficiency of information extraction between agents will be reduced.In this paper,a multi-agent reinforcement learning strategy(FMAC)with filtering mechanism to filter information is adopted to enhance the ability of information communication between agents.By finding the related agents and calculating their information contribution according to the correlation,the method filters out the irrelevant agent information so as to realize the effective communication between agents in cooperative competition or mixed environment.At the same time,the centralized training decentralized execution method is adopted to solve the non-stationarity of environment.In this paper,experiments are carried out by comparing algorithms to verify that the improved algorithm improves the strategy iteration efficiency and generalization ability,and can maintain stable effects when the number of agents increases,which is conducive to the application of multi-agent reinforcement learning to a wider range of fields.
作者 陈亮 郭婷 刘韵婷 杨佳明 CHEN Liang;GUO Ting;LIU Yun-ting;YANG Jia-ming(School of Automation and Electrical Engineering,Shenyang Ligong University,Shenyang 110159,China)
出处 《控制与决策》 EI CSCD 北大核心 2022年第6期1643-1648,共6页 Control and Decision
关键词 强化学习 多智能体决策 信息过滤 集中训练分散执行 reinforcement learning multi-agent system filtering mechanism centralized training decentralized execution
  • 相关文献

参考文献3

二级参考文献52

  • 1Hewitt C. Viewing Control Ctructures as Patterns of Passing Messages. Artificial Intelligence, 1977,8(3) :323-364
  • 2Wooldridge M,Jennings N R. Agent Theories,Architectures,and Languages: a Survey. In: Wooldridge, Jennings, eds. Intelligent Agents,Berlin: Springer-Verlag, 1995. 1-22.
  • 3Wei β G. Learning to Coordinate Actions in Multi-Agent Systems Proceedings of IJCAI'93, 1993
  • 4Dworman,Garett,Kimbrough S,Laing J. Bargaining by Artificial Agents in Two Coalition Games: A Study in Genetic Programming for Electronic Commerce. In: Proc. of the AAAI Genetic Programming Conf. Stanford,CA,Aug. 1996
  • 5Kaelbling L P. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 1996,4: 237-285
  • 6Singh S. Agents and Reinforcement Learning. Miller freeman publish Inc,San Mateo,CA,USA,1997
  • 7Bellman R. Dynamic Programming. Prentice-Hall, Englewood Cliffs, NJ, 1957
  • 8Sutton R S. Learning to predict by the methods of temporal differences. Machine Learning, 1988,3: 9 - 44
  • 9Sutton R S. Convergence theory for a new kind of prediction learning. In:Proc. of the 1988 Workshop on Computational Learning Theory, 1988. 421-442
  • 10Watkins C J C H,Dayan P. Q-Learning. Machine Learning,8(3):279-292

共引文献277

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部