摘要
强化学习为多 Agent之间的协作提供了鲁棒的学习方法 .本文首先介绍了强化学习的原理和组成要素 ,其次描述了多 Agent马尔可夫决策过程 MMDP,并给出了 Agent强化学习模型 .在此基础上 ,对多 Agent协作过程中存在的两种强化学习方式 :IL(独立学习 )和 JAL(联合动作学习 )进行了比较 .最后分析了在有多个最优策略存在的情况下 ,协作多
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in fully cooperative multi agent systems (MAS). This paper first introduces the basic principles and components of reinforcement learning, then describes multi agent extension MMDP and presents reinforcement learning model of agents in cooperative MAS. After that we distinguish reinforcement learners that ignore the presence of other agents from those that explicitly attempt to learn the value of joint actions and strategies of their counterparts. In the last, some simple and commonly used coordination mechanisms are examined.
出处
《小型微型计算机系统》
CSCD
北大核心
2003年第11期1986-1988,共3页
Journal of Chinese Computer Systems
基金
安徽省自然科学基金 ( 0 0 0 43 115 )资助
关键词
多AGENT系统
强化学习
MMDP
协调机制
multi agent system
reinforcement learning
multi agent MDP
coordination mechanisms