期刊文献+

Knowledge transfer in multi-agent reinforcement learning with incremental number of agents 被引量:1

下载PDF
导出
摘要 In this paper, the reinforcement learning method for cooperative multi-agent systems(MAS) with incremental number of agents is studied. The existing multi-agent reinforcement learning approaches deal with the MAS with a specific number of agents, and can learn well-performed policies. However, if there is an increasing number of agents, the previously learned in may not perform well in the current scenario. The new agents need to learn from scratch to find optimal policies with others,which may slow down the learning speed of the whole team. To solve that problem, in this paper, we propose a new algorithm to take full advantage of the historical knowledge which was learned before, and transfer it from the previous agents to the new agents. Since the previous agents have been trained well in the source environment, they are treated as teacher agents in the target environment. Correspondingly, the new agents are called student agents. To enable the student agents to learn from the teacher agents, we first modify the input nodes of the networks for teacher agents to adapt to the current environment. Then, the teacher agents take the observations of the student agents as input, and output the advised actions and values as supervising information. Finally, the student agents combine the reward from the environment and the supervising information from the teacher agents, and learn the optimal policies with modified loss functions. By taking full advantage of the knowledge of teacher agents, the search space for the student agents will be reduced significantly, which can accelerate the learning speed of the holistic system. The proposed algorithm is verified in some multi-agent simulation environments, and its efficiency has been demonstrated by the experiment results.
出处 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第2期447-460,共14页 系统工程与电子技术(英文版)
基金 supported by the National Key R&D Program of China (2018AAA0101400) the National Natural Science Foundation of China (62173251 61921004 U1713209) the Natural Science Foundation of Jiangsu Province of China (BK20202006) the Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control。
  • 相关文献

参考文献2

二级参考文献3

共引文献13

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部