期刊文献+

基于多智能体强化学习的协同目标分配 被引量:1

Cooperative targets assignment based on multi-agent reinforcement learning
下载PDF
导出
摘要 针对传统方法难以适用于动态不确定环境下的大规模协同目标分配问题,提出一种基于多智能体强化学习的协同目标分配模型及训练方法。通过对相关概念和数学模型的描述,将协同目标分配转化为多智能体协作问题。聚焦于顶层分配策略的学习,构建了策略评分模型和策略推理模型,采用Advantage Actor-Critic算法进行策略优化。仿真实验结果表明,所提方法能够准确刻画作战单元之间的协同演化内因,有效地实现了大规模协同目标分配方案的动态生成。 Aiming at the problem that traditional methods are difficult to apply to large-scale cooperative targets assignment in dynamic uncertain environment,a cooperative targets assignment model and training method based on multi-agent reinforcement learning is proposed.Through the description of related concepts and mathematical models,the cooperative targets assignment is transformed into a multi-agent cooperation problem.Focusing on the learning of top-level assignment strategy,the scoring model and reasoning model of strategy are constructed,and the Advantage Actor-Critic algorithm is used for strategy optimization.The simulation results show that the proposed method can accurately describe the evolution of the cooperative relationship between operational units,and effectively realize the dynamic generation of large-scale cooperative targets assignment scheme.
作者 马悦 吴琳 许霄 MA Yue;WU Lin;XU Xiao(Graduate School,National Defense University,Beijing 100091,China;Unit 31002 of the PLA,Beijing 100091,China;Academy of Joint Operation,National Defense University,Beijing 100091,China)
出处 《系统工程与电子技术》 EI CSCD 北大核心 2023年第9期2793-2801,共9页 Systems Engineering and Electronics
关键词 协同目标分配 多智能体协作 强化学习 神经网络 Advantage Actor-Critic cooperative targets assignment multi-agent cooperation reinforcement learning neural network Advantage Actor-Critic
  • 相关文献

参考文献6

二级参考文献67

共引文献135

同被引文献12

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部