Abstract
An equipment system is the reflection of a combat system from the equipment perspective, and evaluating its combat effectiveness is of practical importance for optimizing, building, and developing equipment systems. Confrontation between cluster equipment combat systems is large-scale, highly dynamic, and strongly adversarial, which makes it difficult to evaluate combat effectiveness directly with traditional methods. For single-task homogeneous cluster equipment systems (such as UAV reconnaissance swarms and ground unmanned platform fire assault clusters), this paper takes the perspective of multi-agent game theory, treats the confrontation between equipment systems as a Markov game of a multi-agent system, and proposes a combat effectiveness evaluation method based on reinforcement learning of multi-agent game (RLoMAG). First, the principle of the evaluation method is analyzed and an equipment system confrontation model is built. Second, a framework for the evaluation method is given, comprising agent modeling, game algorithm design, design of combat effectiveness indices, exploratory system confrontation simulation, solution of the optimal game strategy of the equipment system, and analysis of the combat effectiveness indices under that optimal strategy. Finally, with base defense operations as the scenario, an application example evaluating the combat effectiveness of a UAV swarm equipment system is given, which verifies the effectiveness of the method.
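The core idea in the abstract, solving for a game-optimal strategy and then reading the effectiveness index under that strategy, can be illustrated with a much simpler stand-in. The sketch below is not the paper's RLoMAG algorithm: it assumes a single-state, two-player zero-sum matrix game and uses fictitious play in place of multi-agent reinforcement learning. The payoff matrix `PAYOFF`, the tactic labels, and the function names are all hypothetical, chosen only for illustration.

```python
# Illustrative sketch only: a single-state zero-sum game solved by
# fictitious play, standing in for the Markov game and RL solver
# described in the abstract. All numbers below are hypothetical.

# PAYOFF[i][j]: assumed probability that the swarm completes its mission
# when the swarm uses tactic i and the defender uses tactic j.
PAYOFF = [
    [0.8, 0.2],
    [0.3, 0.7],
]


def fictitious_play(payoff, iterations=20000):
    """Approximate equilibrium mixed strategies of a zero-sum matrix game.

    The row player maximizes payoff[i][j]; the column player minimizes it.
    Each round, both players best-respond to the opponent's empirical
    action counts so far.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_counts = [0] * n_rows
    col_counts = [0] * n_cols
    # Arbitrary first moves so the empirical counts are non-empty.
    row_counts[0] += 1
    col_counts[0] += 1

    for _ in range(iterations):
        # Row player's total payoff for each action against column history.
        row_values = [
            sum(payoff[i][j] * col_counts[j] for j in range(n_cols))
            for i in range(n_rows)
        ]
        # Column player's total loss for each action against row history.
        col_values = [
            sum(payoff[i][j] * row_counts[i] for i in range(n_rows))
            for j in range(n_cols)
        ]
        best_row = max(range(n_rows), key=row_values.__getitem__)
        best_col = min(range(n_cols), key=col_values.__getitem__)
        row_counts[best_row] += 1
        col_counts[best_col] += 1

    row_total = sum(row_counts)
    col_total = sum(col_counts)
    row_mix = [c / row_total for c in row_counts]
    col_mix = [c / col_total for c in col_counts]
    return row_mix, col_mix


def expected_payoff(payoff, row_mix, col_mix):
    """Expected payoff to the row player under the given mixed strategies."""
    return sum(
        row_mix[i] * payoff[i][j] * col_mix[j]
        for i in range(len(payoff))
        for j in range(len(payoff[0]))
    )


swarm_mix, defender_mix = fictitious_play(PAYOFF)
effectiveness = expected_payoff(PAYOFF, swarm_mix, defender_mix)
print("swarm strategy:", swarm_mix)
print("defender strategy:", defender_mix)
print("effectiveness index under equilibrium play:", effectiveness)
```

For this hypothetical matrix the exact equilibrium can be worked out by hand: the swarm mixes its tactics 0.4/0.6, the defender mixes 0.5/0.5, and the game value is 0.5, so the printed approximations should land close to those numbers. The point of evaluating at equilibrium is that the index reflects performance against a best-responding opponent, not against one fixed adversary tactic.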
Authors
张国辉
高昂
张雅楠
Zhang Guohui; Gao Ang; Zhang Ya'nan (Department of Information and Communication, Academy of Army Armored Force, Beijing 100072, China; Joint Operations College, National Defence University, Beijing 100091, China)
Source
《系统仿真学报》
CAS
CSCD
PKU Core (北大核心)
2024, No. 1, pp. 160-169 (10 pages)
Journal of System Simulation
Keywords
equipment system (装备体系)
combat effectiveness evaluation (作战效能评估)
reinforcement learning of multi-agent game (多智能体博弈强化学习)
optimum strategy (最优策略)