The scale of ground-to-air confrontation task assignments is large and needs to deal with many concurrent task assignments and random events.Aiming at the problems where existing task assignment methods are applied to...The scale of ground-to-air confrontation task assignments is large and needs to deal with many concurrent task assignments and random events.Aiming at the problems where existing task assignment methods are applied to ground-to-air confrontation,there is low efficiency in dealing with complex tasks,and there are interactive conflicts in multiagent systems.This study proposes a multiagent architecture based on a one-general agent with multiple narrow agents(OGMN)to reduce task assignment conflicts.Considering the slow speed of traditional dynamic task assignment algorithms,this paper proposes the proximal policy optimization for task assignment of general and narrow agents(PPOTAGNA)algorithm.The algorithm based on the idea of the optimal assignment strategy algorithm and combined with the training framework of deep reinforcement learning(DRL)adds a multihead attention mechanism and a stage reward mechanism to the bilateral band clipping PPO algorithm to solve the problem of low training efficiency.Finally,simulation experiments are carried out in the digital battlefield.The multiagent architecture based on OGMN combined with the PPO-TAGNA algorithm can obtain higher rewards faster and has a higher win ratio.By analyzing agent behavior,the efficiency,superiority and rationality of resource utilization of this method are verified.展开更多
瞄准制衡强敌“马赛克战”“决策中心战”等新技术驱动的作战概念及威胁挑战,聚焦未来跨域联合作战指挥控制(command and control,C2)全流程决策需求,遵循平行智能理论框架,提出了基于筹划-准备-执行-评估(planning-readiness-execution...瞄准制衡强敌“马赛克战”“决策中心战”等新技术驱动的作战概念及威胁挑战,聚焦未来跨域联合作战指挥控制(command and control,C2)全流程决策需求,遵循平行智能理论框架,提出了基于筹划-准备-执行-评估(planning-readiness-execution-assessment,PREA)环与观察-判断-决策-行动(observe-orient-decide-act,OODA)环的平行指挥控制与管理(command&control and management,C2M)新范式,以期实现智能机器辅助指挥员作战全流程的分层次、个性化决策支持,减轻指挥员认知负担、降低决策复杂度,实现机器对指挥员“人脑”的智能扩展与增强,为塑造全局决策优势提供牵引和支撑。展开更多
基金the Project of National Natural Science Foundation of China(Grant No.62106283)the Project of National Natural Science Foundation of China(Grant No.72001214)to provide fund for conducting experimentsthe Project of Natural Science Foundation of Shaanxi Province(Grant No.2020JQ-484)。
文摘The scale of ground-to-air confrontation task assignments is large and needs to deal with many concurrent task assignments and random events.Aiming at the problems where existing task assignment methods are applied to ground-to-air confrontation,there is low efficiency in dealing with complex tasks,and there are interactive conflicts in multiagent systems.This study proposes a multiagent architecture based on a one-general agent with multiple narrow agents(OGMN)to reduce task assignment conflicts.Considering the slow speed of traditional dynamic task assignment algorithms,this paper proposes the proximal policy optimization for task assignment of general and narrow agents(PPOTAGNA)algorithm.The algorithm based on the idea of the optimal assignment strategy algorithm and combined with the training framework of deep reinforcement learning(DRL)adds a multihead attention mechanism and a stage reward mechanism to the bilateral band clipping PPO algorithm to solve the problem of low training efficiency.Finally,simulation experiments are carried out in the digital battlefield.The multiagent architecture based on OGMN combined with the PPO-TAGNA algorithm can obtain higher rewards faster and has a higher win ratio.By analyzing agent behavior,the efficiency,superiority and rationality of resource utilization of this method are verified.
文摘瞄准制衡强敌“马赛克战”“决策中心战”等新技术驱动的作战概念及威胁挑战,聚焦未来跨域联合作战指挥控制(command and control,C2)全流程决策需求,遵循平行智能理论框架,提出了基于筹划-准备-执行-评估(planning-readiness-execution-assessment,PREA)环与观察-判断-决策-行动(observe-orient-decide-act,OODA)环的平行指挥控制与管理(command&control and management,C2M)新范式,以期实现智能机器辅助指挥员作战全流程的分层次、个性化决策支持,减轻指挥员认知负担、降低决策复杂度,实现机器对指挥员“人脑”的智能扩展与增强,为塑造全局决策优势提供牵引和支撑。