期刊文献+

基于多智能体深度强化学习的分布式协同干扰功率分配算法 被引量:4

Allocation Algorithm of Distributed Cooperative Jamming Power Based on Multi-Agent Deep Reinforcement Learning
下载PDF
导出
摘要 针对战场通信对抗协同干扰中的干扰功率分配难题,本文基于多智能体深度强化学习设计了一种分布式协同干扰功率分配算法.具体地,将通信干扰功率分配问题构建为完全协作的多智能体任务,采用集中式训练、分布式决策的方式缓解多智能体系统环境非平稳、决策维度高的问题,减少智能体之间的通信开销,并加入最大策略熵准则控制各智能体的探索效率,以最大化累积干扰奖励和最大化干扰策略熵为优化目标,加速各智能体间协同策略的学习.仿真结果表明,所提出的分布式算法能有效解决高维协同干扰功率分配难题,相比于已有的集中式分配算法具有学习速度更快、波动性更小等优点,且相同条件下干扰效率可高出集中式算法16.8%. In order to solve the problem of jamming power allocation in battlefield cooperative communication countermeasures,this paper designs a distributed cooperative jamming power allocation method based on multi-agent deep reinforcement learning.Specifically,modeling the communication jamming power allocation as a fully cooperative multi-agent task,then the framework of centralized training and distributed decision-making is adopted to alleviate the characteristic of non-stationary environment and high dimensions in multi-agent system,reducing the communication overhead between agents as well,and introducing the maximum policy entropy criterion to control the exploration efficiency of each agent.Regarding maximizing the cumulative jamming reward and maximizing the entropy of the jamming policy as the optimization goal,then accelerates the learning of cooperative strategies.Simulation results indicate the proposed distributed method can effectively solve the high-dimensional cooperative jamming power allocation problem.Compared with the existing centralized allocation method,it has faster learning speed and less volatility,and the jamming efficiency is 16.8%higher than that of the centralized method under the same conditions.
作者 饶宁 许华 蒋磊 宋佰霖 史蕴豪 RAO Ning;XU Hua;JIANG Lei;SONG Bai-lin;SHI Yun-hao(Information and Navigation College of Air Force Engineering University,Xi’an,Shaanxi 710077,China)
出处 《电子学报》 EI CAS CSCD 北大核心 2022年第6期1319-1330,共12页 Acta Electronica Sinica
关键词 通信对抗 协同功率分配 多智能体深度强化学习 分布式策略 最大策略熵 communication countermeasures cooperative resource allocation multi-agent deep reinforcement learning distributed strategy maximum policy entropy
  • 相关文献

参考文献7

二级参考文献13

共引文献105

同被引文献81

引证文献4

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部