期刊文献+

Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning 被引量:2

下载PDF
导出
摘要 Aiming at addressing the problem of manoeuvring decision-making in UAV air combat,this study establishes a one-to-one air combat model,defines missile attack areas,and uses the non-deterministic policy Soft-Actor-Critic(SAC)algorithm in deep reinforcement learning to construct a decision model to realize the manoeuvring process.At the same time,the complexity of the proposed algorithm is calculated,and the stability of the closed-loop system of air combat decision-making controlled by neural network is analysed by the Lyapunov function.This study defines the UAV air combat process as a gaming process and proposes a Parallel Self-Play training SAC algorithm(PSP-SAC)to improve the generalisation performance of UAV control decisions.Simulation results have shown that the proposed algorithm can realize sample sharing and policy sharing in multiple combat environments and can significantly improve the generalisation ability of the model compared to independent training.
出处 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第1期64-81,共18页 智能技术学报(英文)
基金 National Natural Science Foundation of China,Grant/Award Number:62003267 Fundamental Research Funds for the Central Universities,Grant/Award Number:G2022KY0602 Technology on Electromagnetic Space Operations and Applications Laboratory,Grant/Award Number:2022ZX0090 Key Core Technology Research Plan of Xi'an,Grant/Award Number:21RGZN0016。
  • 相关文献

同被引文献19

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部