摘要
研究了编队防空目标分配问题,采用马尔科夫决策模型描述了编队防空动态目标分配过程,构建了编队防空目标分配强化学习系统,描述了系统组成,给出了基于Q-Learning算法的模型求解方法,并对模型效果进行了仿真分析,证明了该模型的有效性。
The target assignment of formation air defense is studied,markov decision model is used to describe the dynamic target assignment process of formation air defense,the formation air defense target allocation reinforcement learning system is constructed,the system composition is described,the model solving method based on Q-Learning algorithm is given,and the model affect is simulated and analyzed,which proves the effectiveness of the model.
作者
李双霖
李琳
潘浩
张修社
韩春雷
LI Shuanglin;LI Lin;PAN Hao;ZHANG Xiushe;HAN Chunlei
出处
《现代导航》
2022年第3期207-211,共5页
Modern Navigation
基金
国防科技基础加强计划资助。