Tube-based robust reinforcement learning for autonomous maneuver decision for UCAVs
Authors: Lixin WANG, Sizhuang ZHENG, Haiyin PIAO, Changqian LU, Ting YUE, Hailiang LIU. Chinese Journal of Aeronautics (SCIE, EI, CAS, CSCD), 2024, Issue 7, pp. 391-405 (15 pages)
Reinforcement Learning (RL) algorithms enhance the intelligence of air combat Autonomous Maneuver Decision (AMD) policy, but they may underperform in target combat environments with disturbances. To enhance the robustness of the AMD strategy learned by RL, this study proposes a Tube-based Robust RL (TRRL) method. First, this study introduces a tube to describe reachable trajectories under disturbances, formulates a method for calculating tubes based on sum-of-squares programming, and proposes the TRRL algorithm that enhances robustness by utilizing tube size as a quantitative indicator. Second, this study introduces offline techniques for regressing the tube size function and establishing a tube library before policy learning, aiming to eliminate complex online tube solving and reduce the computational burden during training. Furthermore, an analysis of the tube library demonstrates that the mitigated AMD strategy achieves greater robustness, as smaller tube sizes correspond to more cautious actions. This finding highlights that TRRL enhances robustness by promoting a conservative policy. To effectively balance aggressiveness and robustness, the proposed TRRL algorithm introduces a "laziness factor" as a weight of robustness. Finally, combat simulations in an environment with disturbances confirm that the AMD policy learned by the TRRL algorithm exhibits superior air combat performance compared to selected robust RL baselines.
Keywords: Air combat; Autonomous maneuver decision; Robust reinforcement learning; Tube-based algorithm; Combat simulation
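
The abstract does not state the TRRL objective, so the following is only an illustrative sketch of the trade-off it describes: a tube size looked up from a precomputed library acts as a robustness penalty, and a "laziness factor" weights that penalty against the task reward. The function names, the maneuver labels, the numeric tube sizes, and the linear penalty form are all assumptions made for illustration and are not taken from the paper.

```python
from typing import Dict

# Hypothetical tube library: maneuver label -> precomputed tube size.
# The labels and numbers are made up; the paper builds its library offline.
TUBE_LIBRARY: Dict[str, float] = {"gentle_turn": 0.2, "hard_turn": 0.9}


def shaped_reward(task_reward: float, tube_size: float, laziness: float) -> float:
    """Assumed form: task reward minus a tube-size penalty weighted by laziness."""
    if laziness < 0:
        raise ValueError("laziness must be non-negative")
    return task_reward - laziness * tube_size


def pick_action(task_rewards: Dict[str, float], laziness: float) -> str:
    """Return the maneuver with the highest shaped reward."""
    return max(
        task_rewards,
        key=lambda a: shaped_reward(task_rewards[a], TUBE_LIBRARY[a], laziness),
    )


# Made-up task rewards: the aggressive maneuver pays more but has a larger tube.
rewards = {"gentle_turn": 1.0, "hard_turn": 1.5}
aggressive_choice = pick_action(rewards, laziness=0.0)
cautious_choice = pick_action(rewards, laziness=1.0)
```

Under these assumed numbers, a laziness factor of zero ignores the tube size and selects the more aggressive maneuver, while a larger factor shifts the choice toward the maneuver with the smaller tube, which matches the abstract's observation that smaller tubes correspond to more cautious actions.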