期刊文献+

航空器智能引导机动决策奖励重塑方法 被引量:1

Reward Shaping for Intelligent Maneuver Decision Generation in Aircraft Guidance
下载PDF
导出
摘要 针对使用深度强化学习进行航空器智能引导研究中存在的飞行轨迹质量差、训练效率低等问题,对应用于机动决策生成的奖励重塑方法进行了研究。首先,构建了航空器引导机动决策生成的深度强化学习模型;其次,从指令连续性和相对姿态两个角度设计了奖励重塑函数,并证明了使用重塑函数前后的策略一致性;最后,在不同类型目的地场景中进行了仿真实验。仿真结果表明:奖励重塑方法对航空器飞行轨迹质量和智能体训练效率有明显的提升。使用本方法快速训练的智能体,可以准确、高效地生成机动决策,引导航空器完成任务。 A reward shaping approach applied to maneuver decision generation was proposed to solve the poor trajectory performance and low training efficiency problems in aircraft intelligent guidance using deep reinforcement learning.Firstly,a deep reinforcement learning model for aircraft guidance maneuver decision generation was built.Secondly,the reward shaping function was designed from the two aspects of instruction continuity and relative position,and the policy consistency using reward shaping was proved.Finally,simulations were carried out in different kinds of destination scenarios.Simulation results show that flight trajectory performance and agent training efficiency are significantly improved by using reward remodeling.The agent trained by the proposed method can generate maneuver decision accurately and efficiently and guide the aircraft to complete the guidance task.
作者 王壮 艾毅 文旭光 李辉 WANG Zhuang;AI Yi;WEN Xu-guang;LI Hui(College of Air Traffic Management,Civil Aviation Flight University of China,Guanghan 618307,China;Guangxi Key Laboratory of International Join for China-ASEAN Comprehensive Transportation,Nanning University,Nanning 530000,China;College of Computer Science,Sichuan University,Chengdu 610065,China)
出处 《科学技术与工程》 北大核心 2023年第8期3535-3543,共9页 Science Technology and Engineering
基金 四川省科技计划(2021JDRC0083) 中央高校基本科研业务费基金(J2022-051) 广西中国-东盟综合交通国际联合重点实验室资助课题(21-220-21-01)。
关键词 航空器引导 机动决策 深度强化学习 奖励重塑 aircraft guidance maneuver decision deep reinforcement learning reward shaping
  • 相关文献

参考文献11

二级参考文献84

共引文献85

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部