期刊文献+

航天器姿态稳定强化学习鲁棒最优控制方法

Reinforcement learning robust optimal control for spacecraft attitude stabilization
原文传递
导出
摘要 针对外部干扰力矩作用下的刚体航天器姿态稳定最优控制问题,提出了一种在线强化学习的智能鲁棒控制方法。该方法基于自适应动态规划框架,设计单Critic神经网络在线地学习无干扰作用的航天器的最优姿态控制律,并设计一种新的自适应律在线估计Critic神经网络的权值,实现了近似最优的控制性能。在学习的近似最优控制律的基础上,嵌入鲁棒控制量,形成鲁棒智能控制器,并应用Lyapunov理论证明了闭环姿态控制系统是一致最终有界稳定的,且Critic神经网络的权值估计误差是收敛的。相比于采用Actor-Critic神经网络结构的自适应动态规划方法,该方法一方面削弱了对持续激励条件的依赖,另一方面降低了计算复杂度,并保证了姿态稳定控制性能对外部干扰具有较强的鲁棒性。 The problem of optimal attitude stabilization control of rigid spacecraft despite external disturbances is in⁃vestigated.An online reinforcement learning-based intelligent and robust control approach is presented via the adap⁃tive dynamic programming technique.In this approach,a critic-only neural network is developed to learn the optimal control policy of the spacecraft attitude system with external disturbance.A new estimation law is synthesized to esti⁃mate the weights of that network online.The learned controller can achieve near-optimal control performance.Then,a robust control effort is designed and added into the learned controller to formulate an intelligent and robust controller.It is proven that the closed-loop attitude system obtained from the proposed controller is uniformly ultimately bounded and that the weight estimation error of the Critic NN is convergent by Lyapunov theory.Comparison with the traditional actor-critical neural network-based control schemes shows that with less computation complexity and great robustness to external disturbances,the proposed control approach is less dependent of the persistent excitation condition.Simu⁃lation results verify the superior control performance of the proposed approach.
作者 肖冰 张海朝 XIAO Bing;ZHANG Haichao(School of Automation,Northwestern Polytechnical University,Xi’an 710072,China)
出处 《航空学报》 EI CAS CSCD 北大核心 2024年第1期51-65,共15页 Acta Aeronautica et Astronautica Sinica
关键词 航天器 姿态控制 强化学习 自适应动态规划 外部干扰 鲁棒性 spacecraft attitude control reinforcement learning adaptive dynamic programming external distur⁃bance robustness
  • 相关文献

参考文献8

二级参考文献49

  • 1邵书义,陈谋,招启军.基于干扰观测器的四旋翼无人机离散时间容错控制[J].航空学报,2020(S02):89-97. 被引量:5
  • 2童旭东,李惠康,葛玉君,高树义.神舟号载人飞船回收着陆分系统设计与性能评估[J].航天返回与遥感,2004,25(3):1-6. 被引量:15
  • 3霍伟,高为炳,程勉.一种新的机器人模型及其控制算法[J].自动化学报,1994,20(3):278-285. 被引量:2
  • 4Li F,Bainum P M.Numerical approach for solving rigid spacecraft minimum time attitude maneuvers[J].Guidance Control Dyn,1994,13(1):38-45.
  • 5Junkins J L,Turner J D.Optimal continuous torque attitude ma-neuvers[J].Guidance Control Dyn,1980,3(3):210 -217.
  • 6Vadali S R,Junkins J L.Optimal open-loop and stable feedback control of rigid spacecraft attitude maneuvers[J].Astronaut.Sc,1984,32:105-122.
  • 7Vadali S R,Junkins J L.Spacecraft large angle rotational maneuvers with optimal momentum transfer[J].Astronaut.Sci,1983,31; 217 -235.
  • 8Dixon M V,Edelbaum T,Potter J E,Vandervelde W E.Fueloptimal reorientation of axis symmetric spacecraft[J].Spacecr,1970,7:1345 -1351.
  • 9Scrivener S L,Thomson R C.Survey of time-optimal attitude maneuvers[J].Guidance Control Dyn,1994,17 (2):225 -233.
  • 10Vadali S R,Kraige L G,Junkins J L.New results on the optimal spacecraft attitude maneuver problem[J].Guidance Control Dyn,1984,7(3):378-380.

共引文献45

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部