摘要
The libration control problem of space tether system(STS)for post-capture of payload is studied.The process of payload capture will cause tether swing and deviation from the nominal position,resulting in the failure of capture mission.Due to unknown inertial parameters after capturing the payload,an adaptive optimal control based on policy iteration is developed to stabilize the uncertain dynamic system in the post-capture phase.By introducing integral reinforcement learning(IRL)scheme,the algebraic Riccati equation(ARE)can be online solved without known dynamics.To avoid computational burden from iteration equations,the online implementation of policy iteration algorithm is provided by the least-squares solution method.Finally,the effectiveness of the algorithm is validated by numerical simulations.
研究了基于空间系绳系统载荷捕获后的摆振控制问题。载荷捕获会造成系绳的摆振并导致系绳偏离标称位置。由于捕获后系统存在未知的动力学参数,提出了基于策略迭代的自适应最优控制算法,应用于载荷捕获后系绳系统摆动的稳定控制。通过引入积分强化学习方法,在系统动力学未知情况下在线求解代数黎卡提方程。为了避免迭代方程求解的计算负担,采用最小二乘方法在线实施策略迭代算法。最后,通过数值仿真验证了算法的有效性。
基金
supported by the National Natural Science Foundation of China(No.62111530051)
the Fundamental Research Funds for the Central Universities(No.3102017JC06002)
the Shaanxi Science and Technology Program,China(No.2017KW-ZD-04).