Optimal Multi-impulse Linear Rendezvous via Reinforcement Learning

Abstract: A reinforcement learning-based approach is proposed to design multi-impulse rendezvous trajectories in linear relative motion. For relative motion in elliptical orbits, the relative state propagation is obtained directly from the state transition matrix. The rendezvous problem is formulated as a Markov decision process that reflects the fuel consumption, the transfer time, the relative state, and the dynamical model. An actor-critic algorithm is used to train a policy for generating rendezvous maneuvers. The results of numerical optimization (e.g., differential evolution) are adopted as the expert data set to accelerate the training process. By deploying a policy network, the multi-impulse rendezvous trajectories can be obtained on board. Moreover, the proposed approach is also applied to generate a feasible solution for many impulses (e.g., 20 impulses), which can be used as an initial value for further optimization. Numerical examples with random initial states show that the proposed method is much faster than the evolutionary algorithm, at the cost of slightly worse performance indices.
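The propagation and decision-process structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper treats elliptical reference orbits, whereas this sketch swaps in the simpler circular-orbit Clohessy-Wiltshire state transition matrix, restricted to in-plane motion. The names `cw_stm`, `propagate`, `step`, and `two_impulse` are hypothetical, and the reward (negative impulse magnitude) is an assumed stand-in for the fuel term of the paper's reward. The analytic two-impulse transfer is included only as a baseline for checking the transition function.

```python
import math

def cw_stm(n, t):
    """In-plane Clohessy-Wiltshire STM for state [x, y, vx, vy].
    x is radial, y is along-track, n is the reference mean motion (rad/s).
    Assumption: circular reference orbit (the paper uses elliptical orbits)."""
    s, c = math.sin(n * t), math.cos(n * t)
    return [
        [4 - 3 * c, 0.0, s / n, 2 * (1 - c) / n],
        [6 * (s - n * t), 1.0, -2 * (1 - c) / n, (4 * s - 3 * n * t) / n],
        [3 * n * s, 0.0, c, 2 * s],
        [-6 * n * (1 - c), 0.0, -2 * s, 4 * c - 3],
    ]

def propagate(state, n, t):
    """Coast for time t by multiplying the state with the STM."""
    phi = cw_stm(n, t)
    return [sum(phi[i][j] * state[j] for j in range(4)) for i in range(4)]

def step(state, dv, n, dt):
    """One decision-process transition: apply impulse dv, then coast for dt.
    Assumed reward: negative impulse magnitude (a proxy for fuel use)."""
    x, y, vx, vy = state
    next_state = propagate([x, y, vx + dv[0], vy + dv[1]], n, dt)
    reward = -math.hypot(dv[0], dv[1])
    return next_state, reward

def two_impulse(state, n, tf):
    """Analytic two-impulse transfer to the origin in time tf (baseline only)."""
    phi = cw_stm(n, tf)
    x, y, vx, vy = state
    # Solve Phi_rv * v_plus = -Phi_rr * r0 for the post-burn velocity v_plus.
    bx = -(phi[0][0] * x + phi[0][1] * y)
    by = -(phi[1][0] * x + phi[1][1] * y)
    a, b, c, d = phi[0][2], phi[0][3], phi[1][2], phi[1][3]
    det = a * d - b * c  # singular for some tf, e.g. whole reference periods
    vpx = (d * bx - b * by) / det
    vpy = (a * by - c * bx) / det
    dv1 = (vpx - vx, vpy - vy)
    arrived = propagate([x, y, vpx, vpy], n, tf)
    dv2 = (-arrived[2], -arrived[3])  # second impulse nulls the arrival velocity
    return dv1, dv2

# Illustrative values only: mean motion in rad/s, state in m and m/s.
n = 0.0011
s0 = [1000.0, -2000.0, 0.5, -1.0]
dv1, dv2 = two_impulse(s0, n, 1500.0)
s1, r1 = step(s0, dv1, n, 1500.0)  # first impulse, then coast
s2, r2 = step(s1, dv2, n, 0.0)     # terminal impulse, no coast
total_dv = -(r1 + r2)
```

In a learning setting, a policy network would choose `dv` (and possibly `dt`) at each call to `step` instead of taking them from `two_impulse`; a multi-impulse trajectory is then a sequence of such transitions.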

Authors: Longwei Xu, Gang Zhang, Shi Qiu, Xibin Cao

Affiliation: Research Center of Satellite Technology

Source: Space(Science & Technology) (空间科学与技术（英文）), EI-indexed, 2023, No. 1, pp. 362-373 (12 pages)

Funding: Supported in part by the Key Research and Development Plan of Heilongjiang Province under Grant GZ20210120.

Keywords: optimization, process, optimal

Classification number: O17 [Natural Sciences: Basic Mathematics]
