Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning

被引:0
|
作者
Di, Peng [1 ,2 ]
Yao, Ye [1 ,2 ]
Lin, Zheng [1 ,2 ]
Yin, Zengshan [1 ,2 ]
机构
[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
J; 2; perturbation; Safe area reward; Uncertainty analysis; Far-distance rapid rendezvous; Deep reinforcement learning; TIME OPTIMAL-CONTROL;
D O I
10.1016/j.asr.2024.09.066
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper investigates the application of Deep Reinforcement Learning (DRL) in the trajectory optimization of spacecraft fardistance rapid rendezvous, and uses the most advanced DRL method Proximal Policy Optimization (PPO) to solve the continuous high-thrust minimum-fuel trajectory optimization problem. The space J2 perturbation was considered, its impact on the spacecraft's on-orbit operation and trajectory design was analyzed, and the effectiveness and accuracy of the proposed method were verified in two far-distance rapid rendezvous missions. In order to ensure the safety of the subsequent close-range operation phase, a safe area reward framework is proposed, and sparse and dense safe area reward functions are designed. The dense safe area reward function significantly improves the training efficiency of the algorithm on the basis of ensuring terminal performance. In addition, the modeling and analysis of possible uncertainties in the spacecraft's orbit operation, including observation uncertainty, state uncertainty and control uncertainty, is carried out to verify the performance of the proposed method through simulation. For uncertainties, the closed-loop performance of the policy is also evaluated by performing Monte Carlo simulations. The results show that the PPO algorithm can effectively deal with the rendezvous problem in uncertainty environments. These preliminary results demonstrate the great potential of the DRL (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:790 / 806
页数:17
相关论文
共 50 条
  • [1] Optimization control for the far-distance rapid cooperative rendezvous of spacecraft with different masses
    Feng, Weiming
    Zhao, Di
    Shi, Lei
    Yang, Kun
    Zhao, Junfeng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2015, 45 : 449 - 461
  • [2] Quantum-behaved particle swarm optimization for far-distance rapid cooperative rendezvous between two spacecraft
    Yang, Kun
    Feng, Weiming
    Liu, Gang
    Zhao, Junfeng
    Su, Piaoyi
    ADVANCES IN SPACE RESEARCH, 2018, 62 (11) : 2998 - 3011
  • [3] Optimal control for far-distance rapid cooperative rendezvous
    Feng, Wei-ming
    Ren, Fei
    Shi, Lei
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2014, 228 (14) : 2662 - 2673
  • [4] Optimization for far-distance and fuel-limited cooperative rendezvous between two coplanar spacecraft based on Lambert method
    Wang, Zhanwen
    Dong, Yuming
    Feng, Weiming
    Zhao, Junfeng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2020, 234 (07) : 1301 - 1310
  • [5] Optimization for far-distance cooperative rendezvous with multiple direction-fixed thrusts
    School of Mechanical and Automotive Engineering, Qilu University of Technology , Shandong, Jinan
    250353, China
    J. Phys. Conf. Ser., 1
  • [6] Autonomous Rendezvous Guidance via Deep Reinforcement Learning
    Wang, Xinyu
    Wang, Guohui
    Chen, Yi
    Xie, Yongfeng
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 1848 - 1853
  • [7] Spacecraft rendezvous trajectory optimization method based on EPSO
    School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China
    Yuhang Xuebao, 9 (1195-1201):
  • [8] Reentry trajectory optimization based on Deep Reinforcement Learning
    Gao, Jiashi
    Shi, Xinming
    Cheng, Zhongtao
    Xiong, Jizhang
    Liu, Lei
    Wang, Yongji
    Yang, Ye
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 2588 - 2592
  • [9] Research on autonomous decision-making method for spacecraft in the mission of rendezvous and approaching to maneuvering target based on deep reinforcement learning
    Huang, Cheng
    Xing, Aijia
    Zeng, Quanli
    Xiong, Fangyu
    ASIAN JOURNAL OF CONTROL, 2025,
  • [10] Trajectory optimization for spacecraft autonomous rendezvous and docking with compound state-triggered constraints
    Zhang, Yanquan
    Zhu, Baolong
    Cheng, Min
    Li, Shunli
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 127