Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning

被引:0
|
作者
Di, Peng [1 ,2 ]
Yao, Ye [1 ,2 ]
Lin, Zheng [1 ,2 ]
Yin, Zengshan [1 ,2 ]
机构
[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
J; 2; perturbation; Safe area reward; Uncertainty analysis; Far-distance rapid rendezvous; Deep reinforcement learning; TIME OPTIMAL-CONTROL;
D O I
10.1016/j.asr.2024.09.066
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper investigates the application of Deep Reinforcement Learning (DRL) in the trajectory optimization of spacecraft fardistance rapid rendezvous, and uses the most advanced DRL method Proximal Policy Optimization (PPO) to solve the continuous high-thrust minimum-fuel trajectory optimization problem. The space J2 perturbation was considered, its impact on the spacecraft's on-orbit operation and trajectory design was analyzed, and the effectiveness and accuracy of the proposed method were verified in two far-distance rapid rendezvous missions. In order to ensure the safety of the subsequent close-range operation phase, a safe area reward framework is proposed, and sparse and dense safe area reward functions are designed. The dense safe area reward function significantly improves the training efficiency of the algorithm on the basis of ensuring terminal performance. In addition, the modeling and analysis of possible uncertainties in the spacecraft's orbit operation, including observation uncertainty, state uncertainty and control uncertainty, is carried out to verify the performance of the proposed method through simulation. For uncertainties, the closed-loop performance of the policy is also evaluated by performing Monte Carlo simulations. The results show that the PPO algorithm can effectively deal with the rendezvous problem in uncertainty environments. These preliminary results demonstrate the great potential of the DRL (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:790 / 806
页数:17
相关论文
共 50 条
  • [31] Deep Reinforcement Learning based Autonomous Air-to-Air Combat using Target Trajectory Prediction
    Yoo, Jaewoong
    Kim, Donghwi
    Shim, David Hyunchul
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 2172 - 2176
  • [32] Autonomous Flow Routing Based on Deep Reinforcement Learning
    Barzegar, S.
    Shakespear-Miles, H.
    Ruiz, M.
    Velasco, L.
    2024 24TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS, ICTON 2024, 2024,
  • [33] A Deep Reinforcement Learning Based Approach for Autonomous Overtaking
    Li, Xiaoxiang
    Qiu, Xinyou
    Wang, Jian
    Shen, Yuan
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [34] Robust trajectory design and guidance for far-range rendezvous using reinforcement learning with safety and observability considerations
    Wijayatunga, Minduli Charithma
    Armellin, Roberto
    Holt, Harry
    AEROSPACE SCIENCE AND TECHNOLOGY, 2025, 159
  • [35] Reinforcement-Learning-Based Trajectory Learning in Frenet Frame for Autonomous Driving
    Yoon, Sangho
    Kwon, Youngjoon
    Ryu, Jaesung
    Kim, Sungkwan
    Choi, Sungwoo
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [36] Research on Inverse Reinforcement Learning-Based Trajectory Planning Optimization Mechanism for Autonomous Connected Vehicles
    Peng H.
    Tang M.
    Zha Q.
    Wang C.
    Wang W.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (08): : 820 - 831
  • [37] Multi-Agent Deep Reinforcement Learning Based UAV Trajectory Optimization for Differentiated Services
    Ning, Zhaolong
    Yang, Yuxuan
    Wang, Xiaojie
    Song, Qingyang
    Guo, Lei
    Jamalipour, Abbas
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5818 - 5834
  • [38] Deep reinforcement learning based trajectory optimization for magnetometer-mounted UAV to landmine detection
    Barnawi, Ahmed
    Kumar, Neeraj
    Budhiraja, Ishan
    Kumar, Krishan
    Almansour, Amal
    Alzahrani, Bander
    COMPUTER COMMUNICATIONS, 2022, 195 : 441 - 450
  • [39] Supervised reinforcement learning based trajectory tracking control for autonomous vehicles
    Mihaly, Andras
    Van Tan Vu
    Trong Tu Do
    Gaspar, Peter
    IFAC PAPERSONLINE, 2024, 58 (10): : 140 - 145
  • [40] Reinforcement learning-based framework for whale rendezvous via autonomous sensing robots
    Jadhav, Ninad
    Bhattacharya, Sushmita
    Vogt, Daniel
    Aluma, Yaniv
    Tonessen, Pernille
    Prabhakara, Akarsh
    Kumar, Swarun
    Gero, Shane
    Wood, Robert J.
    Gil, Stephanie
    SCIENCE ROBOTICS, 2024, 9 (95)