Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning

被引:24
|
作者
Qu, Qingyu [1 ]
Liu, Kexin [2 ]
Wang, Wei [2 ]
Lu, Jinhu [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
基金
美国国家科学基金会;
关键词
Space vehicles; Heuristic algorithms; Aerodynamics; Collision avoidance; Oscillators; Orbits; Mathematical models; Aerospace control; autonomous spacecraft rendezvous (ASR); collision avoidance; deep reinforcement learning (DRL); SLIDING MODE CONTROL;
D O I
10.1109/TAES.2022.3180271
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The rapid development of the aerospace industry puts forward the urgent need for the evolution of autonomous spacecraft rendezvous technology, which has gained significant attention recently due to increased applications in various space missions. This article studies the relative position tracking problem of the autonomous spacecraft rendezvous under the requirement of collision avoidance. An exploration-adaptive deep deterministic policy gradient (DDPG) algorithm is proposed to train a definite control strategy for this mission. Similar to the DDPG algorithm, four neural networks are used in this method, where two of them are used to generate the deterministic policy, whereas the other two are used to score the obtained policy. Differently, adaptive noise is introduced to reduce the possibility of oscillations and divergences and to cut down the unnecessary computation by weakening the exploration of stabilization problems. In addition, in order to effectively and quickly adapt to some other similar scenarios, a metalearning-based idea is introduced by fine-tuning the prior strategy. Finally, two numerical simulations show that the trained control strategy can effectively avoid the oscillation phenomenon caused by the artificial potential function. Benefiting from this, the trained control strategy based on deep reinforcement learning technology can decrease the energy consumption by 16.44% during the close proximity phase, compared with the traditional artificial potential function method. Besides, after introducing the metalearning-based idea, a strategy available for some other perturbed scenarios can be trained in a relatively short period of time, which illustrates its adaptability.
引用
收藏
页码:5823 / 5834
页数:12
相关论文
共 50 条
  • [21] An Aircraft Collision Avoidance Method Based on Deep Reinforcement Learning
    Liu, Zuocheng
    Neretin, Evgeny
    Gao, Xiaoguang
    Wan, Kaifang
    2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 241 - 246
  • [22] OPTIMIZED COLLISION AVOIDANCE OF SPACECRAFT IN ULTRA CLOSE PROXIMITY FOR FAILED SATELLITE
    Chu, Xiaoyu
    Zhang, Jingrui
    Liu, Fei
    SPACEFLIGHT MECHANICS 2016, PTS I-IV, 2016, 158 : 4033 - 4048
  • [23] Natural Motion-based Trajectories for Automatic Spacecraft Collision Avoidance During Proximity Operations
    Mote, Mark L.
    Hays, Christopher W.
    Collins, Alexander
    Feron, Eric
    Hobbs, Kerianne L.
    2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
  • [24] SPACECRAFT RENDEZVOUS GUIDANCE IN CLUTTERED ENVIRONMENTS VIA REINFORCEMENT LEARNING
    Broida, Jacob
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1777 - 1788
  • [25] UNCERTAINTY IN COLLISION AVOIDANCE MANEUVERING
    TAYLOR, DH
    JOURNAL OF NAVIGATION, 1990, 43 (02): : 238 - 245
  • [26] Collision probability based optimal collision avoidance maneuver in rendezvous and docking
    Wang, Hua
    Li, Hai-Yang
    Tang, Guo-Jin
    Yuhang Xuebao/Journal of Astronautics, 2008, 29 (01): : 220 - 223
  • [27] Deep Reinforcement Learning for Spacecraft Proximity Operations Guidance
    Hovell, Kirk
    Ulrich, Steve
    JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 254 - 264
  • [28] Research of Automatic Collision Avoidance based on Ship Maneuvering
    Liu, Ting
    Huang, Zhen
    Tang, Wenlong
    Wang, Zhao
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 232 - 235
  • [29] Deep-Reinforcement-Learning-Based Collision Avoidance in UAV Environment
    Ouahouah, Sihem
    Bagaa, Miloud
    Prados-Garzon, Jonathan
    Taleb, Tarik
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (06) : 4015 - 4030
  • [30] Deep reinforcement learning based collision avoidance system for autonomous ships
    Wang, Yong
    Xu, Haixiang
    Feng, Hui
    He, Jianhua
    Yang, Haojie
    Li, Fen
    Yang, Zhen
    OCEAN ENGINEERING, 2024, 292