Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning

Cited by: 24
Authors
Qu, Qingyu [1]
Liu, Kexin [2]
Wang, Wei [2]
Lu, Jinhu [2]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Space vehicles; Heuristic algorithms; Aerodynamics; Collision avoidance; Oscillators; Orbits; Mathematical models; Aerospace control; autonomous spacecraft rendezvous (ASR); collision avoidance; deep reinforcement learning (DRL); SLIDING MODE CONTROL;
DOI
10.1109/TAES.2022.3180271
Chinese Library Classification number
V [Aviation, Aerospace];
Discipline classification code
08 ; 0825 ;
Abstract
The rapid development of the aerospace industry puts forward the urgent need for the evolution of autonomous spacecraft rendezvous technology, which has gained significant attention recently due to increased applications in various space missions. This article studies the relative position tracking problem of the autonomous spacecraft rendezvous under the requirement of collision avoidance. An exploration-adaptive deep deterministic policy gradient (DDPG) algorithm is proposed to train a definite control strategy for this mission. Similar to the DDPG algorithm, four neural networks are used in this method, where two of them are used to generate the deterministic policy, whereas the other two are used to score the obtained policy. Differently, adaptive noise is introduced to reduce the possibility of oscillations and divergences and to cut down the unnecessary computation by weakening the exploration of stabilization problems. In addition, in order to effectively and quickly adapt to some other similar scenarios, a metalearning-based idea is introduced by fine-tuning the prior strategy. Finally, two numerical simulations show that the trained control strategy can effectively avoid the oscillation phenomenon caused by the artificial potential function. Benefiting from this, the trained control strategy based on deep reinforcement learning technology can decrease the energy consumption by 16.44% during the close proximity phase, compared with the traditional artificial potential function method. Besides, after introducing the metalearning-based idea, a strategy available for some other perturbed scenarios can be trained in a relatively short period of time, which illustrates its adaptability.
Pages: 5823 - 5834
Number of pages: 12
Related papers
50 in total
  • [41] Deep Reinforcement Learning for Collision Avoidance of Autonomous Vehicle
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Lin, Wei-Ting
    Lin, Jyun-Ting
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020
  • [42] Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles
    Sharma, Kanta Prasad
    Kumar, Indradeep
    Singh, Pavitar Parkash
    Anbazhagan, K.
    Albarakati, Hussain Mobarak
    Bhatt, Mohammed Wasim
    Ziyadullayevich, Avlokulov Anvar
    Rana, Arti
    Sivasankari, S. A.
    COMPUTERS IN HUMAN BEHAVIOR, 2024, 153
  • [43] COLLISION-AVOIDANCE AUTONOMOUS CONTROL ALGORITHM FOR MULTI-SPACECRAFT PROXIMITY OPERATIONS
    Xu, Dan-Dan
    Zhang, Jing
    Luo, Ya-Zhong
    FOURTH IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2018, PTS I-III, 2018, 165 : 123 - 136
  • [44] Research on collision avoidance method of intelligent ship navigation based on reinforcement learning
    Yuan, Zhongmi
    Ma, Lei
    Liu, Xiaoqiu
    Zhang, Weibin
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3220 - 3224
  • [45] COLREGs-compliant multiship collision avoidance based on deep reinforcement learning
    Zhao, Luman
    Roh, Myung-Il
    OCEAN ENGINEERING, 2019, 191
  • [46] Research on MASS Collision Avoidance in Complex Waters Based on Deep Reinforcement Learning
    Liu, Jiao
    Shi, Guoyou
    Zhu, Kaige
    Shi, Jiahui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [47] Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes
    Wang, Shuaijun
    Gao, Rui
    Han, Ruihua
    Chen, Shengduo
    Li, Chengyang
    Hao, Qi
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9011 - 9018
  • [48] Research on cooperative collision avoidance problem of multiple UAV based on Reinforcement Learning
    Fang Bin
    Feng XiaoFeng
    Xu Shuo
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 103 - 109
  • [49] Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning
    Ourari, Ramzi
    Cui, Kai
    Elshamanhory, Ahmed
    Koeppl, Heinz
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022
  • [50] Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
    Cai Z.
    Hu Y.
    Wen J.
    Zhang L.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01) : 236 - 245