Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning

被引：30

作者：

Qu, Qingyu ^{[1
]}

Liu, Kexin ^{[2
]}

Wang, Wei ^{[2
]}

Lu, Jinhu ^{[2
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] Beihang Univ, Sch Automat Sci & Elect Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2022年 / 58卷 / 06期

基金：

美国国家科学基金会;

关键词：

Space vehicles; Heuristic algorithms; Aerodynamics; Collision avoidance; Oscillators; Orbits; Mathematical models; Aerospace control; autonomous spacecraft rendezvous (ASR); collision avoidance; deep reinforcement learning (DRL); SLIDING MODE CONTROL;

D O I：

10.1109/TAES.2022.3180271

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The rapid development of the aerospace industry puts forward the urgent need for the evolution of autonomous spacecraft rendezvous technology, which has gained significant attention recently due to increased applications in various space missions. This article studies the relative position tracking problem of the autonomous spacecraft rendezvous under the requirement of collision avoidance. An exploration-adaptive deep deterministic policy gradient (DDPG) algorithm is proposed to train a definite control strategy for this mission. Similar to the DDPG algorithm, four neural networks are used in this method, where two of them are used to generate the deterministic policy, whereas the other two are used to score the obtained policy. Differently, adaptive noise is introduced to reduce the possibility of oscillations and divergences and to cut down the unnecessary computation by weakening the exploration of stabilization problems. In addition, in order to effectively and quickly adapt to some other similar scenarios, a metalearning-based idea is introduced by fine-tuning the prior strategy. Finally, two numerical simulations show that the trained control strategy can effectively avoid the oscillation phenomenon caused by the artificial potential function. Benefiting from this, the trained control strategy based on deep reinforcement learning technology can decrease the energy consumption by 16.44% during the close proximity phase, compared with the traditional artificial potential function method. Besides, after introducing the metalearning-based idea, a strategy available for some other perturbed scenarios can be trained in a relatively short period of time, which illustrates its adaptability.

引用

页码：5823 / 5834

页数：12

共 31 条

[1]

Alfriend KT, 2010, SPACECRAFT FORMATION FLYING: DYNAMICS, CONTROL, AND NAVIGATION, P1, DOI 10.1016/B978-0-7506-8533-7.00206-2

[2]

Camille P., 2020, IEEE T CONTROL SYST, V28, P1050

[3] Suboptimal artificial potential function sliding mode control for spacecraft rendezvous with obstacle avoidance [J].

Cao, Lu ;

Qiao, Dong ;

Xu, Jingwen .

ACTA ASTRONAUTICA, 2018, 143 :133-146

[4] Finite-time control for autonomous rendezvous and docking under safe constraint [J].

Guo, Yong ;

Zhang, Dawei ;

Li, Ai-jun ;

Song, Shenmin ;

Wang, Chang-qing ;

Liu, Zongming .

AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 109

[5]

Huang X., 2021, Acta Aeronaut. Astronaut. Sin, V42, P524201

[6] Actor-Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems [J].

Kiumarsi, Bahare ;

Lewis, Frank L. .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) :140-151

[7] Optimal control of loose spacecraft formations near libration points with collision avoidance [J].

Li, Mingwu ;

Peng, Haijun ;

Zhong, Wanxie .

NONLINEAR DYNAMICS, 2016, 83 (04) :2241-2261

[8] Robust adaptive control for spacecraft final proximity maneuvers with safety constraint and input quantization [J].

Li, Qi ;

Sun, Chong ;

Song, Shuo ;

Gou, Qiuxiong ;

Niu, Zhiqi .

ISA TRANSACTIONS, 2021, 111 :35-46

[9] Robust fault-tolerant saturated control for spacecraft proximity operations with actuator saturation and faults [J].

Li, Qi ;

Yuan, Jianping ;

Sun, Chong .

ADVANCES IN SPACE RESEARCH, 2019, 63 (05) :1541-1553

[10] Sliding mode control for autonomous spacecraft rendezvous with collision avoidance [J].

Li, Qi ;

Yuan, Jianping ;

Wang, Huan .

ACTA ASTRONAUTICA, 2018, 151 :743-751

← 1 2 3 4 →