Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

Cited by: 16

Authors:

Liu, Yuhan [1]

Ma, Guangfu [1]

Lyu, Yueyong [1]

Wang, Pengyu [1]

Affiliations:

[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China

Source:

NEUROCOMPUTING | 2022 / Vol. 484

Keywords:

Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;

DOI:

10.1016/j.neucom.2021.07.099

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in combined spacecraft attitude takeover control is that an accurate dynamic model is highly nonlinear, complex, and costly to identify online, which makes it impractical for control design. To address this issue, we take advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, the attitude tracking problem is first formulated as a regulation problem by introducing an augmented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation by utilizing the generated measurement data. In the theoretical analysis, it is proved that the iteration sequences of the Q value function and the control strategy converge to the optimal ones. In addition, rigorous proofs of the stability and monotonicity guarantees of the proposed control strategy are provided. Furthermore, for the purpose of online implementation, an off-policy learning scheme is employed to find the optimal Q value function approximator with a neural network structure after the data-collection phase. Numerical simulations are presented to validate the effectiveness of the proposed strategy. (c) 2021 Elsevier B.V. All rights reserved.
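
As a rough illustration of the data-driven policy iteration Q-learning idea summarized in the abstract, the Python sketch below applies it to a toy linear-quadratic regulation problem. It is not the paper's implementation. The plant matrices `A` and `B`, the cost weights `Qc` and `Rc`, and all function names are hypothetical. The sketch also swaps the neural network approximator for an exact quadratic Q-function, Q(x, u) = zᵀHz with z = [x; u], so that the result stays short and checkable. The learner only sees recorded transitions (x, u, x_next), which were generated by exploratory inputs unrelated to the policy being evaluated (an off-policy setting).

```python
import numpy as np

def collect_data(A, B, n_samples, rng):
    """Generate one-step transitions from random states and exploratory inputs.
    A and B are used only to simulate the plant; the learner never reads them."""
    n, m = B.shape
    X = rng.normal(size=(n_samples, n))
    U = rng.normal(size=(n_samples, m))
    X_next = X @ A.T + U @ B.T
    return X, U, X_next

def quad_features(Z):
    """Row-wise Kronecker product z (x) z, so that features @ vec(H) = z^T H z."""
    return np.einsum("ki,kj->kij", Z, Z).reshape(len(Z), -1)

def evaluate_policy(K, X, U, X_next, Qc, Rc):
    """Policy evaluation: least-squares solution of the Bellman equation
    z_k^T H z_k = cost_k + z_{k+1}^T H z_{k+1}, with u_{k+1} = -K x_{k+1}."""
    n, m = X.shape[1], U.shape[1]
    Z = np.hstack([X, U])
    Z_next = np.hstack([X_next, -X_next @ K.T])
    cost = np.einsum("ki,ij,kj->k", X, Qc, X) + np.einsum("ki,ij,kj->k", U, Rc, U)
    h, *_ = np.linalg.lstsq(quad_features(Z) - quad_features(Z_next), cost, rcond=None)
    H = h.reshape(n + m, n + m)
    return 0.5 * (H + H.T)  # enforce symmetry against round-off

def improve_policy(H, n):
    """Policy improvement: minimizing z^T H z over u gives u = -H_uu^{-1} H_ux x."""
    return np.linalg.solve(H[n:, n:], H[n:, :n])

def pi_q_learning(X, U, X_next, Qc, Rc, K0, n_iter=20):
    """Alternate evaluation and improvement on one fixed batch of data.
    K0 must be stabilizing; H is the Q-matrix of the second-to-last policy."""
    K = K0
    for _ in range(n_iter):
        H = evaluate_policy(K, X, U, X_next, Qc, Rc)
        K = improve_policy(H, X.shape[1])
    return K, H

# Hypothetical stable toy plant, so the zero gain is an admissible initial policy.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [0.1]])
Qc = np.eye(2)
Rc = np.eye(1)

rng = np.random.default_rng(0)
X, U, X_next = collect_data(A, B, 200, rng)
K_learned, H_learned = pi_q_learning(X, U, X_next, Qc, Rc, K0=np.zeros((1, 2)))
print("learned feedback gain:", K_learned)
```

In this linear-quadratic setting the learned gain can be checked against the gain obtained from the discrete-time Riccati equation, which does require `A` and `B`. The nonlinear attitude dynamics treated in the paper have no such closed-form reference, which is where a neural network approximator comes in.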


Pages: 67-78

Page count: 12

50 items in total

[1] Learning Chebyshev neural network-based spacecraft attitude tracking control ensuring finite-time prescribed performance
Jia, Qingxian
Li, Genghuan
Yu, Dan
Ahn, Choon Ki
Zhang, Chengxi
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 148
[2] On adaptive attitude tracking control of spacecraft: A reinforcement learning based gain tuning way with guaranteed performance
Wei, Caisheng
Xiong, Yunwen
Chen, Qifeng
Xu, Dan
ADVANCES IN SPACE RESEARCH, 2023, 71 (11) : 4534 - 4548
[3] Reinforcement learning strategy for spacecraft attitude hyperagile tracking control with uncertainties
Zheng, Mohong
Wu, Yunhua
Li, Chaoyong
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
[4] Adaptive Neural Network Model-based Event-triggered Attitude Tracking Control for Spacecraft
Xie, Hongyi
Wu, Baolin
Liu, Weixing
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2021, 19 (01) : 172 - 185
[5] Neural network-based nonsingular fixed-time pose tracking control for spacecraft with actuator faults
Ji, Yuxia
Chen, Li
Zhang, Dexin
Shao, Xiaowei
ADVANCES IN SPACE RESEARCH, 2022, 69 (06) : 2555 - 2573
[6] Optimal nonlinear tracking of spacecraft attitude maneuvers
Sharma, R
Tewari, A
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2004, 12 (05) : 677 - 682
[7] Object Tracking Using Siamese Network-Based Reinforcement Learning
Park, Sung Jun
Hwang, Seung Jun
Baek, Joong-Hwan
IEEE ACCESS, 2022, 10 : 63339 - 63352
[8] Neural Network-Based Optimal Tracking Control of Continuous-Time Uncertain Nonlinear System via Reinforcement Learning
Jingang Zhao
Neural Processing Letters, 2020, 51 : 2513 - 2530
[9] Neural Network-Based Optimal Tracking Control of Continuous-Time Uncertain Nonlinear System via Reinforcement Learning
Zhao, Jingang
NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2513 - 2530
[10] Bridging Reinforcement Learning and Online Learning for Spacecraft Attitude Control
Elkins, Jacob G.
Sood, Rohan
Rumpf, Clemens
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, : 62 - 69
