Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

Cited by: 16

Authors:

Liu, Yuhan [1]

Ma, Guangfu [1]

Lyu, Yueyong [1]

Wang, Pengyu [1]

Affiliations:

[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China

Source:

NEUROCOMPUTING | 2022 / Vol. 484

Keywords:

Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;

DOI:

10.1016/j.neucom.2021.07.099

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex and costly to identify online, which makes it impractical for control design. To address this issue, we take advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, the attitude tracking problem is first formulated as a regulation problem by introducing an augmented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation using the generated measurement data. In the theoretical analysis, it is proved that the iteration sequences of the Q value function and the control strategy converge to the optimal ones. In addition, rigorous proofs of the stability and monotonicity guarantees of the proposed control strategy are provided. Furthermore, for the purpose of online implementation, an off-policy learning scheme is employed to find the optimal Q value function approximator with a neural network structure after the data-collection phase. Numerical simulations are presented to validate the effectiveness of the proposed strategy. (c) 2021 Elsevier B.V. All rights reserved.

Pages: 67-78

Page count: 12
