Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

Cited by: 16

Authors:

Liu, Yuhan [1]

Ma, Guangfu [1]

Lyu, Yueyong [1]

Wang, Pengyu [1]

Affiliations:

[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China

Source:

NEUROCOMPUTING | 2022 / Vol. 484

Keywords:

Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;

DOI:

10.1016/j.neucom.2021.07.099

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex and costly to identify online, which makes it impractical for control design. To address this issue, we take advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, the attitude tracking problem is first formulated as a regulation problem by introducing an augmented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation by utilizing the generated measurement data. In the theoretical analysis, it is proved that the iteration sequences of the Q value function and the control strategy converge to the optimal ones. In addition, rigorous proofs of the stability and monotonicity guarantees of the proposed control strategy are provided. Furthermore, for the purpose of online implementation, an off-policy learning scheme is employed to find the optimal Q value function approximator with a neural network structure after the data-collection phase. Numerical simulations are presented to validate the effectiveness of the proposed strategy. (c) 2021 Elsevier B.V. All rights reserved.
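To make the idea of policy-iteration Q-learning from measured data concrete, the following is a minimal Python sketch on a toy problem. It is not the authors' algorithm: it assumes a small linear discrete-time system with a quadratic cost, so that the Q function is an exact quadratic form and a least-squares fit can stand in for the paper's neural network approximator. The matrices `A`, `B`, `Qc`, `Rc`, the sample count, and all function names are hypothetical choices made for illustration. The learner only sees the recorded triples `(X, U, X_next)`; `A` and `B` are used solely to simulate the data.

```python
import numpy as np

def quad_features(z):
    """Quadratic basis so that z' H z == quad_features(z) @ theta for symmetric H."""
    i, j = np.triu_indices(len(z))
    return z[i] * z[j]

def theta_to_H(theta, n):
    """Rebuild symmetric H; off-diagonal parameters equal 2 * H_ij."""
    H = np.zeros((n, n))
    i, j = np.triu_indices(n)
    H[i, j] = theta
    return (H + H.T) / 2

def q_learning_pi(X, U, X_next, Qc, Rc, K0, iters=10):
    """Off-policy policy iteration on Q(x, u) = [x; u]' H [x; u] using recorded data only."""
    n, m = X.shape[1], U.shape[1]
    K = K0.copy()
    # Stage cost x' Qc x + u' Rc u for every recorded sample.
    cost = np.einsum('ki,ij,kj->k', X, Qc, X) + np.einsum('ki,ij,kj->k', U, Rc, U)
    Phi = np.array([quad_features(z) for z in np.hstack([X, U])])
    for _ in range(iters):
        # Policy evaluation: Q(x, u) = cost + Q(x', -K x'), solved by least squares.
        Z_next = np.hstack([X_next, -X_next @ K.T])
        Phi_next = np.array([quad_features(z) for z in Z_next])
        theta, *_ = np.linalg.lstsq(Phi - Phi_next, cost, rcond=None)
        H = theta_to_H(theta, n + m)
        # Policy improvement: minimise Q over u, giving u = -K x with K = H_uu^{-1} H_ux.
        K = np.linalg.solve(H[n:, n:], H[n:, :n])
    return K, H

# Hypothetical toy system (assumption, used only to generate data).
rng = np.random.default_rng(0)
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [0.1]])
Qc, Rc = np.eye(2), np.eye(1)

# Data-collection phase with random exploratory states and inputs.
X = rng.normal(size=(200, 2))
U = rng.normal(size=(200, 1))
X_next = X @ A.T + U @ B.T

# A is stable, so the zero gain is an admissible initial policy.
K, H = q_learning_pi(X, U, X_next, Qc, Rc, K0=np.zeros((1, 2)))
print("learned feedback gain K:", K)
```

In this linear-quadratic setting the learned gain can be checked against the solution of the discrete-time Riccati equation, which is a convenient sanity test before moving to a nonlinear plant and a neural network approximator as described in the abstract.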

Pages: 67-78

Page count: 12

50 items in total

[21] Attitude and Orbit Optimal Control of Combined Spacecraft via a Fully-Actuated System Approach
Duan Guangquan
Liu Guo-Ping
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2022, 35 (02) : 623 - 640
[22] Reinforcement learning robust optimal control for spacecraft attitude stabilization
Xiao B.
Zhang H.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (01):
[23] Finite-Time Attitude Tracking Control for Spacecraft Using Terminal Sliding Mode and Chebyshev Neural Network
Zou, An-Min
Kumar, Krishna Dev
Hou, Zeng-Guang
Liu, Xi
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (04) : 950 - 963
[24] Adaptive neural network-based trajectory tracking outer loop control for a quadrotor
Lopez-Sanchez, Ivan
Moyron, Jeronimo
Moreno-Valenzuela, Javier
AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 129
[25] Deep reinforcement learning-based attitude control for spacecraft using control moment gyros
Oghim, Snyoll
Park, Junwoo
Bang, Hyochoong
Leeghim, Henzeh
ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 1129 - 1144
[26] Forecasting-based data-driven model-free adaptive sliding mode attitude control of combined spacecraft
Gao, Han
Ma, Guangfu
Lv, Yueyong
Guo, Yanning
AEROSPACE SCIENCE AND TECHNOLOGY, 2019, 86 : 364 - 374
[27] Neural network-based learning impedance control for a robot
Xiao, NF
Todo, I
JSME INTERNATIONAL JOURNAL SERIES C-MECHANICAL SYSTEMS MACHINE ELEMENTS AND MANUFACTURING, 2001, 44 (03) : 626 - 633
[28] Fault-tolerant attitude tracking control of combined spacecraft with reaction wheels under prescribed performance
Huang, Xiuwei
Duan, Guangren
ISA TRANSACTIONS, 2020, 98 (98) : 161 - 172
[29] Reinforcement Learning-based Attitude Control for Spacecraft with Reaction Jets: Theory and Experiment
Du, Desong
Liu, Yanfang
Yuan, Qiufan
Zhao, Fuyou
Qi, Naiming
Yuhang Xuebao/Journal of Astronautics, 2024, 45 (06) : 903 - 913
[30] Fault-tolerant control of spacecraft attitude with prescribed performance based on reinforcement learning
Jin, Lei
Yang, Shaolong
Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (08) : 2404 - 2412
