Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

被引:16
|
作者
Liu, Yuhan [1 ]
Ma, Guangfu [1 ]
Lyu, Yueyong [1 ]
Wang, Pengyu [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
关键词
Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;
D O I
10.1016/j.neucom.2021.07.099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in the context of combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex and costly to identify online, which makes it impractical for control design. To address this issue, we take the advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, first, the attitude tracking is formulated as a regulation problem by introducing an argumented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation by utilizing the generated measurement data. In theoretical analysis, it is proved that the iteration sequences of Q value function and control strategy can converge to the optimal ones. In addition, rigorous proof of the stability and monotonicity guarantees of the proposed control strategy are also provided. Furthermore, for the purpose of online implementation, off-policy learning scheme is employed to find the optimal Q value function approximator with neural network structure after data-collection phase. Numerical simulations are exhibited to validate the effectiveness of the proposed strategy.(c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:67 / 78
页数:12
相关论文
共 50 条
  • [21] Attitude and Orbit Optimal Control of Combined Spacecraft via a Fully-Actuated System Approach
    Duan Guangquan
    Liu Guo-Ping
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2022, 35 (02) : 623 - 640
  • [22] Reinforcement learning robust optimal control for spacecraft attitude stabilization
    Xiao B.
    Zhang H.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (01):
  • [23] Finite-Time Attitude Tracking Control for Spacecraft Using Terminal Sliding Mode and Chebyshev Neural Network
    Zou, An-Min
    Kumar, Krishna Dev
    Hou, Zeng-Guang
    Liu, Xi
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (04): : 950 - 963
  • [24] Adaptive neural network-based trajectory tracking outer loop control for a quadrotor
    Lopez-Sanchez, Ivan
    Moyron, Jeronimo
    Moreno-Valenzuela, Javier
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 129
  • [25] Deep reinforcement learning-based attitude control for spacecraft using control moment gyros
    Oghim, Snyoll
    Park, Junwoo
    Bang, Hyochoong
    Leeghim, Henzeh
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 1129 - 1144
  • [26] Forecasting-based data-driven model-free adaptive sliding mode attitude control of combined spacecraft
    Gao, Han
    Ma, Guangfu
    Lv, Yueyong
    Guo, Yanning
    AEROSPACE SCIENCE AND TECHNOLOGY, 2019, 86 : 364 - 374
  • [27] Neural network-based learning impedance control for a robot
    Xiao, NF
    Todo, I
    JSME INTERNATIONAL JOURNAL SERIES C-MECHANICAL SYSTEMS MACHINE ELEMENTS AND MANUFACTURING, 2001, 44 (03): : 626 - 633
  • [28] Fault-tolerant attitude tracking control of combined spacecraft with reaction wheels under prescribed performance
    Huang, Xiuwei
    Duan, Guangren
    ISA TRANSACTIONS, 2020, 98 (98) : 161 - 172
  • [29] Reinforcement Learning-based Attitude Control for Spacecraft with Reaction Jets: Theory and Experiment
    Du, Desong
    Liu, Yanfang
    Yuan, Qiufan
    Zhao, Fuyou
    Qi, Naiming
    Yuhang Xuebao/Journal of Astronautics, 2024, 45 (06): : 903 - 913
  • [30] Fault-tolerant control of spacecraft attitude with prescribed performance based on reinforcement learning
    Jin, Lei
    Yang, Shaolong
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (08): : 2404 - 2412