Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

被引:16
|
作者
Liu, Yuhan [1 ]
Ma, Guangfu [1 ]
Lyu, Yueyong [1 ]
Wang, Pengyu [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
关键词
Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;
D O I
10.1016/j.neucom.2021.07.099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in the context of combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex and costly to identify online, which makes it impractical for control design. To address this issue, we take the advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, first, the attitude tracking is formulated as a regulation problem by introducing an argumented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation by utilizing the generated measurement data. In theoretical analysis, it is proved that the iteration sequences of Q value function and control strategy can converge to the optimal ones. In addition, rigorous proof of the stability and monotonicity guarantees of the proposed control strategy are also provided. Furthermore, for the purpose of online implementation, off-policy learning scheme is employed to find the optimal Q value function approximator with neural network structure after data-collection phase. Numerical simulations are exhibited to validate the effectiveness of the proposed strategy.(c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:67 / 78
页数:12
相关论文
共 50 条
  • [41] Robust Control Allocation in Attitude Fault-Tolerant Control for Combined Spacecraft Under Measurement Uncertainty
    Huang, Xiu-Wei
    Duan, Guang-Ren
    IEEE ACCESS, 2019, 7 : 156191 - 156206
  • [42] Spacecraft Attitude Maneuvers using Composite Adaptive Control with Invariant Sliding Manifold
    Dando, Aaron
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 4535 - 4540
  • [43] Attitude tracking control for spacecraft with robust adaptive RBFNN augmenting sliding mode control
    Zou, Yao
    AEROSPACE SCIENCE AND TECHNOLOGY, 2016, 56 : 197 - 204
  • [44] Flexible Spacecraft Model and Robust Control Techniques for Attitude Maneuvers
    Morga, Pierangela
    Mancini, Mauro
    Capello, Elisa
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 1120 - 1126
  • [45] Brain-inspired learning rules for spiking neural network-based control: a tutorial
    Lee, Choongseop
    Park, Yuntae
    Yoon, Sungmin
    Lee, Jiwoon
    Cho, Youngho
    Park, Cheolsoo
    BIOMEDICAL ENGINEERING LETTERS, 2025, 15 (01) : 37 - 55
  • [46] Data-driven-based attitude control of combined spacecraft with noncooperative target
    Jiang, Huaiyuan
    Zhou, Bin
    Li, Dongxu
    Duan, Guangren
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (16) : 5801 - 5819
  • [47] Neural-Network-Based Reinforcement Learning Control for Path Following of Underactuated Ships
    Zhang Lixing
    Qiao Lei
    Chen Jianliang
    Zhang Weidong
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 5786 - 5791
  • [48] Adaptive neural network tracking control-based reinforcement learning for wheeled mobile robots with skidding and slipping
    Li, Shu
    Ding, Liang
    Gao, Haibo
    Chen, Chao
    Liu, Zhen
    Deng, Zongquan
    NEUROCOMPUTING, 2018, 283 : 20 - 30
  • [49] Robotic Curved Surface Tracking with a Neural Network for Angle Identification and Constant Force Control based on Reinforcement Learning
    Tie Zhang
    Meng Xiao
    Yan-biao Zou
    Jia-dong Xiao
    Shou-yan Chen
    International Journal of Precision Engineering and Manufacturing, 2020, 21 : 869 - 882
  • [50] Robotic Curved Surface Tracking with a Neural Network for Angle Identification and Constant Force Control based on Reinforcement Learning
    Zhang Tie
    Xiao Meng
    Zou Yan-biao
    Xiao Jia-dong
    Chen Shou-yan
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2020, 21 (05) : 869 - 882