Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

被引:16
|
作者
Liu, Yuhan [1 ]
Ma, Guangfu [1 ]
Lyu, Yueyong [1 ]
Wang, Pengyu [1 ]
机构
[1] Harbin Inst Technol, Sch Astronaut, Harbin 150001, Peoples R China
关键词
Combined spacecraft; Attitude tracking; Reinforcement learning; Q-learning; TARGET; IDENTIFICATION; POSTCAPTURE; ROBOT; SYSTEMS;
D O I
10.1016/j.neucom.2021.07.099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in the context of combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex and costly to identify online, which makes it impractical for control design. To address this issue, we take the advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, and thus the online inertia parameter identification procedure is avoided. More specifically, first, the attitude tracking is formulated as a regulation problem by introducing an argumented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation by utilizing the generated measurement data. In theoretical analysis, it is proved that the iteration sequences of Q value function and control strategy can converge to the optimal ones. In addition, rigorous proof of the stability and monotonicity guarantees of the proposed control strategy are also provided. Furthermore, for the purpose of online implementation, off-policy learning scheme is employed to find the optimal Q value function approximator with neural network structure after data-collection phase. Numerical simulations are exhibited to validate the effectiveness of the proposed strategy.(c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:67 / 78
页数:12
相关论文
共 50 条
  • [1] Learning Chebyshev neural network-based spacecraft attitude tracking control ensuring finite-time prescribed performance
    Jia, Qingxian
    Li, Genghuan
    Yu, Dan
    Ahn, Choon Ki
    Zhang, Chengxi
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 148
  • [2] On adaptive attitude tracking control of spacecraft: A reinforcement learning based gain tuning way with guaranteed performance
    Wei, Caisheng
    Xiong, Yunwen
    Chen, Qifeng
    Xu, Dan
    ADVANCES IN SPACE RESEARCH, 2023, 71 (11) : 4534 - 4548
  • [3] Reinforcement learning strategy for spacecraft attitude hyperagile tracking control with uncertainties
    Zheng, Mohong
    Wu, Yunhua
    Li, Chaoyong
    AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
  • [4] Adaptive Neural Network Model-based Event-triggered Attitude Tracking Control for Spacecraft
    Xie, Hongyi
    Wu, Baolin
    Liu, Weixing
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2021, 19 (01) : 172 - 185
  • [5] Neural network-based nonsingular fixed-time pose tracking control for spacecraft with actuator faults
    Ji, Yuxia
    Chen, Li
    Zhang, Dexin
    Shao, Xiaowei
    ADVANCES IN SPACE RESEARCH, 2022, 69 (06) : 2555 - 2573
  • [6] Optimal nonlinear tracking of spacecraft attitude maneuvers
    Sharma, R
    Tewari, A
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2004, 12 (05) : 677 - 682
  • [7] Object Tracking Using Siamese Network-Based Reinforcement Learning
    Park, Sung Jun
    Hwang, Seung Jun
    Baek, Joong-Hwan
    IEEE ACCESS, 2022, 10 : 63339 - 63352
  • [8] Neural Network-Based Optimal Tracking Control of Continuous-Time Uncertain Nonlinear System via Reinforcement Learning
    Jingang Zhao
    Neural Processing Letters, 2020, 51 : 2513 - 2530
  • [9] Neural Network-Based Optimal Tracking Control of Continuous-Time Uncertain Nonlinear System via Reinforcement Learning
    Zhao, Jingang
    NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2513 - 2530
  • [10] Bridging Reinforcement Learning and Online Learning for Spacecraft Attitude Control
    Elkins, Jacob G.
    Sood, Rohan
    Rumpf, Clemens
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, : 62 - 69