Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

被引:47
|
作者
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Yang, Haoyang [2 ]
机构
[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;
D O I
10.1109/TCST.2020.3007401
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.
引用
收藏
页码:1664 / 1673
页数:10
相关论文
共 50 条
  • [1] Learning-Based Attitude Tracking Control With High-Performance Parameter Estimation
    Dong, Hongyang
    Zhao, Xiaowei
    Hu, Qinglei
    Yang, Haoyang
    Qi, Pengyuan
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (03) : 2218 - 2230
  • [2] Spacecraft attitude reorientation control method based on potential function under complex constraints
    Hua, Bing
    He, Jie
    Zhang, Hong
    Wu, Yunhua
    Chen, Zhiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 144
  • [3] Reinforcement learning-based satellite formation attitude control under multi-constraint
    Cai, Yingkai
    Low, Kay-Soon
    Wang, Zhaokui
    ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5819 - 5836
  • [4] Learning-Based 6-DOF Control for Autonomous Proximity Operations Under Motion Constraints
    Hu, Qinglei
    Yang, Haoyang
    Dong, Hongyang
    Zhao, Xiaowei
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (06) : 4097 - 4109
  • [5] Koopman-Operator-Based Safe Learning Control for Spacecraft Attitude Reorientation With Angular Velocity Constraints
    Yao, Junyu
    Hu, Qinglei
    Zheng, Jianying
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (05) : 7072 - 7085
  • [6] Event-Triggered Sliding Mode Control for Spacecraft Reorientation With Multiple Attitude Constraints
    Tan, Jin
    Zhang, Kai
    Li, Bin
    Wu, Ai-Guo
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (05) : 6031 - 6043
  • [7] Deep reinforcement learning-based attitude control for spacecraft using control moment gyros
    Oghim, Snyoll
    Park, Junwoo
    Bang, Hyochoong
    Leeghim, Henzeh
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 1129 - 1144
  • [8] Explicit reference governor based spacecraft attitude reorientation control with constraints and disturbances
    Dang, Qingqing
    Liu, Kun
    Wei, Jingbo
    ACTA ASTRONAUTICA, 2022, 190 : 455 - 464
  • [9] Reinforcement learning-based attitude control for a barbell electric sail
    Ma, Xiaolei
    Wen, Hao
    ISA TRANSACTIONS, 2024, 147 : 252 - 264
  • [10] ADP-Based Spacecraft Attitude Control Under Actuator Misalignment and Pointing Constraints
    Yang, Haoyang
    Hu, Qinglei
    Dong, Hongyang
    Zhao, Xiaowei
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (09) : 9342 - 9352