Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

Cited by: 47
Authors
Dong, Hongyang [1]
Zhao, Xiaowei [1]
Yang, Haoyang [2]
Affiliations
[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;
DOI
10.1109/TCST.2020.3007401
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.
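The abstract states that barrier functions encode the constraint information into the cost function. The following is a minimal sketch of that general idea for an angular velocity limit only, not the authors' formulation: the logarithmic barrier form, the quadratic weights, and the names barrier, running_cost, and omega_max are all assumptions chosen for illustration.

```python
import math

def barrier(z, limit):
    """Assumed log-barrier: zero at z = 0, unbounded as |z| approaches limit."""
    if abs(z) >= limit:
        return math.inf  # constraint violated or on the boundary
    return math.log(limit**2 / (limit**2 - z**2))

def running_cost(omega, u, omega_max, q_weight=1.0, r_weight=1.0):
    """Quadratic state/control cost plus a barrier term per angular velocity axis.

    omega: angular velocity components (hypothetical units, e.g. rad/s)
    u: control torque components
    omega_max: per-axis angular velocity limit (illustrative)
    """
    quadratic = q_weight * sum(w * w for w in omega) + r_weight * sum(x * x for x in u)
    penalty = sum(barrier(w, omega_max) for w in omega)
    return quadratic + penalty

if __name__ == "__main__":
    # The cost grows sharply as one axis nears the limit of 1.0.
    for w in (0.0, 0.5, 0.9, 0.99):
        print(w, running_cost([w, 0.0, 0.0], [0.0, 0.0, 0.0], omega_max=1.0))
```

Because the penalty is unbounded at the limit, any policy with finite cost keeps the angular velocity strictly inside the bound, which is the intuition behind encoding constraints in the cost.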
Pages: 1664-1673
Number of pages: 10
Related papers
50 items in total
  • [31] Feedback Control for Spacecraft Reorientation Under Attitude Constraints Via Convex Potentials
    Lee, Unsik
    Mesbahi, Mehran
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2014, 50 (04) : 2578 - 2592
  • [32] Learning-Based Quantum Control for Optimal Pure State Manipulation
    Chen, Anthony Siming
    Herrmann, Guido
    Vamvoudakis, Kyriakos G.
    Vijayan, Jayadev
    IEEE Control Systems Letters, 2024, 8 : 1319 - 1324
  • [34] Reinforcement learning-based control for waste biorefining processes under uncertainty
    Ji Gao
    Abigael Wahlen
    Caleb Ju
    Yongsheng Chen
    Guanghui Lan
    Zhaohui Tong
    Communications Engineering, 3 (1):
  • [35] Koopman-Operator-Based Safe Learning Control for Spacecraft Attitude Reorientation With Angular Velocity Constraints
    Yao, Junyu
    Hu, Qinglei
    Zheng, Jianying
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (05) : 7072 - 7085
  • [36] Reinforcement learning-based optimal fault-tolerant control for offshore platforms
    Ziaei, Amin
    Kharrati, Hamed
    Salim, Mina
    Rahimi, Afshin
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2022, 236 (06) : 1187 - 1196
  • [37] Optimal Reinforcement Learning-Based Control Algorithm for a Class of Nonlinear Macroeconomic Systems
    Ding, Qing
    Jahanshahi, Hadi
    Wang, Ye
    Bekiros, Stelios
    Alassafi, Madini O.
    MATHEMATICS, 2022, 10 (03)
  • [38] Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems
    Zhong-Xin Fan
    Lintao Tang
    Shihua Li
    Rongjie Liu
    Neural Computing and Applications, 2023, 35 : 23987 - 23996
  • [39] A reinforcement learning-based scheme for adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 57 - 62
  • [40] Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle
    Wang, Ning
    Gao, Ying
    Zhao, Hong
    Ahn, Choon Ki
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3034 - 3045