Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

Cited by: 47
Authors
Dong, Hongyang [1]
Zhao, Xiaowei [1]
Yang, Haoyang [2]
Affiliations
[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
Funding
Engineering and Physical Sciences Research Council (UK)
Keywords
Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;
DOI
10.1109/TCST.2020.3007401
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.
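The abstract states that barrier functions encode the constraint information into the cost function. The following is a minimal sketch of that general idea only, not the formulation used in the article: the logarithmic barrier forms, the scalar error and torque terms, and every name and weight (`rate_barrier`, `zone_barrier`, `stage_cost`, `q_w`, `r_w`) are assumptions chosen for illustration.

```python
import math

def rate_barrier(omega, omega_max):
    """Assumed log barrier for an angular velocity limit |omega| < omega_max.

    Zero at omega = 0 and unbounded as |omega| approaches omega_max.
    """
    if abs(omega) >= omega_max:
        return math.inf
    return math.log(omega_max**2 / (omega_max**2 - omega**2))

def zone_barrier(cos_angle, cos_min_angle):
    """Assumed log barrier for an attitude forbidden zone.

    cos_angle is the cosine of the angle between the instrument boresight
    and the forbidden direction; the constraint is cos_angle < cos_min_angle.
    """
    if cos_angle >= cos_min_angle:
        return math.inf
    return -math.log((cos_min_angle - cos_angle) / (cos_min_angle + 1.0))

def stage_cost(err, torque, omega, omega_max, cos_angle, cos_min_angle,
               q_w=1.0, r_w=0.1):
    """Quadratic error/effort cost plus barrier terms (illustrative weights)."""
    quadratic = q_w * err**2 + r_w * torque**2
    return (quadratic
            + rate_barrier(omega, omega_max)
            + zone_barrier(cos_angle, cos_min_angle))

if __name__ == "__main__":
    # Well inside both constraints: the cost is finite.
    print(stage_cost(0.2, 0.05, omega=0.1, omega_max=0.5,
                     cos_angle=0.0, cos_min_angle=0.8))
    # On the angular velocity limit: the cost is infinite.
    print(stage_cost(0.2, 0.05, omega=0.5, omega_max=0.5,
                     cos_angle=0.0, cos_min_angle=0.8))
```

Because the barrier terms grow without bound at the constraint boundary, any policy with finite cost in this sketch keeps the state strictly inside the admissible set, which is the intuition behind encoding constraints in the cost.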
Pages: 1664-1673
Page count: 10
Related papers
50 in total
  • [21] Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4713 - 4725
  • [22] Reinforcement Learning-Based Finite-Time Optimal Containment Control for Underactuated Surface Vehicles With Guaranteed Performance
    Chen, Lin
    Dong, Chao
    Dai, Shi-Lu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12) : 7206 - 7217
  • [23] Data-Driven Immersion and Invariance Adaptive Attitude Control for Rigid Bodies With Double-Level State Constraints
    Shao, Xiaodong
    Hu, Qinglei
    Shi, Yang
    Yi, Bowen
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (02) : 779 - 794
  • [24] Optimal Control under State Constraints
    Frankowska, Helene
    PROCEEDINGS OF THE INTERNATIONAL CONGRESS OF MATHEMATICIANS, VOL IV: INVITED LECTURES, 2010 : 2915 - 2942
  • [25] Fixed-time concurrent learning-based robust approximate optimal control
    Tan, Junkai
    Xue, Shuangsi
    Niu, Tiansen
    Qu, Kai
    Cao, Hui
    Chen, Badong
    NONLINEAR DYNAMICS, 2025
  • [26] Sampling-based Approximate Optimal Control Under Temporal Logic Constraints
    Fu, Jie
    Papusha, Ivan
    Topcu, Ufuk
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS WEEK) (HSCC' 17), 2017 : 227 - 235
  • [27] Dynamic Event-Triggered Robust Optimal Attitude Control of QUAV Using Reinforcement Learning
    Jin, Peng
    Ma, Qian
    Xu, Shengyuan
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (06) : 7798 - 7807
  • [28] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics
    Chen, Ci
    Modares, Hamidreza
    Xie, Kan
    Lewis, Frank L.
    Wan, Yan
    Xie, Shengli
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) : 4423 - 4438
  • [29] Approximate Optimized Backstepping Control of Uncertain Fractional-Order Nonlinear Systems Based on Reinforcement Learning
    Li, Dongdong
    Dong, Jiuxiang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (11) : 6723 - 6732
  • [30] NN-based reinforcement-learning optimal sliding mode control for drag-free and attitude of spacecraft with state constraints
    Jiang, Changwu
    Liu, Yuan
    ADVANCES IN SPACE RESEARCH, 2024, 73 (01) : 971 - 981