Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

被引:47
|
作者
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Yang, Haoyang [2 ]
机构
[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;
D O I
10.1109/TCST.2020.3007401
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.
引用
收藏
页码:1664 / 1673
页数:10
相关论文
共 50 条
  • [31] Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle
    Wang, Ning
    Gao, Ying
    Zhao, Hong
    Ahn, Choon Ki
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3034 - 3045
  • [32] Reinforcement learning-based optimal control of uncertain nonlinear systems
    Garcia, Miguel
    Dong, Wenjie
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2839 - 2850
  • [33] Learning-Based Fault-Tolerant Optimal Control for Spacecraft Constrained Reorientation
    Tian, Yuan
    Song, Bin
    Shao, Xiaodong
    Hu, Qinglei
    2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 805 - 810
  • [34] Distributed Approximate Optimal Attitude Control of Combined Spacecraft With Parameters Identification
    Wang, Menglei
    Wu, Baolin
    Geng, Yunhai
    Wang, Danwei
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 1784 - 1797
  • [35] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
    Kim, Yeonsoo
    Kim, Jong Woo
    AICHE JOURNAL, 2022, 68 (05)
  • [36] Reinforcement Learning-based Integrated Position and Attitude Control Method towards Morphing Flight Vehicles
    Lu, Kunfeng
    Jia, Chenhui
    Huang, Xu
    Liu, Xiaodong
    Liu, Jiarun
    Wang, Zhaolei
    Yuhang Xuebao/Journal of Astronautics, 2024, 45 (07): : 1100 - 1110
  • [37] Online Continual Safe Reinforcement Learning-based Optimal Control of Mobile Robot Formations
    Ganie, Irfan
    Jagannathan, S.
    2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 519 - 524
  • [38] Attitude Tracking Control With Constraints for Rigid Spacecraft Based on Control Barrier Lyapunov Functions
    Wu, Yu-Yao
    Sun, Hui-Jie
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (03) : 2053 - 2062
  • [39] Attitude Control of Hypersonic Vehicle based on Reinforcement Learning
    Liu, Jingwen
    Fan, Hongdong
    Fan, Yonghua
    Cai, Guangbin
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 1503 - 1507
  • [40] Approximate Optimal Stabilization Control of Servo Mechanisms based on Reinforcement Learning Scheme
    Yongfeng Lv
    Xuemei Ren
    Shuangyi Hu
    Hao Xu
    International Journal of Control, Automation and Systems, 2019, 17 : 2655 - 2665