Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

被引：47

作者：

Dong, Hongyang ^{[1
]}

Zhao, Xiaowei ^{[1
]}

Yang, Haoyang ^{[2
]}

机构：

[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY | 2021年 / 29卷 / 04期

基金：

英国工程与自然科学研究理事会;

关键词：

Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;

D O I：

10.1109/TCST.2020.3007401

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.

引用

页码：1664 / 1673

页数：10

共 50 条

[41] Approximate Optimal Stabilization Control of Servo Mechanisms based on Reinforcement Learning Scheme
Lv, Yongfeng
Ren, Xuemei
Hu, Shuangyi
Xu, Hao
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2019, 17 (10) : 2655 - 2665
[42] Feedback Control for Spacecraft Reorientation Under Attitude Constraints Via Convex Potentials
Lee, Unsik
Mesbahi, Mehran
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2014, 50 (04) : 2578 - 2592
[43] Optimal attitude consensus control for rigid spacecraft formation based on control Lyapunov functions
Wang, Hui
Zhang, Jinxiu
Zhou, Qingrui
Sun, Hui-Jie
Wu, Yu-Yao
IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (12) : 1720 - 1731
[44] Learning-Based Quantum Control for Optimal Pure State Manipulation
Chen, Anthony Siming
Herrmann, Guido
Vamvoudakis, Kyriakos G.
Vijayan, Jayadev
IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1319 - 1324
[45] Robust Learning-Based Predictive Control for Discrete-Time Nonlinear Systems With Unknown Dynamics and State Constraints
Zhang, Xinglong
Liu, Jiahang
Xu, Xin
Yu, Shuyou
Chen, Hong
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (12): : 7314 - 7327
[46] Improvement of hovering stability for UAVs under crosswinds via evolutionary learning-based optimal PID control
Yoon, Jaehyun
Kim, Mantae
Bang, Jinhong
Kim, Sanghoon
Doh, Jaehyeok
JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2025, : 2151 - 2162
[47] Unified attitude control for spacecraft under velocity and control constraints
Hu, Qinglei
Tan, Xiao
AEROSPACE SCIENCE AND TECHNOLOGY, 2017, 67 : 257 - 264
[48] Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning
Luo A.
Xiao W.-B.
Zhou Q.
Lu R.-Q.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (01): : 154 - 164
[49] Reinforcement Learning-based Distributed Secondary Optimal Control for Multi-Microgrids
Liu, Wei
Wen, Zhen
Shen, Yiping
Zhang, Zhifang
2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
[50] A Deep Reinforcement Learning-Based Optimal Transmission Control Method for Streaming Videos
Yang, Yawen
Xiao, Yuxuan
IEEE ACCESS, 2024, 12 : 53088 - 53098

← 1 2 3 4 5 →