Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints

被引：11

作者：

Mishra, Amardeep ^{[1
]}

Ghosh, Satadal ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Aerosp Engn, Chennai 600036, Tamil Nadu, India

来源：

NONLINEAR DYNAMICS | 2022年 / 107卷 / 03期

关键词：

Adaptive dynamic programming; Variable gain gradient descent; Optimal tracking control; Actuator constraints; APPROXIMATE OPTIMAL-CONTROL; ADAPTIVE OPTIMAL-CONTROL; NEURAL-NETWORK; STABILIZATION; ALGORITHM;

D O I：

10.1007/s11071-021-06908-z

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

In recent times, a variety of reinforcement learning (RL) algorithms have been proposed for optimal tracking problem of continuous time nonlinear systems with input constraints. Most of these algorithms are based on the notion of uniform ultimate boundedness (UUB) stability, in which normally higher learning rates are avoided in order to restrict oscillations in state error to smaller values. However, this comes at the cost of higher convergence time of critic neural network weights. This paper addresses that problem by proposing a novel tuning law containing a variable gain gradient descent for critic neural network that can adjust the learning rate based on Hamilton-Jacobi-Bellman (HJB) approximation error. By allowing high learning rate the proposed variable gain gradient descent tuning law could improve the convergence time of critic neural network weights. Simultaneously, it also results in tighter residual set, on which trajectories of augmented system converge to, leading to smaller oscillations in state error. A tighter bound for UUB stability of the proposed update mechanism is proved. Numerical studies are then furnished to validate the variable gain gradient descent-based update law presented in this paper on a continuous time nonlinear system.

引用

页码：2195 / 2214

页数：20

共 50 条

[1] Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints
Amardeep Mishra
Satadal Ghosh
Nonlinear Dynamics, 2022, 107 : 2195 - 2214
[2] H∞ tracking control via variable gain gradient descent-based integral reinforcement learning for unknown continuous time non-linear system
Mishra, Amardeep
Ghosh, Satadal
IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (20): : 3476 - 3489
[3] Integral reinforcement learning-based optimal tracking control for uncertain nonlinear systems under input constraint and specified performance constraints
Chang, Ru
Liu, Zhi-Meng
Li, Xiao-Bin
Sun, Chang-Yin
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8802 - 8824
[4] Event-Triggered Optimal Tracking Control for Uncertain Nonlinear System Based on Reinforcement Learning
Wang, Yuanhao
Bai, Weiwei
2024 14th International Conference on Information Science and Technology, ICIST 2024, 2024, : 619 - 625
[5] Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning
Luo A.
Xiao W.-B.
Zhou Q.
Lu R.-Q.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (01): : 154 - 164
[6] An iterative gradient descent-based reinforcement learning policy for active control of structural vibrations
Panda, Jagajyoti
Chopra, Mudit
Matsagar, Vasant
Chakraborty, Souvik
COMPUTERS & STRUCTURES, 2024, 290
[7] Safe optimal robust control of nonlinear systems with asymmetric input constraints using reinforcement learning
Zhang, Dehua
Wang, Yuchen
Jiang, Kaijun
Liang, Linlin
APPLIED INTELLIGENCE, 2024, 54 (01) : 1 - 13
[8] Safe optimal robust control of nonlinear systems with asymmetric input constraints using reinforcement learning
Dehua Zhang
Yuchen Wang
Kaijun Jiang
Linlin Liang
Applied Intelligence, 2024, 54 : 1 - 13
[9] Robust relatively optimal trajectory tracking control for a class of uncertain nonlinear control affine systems with state and input constraints
Nidya, M., V
Mija, S. J.
Jeevamma, Jacob
NONLINEAR DYNAMICS, 2022, 110 (04) : 3513 - 3534
[10] Robust relatively optimal trajectory tracking control for a class of uncertain nonlinear control affine systems with state and input constraints
M. V. Nidya
S. J. Mija
Jacob Jeevamma
Nonlinear Dynamics, 2022, 110 : 3513 - 3534

← 1 2 3 4 5 →