Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints

Cited by: 11
Authors
Mishra, Amardeep [1]
Ghosh, Satadal [1]
Affiliations
[1] Indian Inst Technol, Dept Aerosp Engn, Chennai 600036, Tamil Nadu, India
Keywords
Adaptive dynamic programming; Variable gain gradient descent; Optimal tracking control; Actuator constraints; Approximate optimal control; Adaptive optimal control; Neural network; Stabilization; Algorithm
DOI
10.1007/s11071-021-06908-z
Chinese Library Classification number
TH [Machinery and instrument industry]
Discipline classification code
0802
Abstract
In recent times, a variety of reinforcement learning (RL) algorithms have been proposed for the optimal tracking problem of continuous-time nonlinear systems with input constraints. Most of these algorithms are based on the notion of uniform ultimate boundedness (UUB) stability, in which higher learning rates are normally avoided in order to restrict oscillations in the state error to smaller values. However, this comes at the cost of a longer convergence time for the critic neural network weights. This paper addresses that problem by proposing a novel tuning law containing a variable gain gradient descent for the critic neural network, which can adjust the learning rate based on the Hamilton-Jacobi-Bellman (HJB) approximation error. By allowing a high learning rate, the proposed variable gain gradient descent tuning law can shorten the convergence time of the critic neural network weights. At the same time, it results in a tighter residual set to which the trajectories of the augmented system converge, leading to smaller oscillations in the state error. A tighter bound for UUB stability of the proposed update mechanism is proved. Numerical studies on a continuous-time nonlinear system are then furnished to validate the variable gain gradient descent-based update law presented in this paper.
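The idea of a gain that scales with the approximation error can be illustrated with a minimal discrete-time sketch. The abstract does not give the paper's tuning law, so everything below is an assumption for illustration only: the gain schedule `variable_gain`, its parameters `base_gain`, `exponent`, and `gain_cap`, and the linear-in-weights residual used in `critic_step` are hypothetical and are not the authors' continuous-time update law.

```python
def variable_gain(residual, base_gain=0.5, exponent=1.0, gain_cap=1.5):
    """Hypothetical gain schedule: grows with |residual|, capped at gain_cap.

    The cap keeps the discrete step below stable (gain < 2 for a
    normalized gradient step). The paper's actual schedule may differ.
    """
    return min(base_gain * (1.0 + abs(residual) ** exponent), gain_cap)


def critic_step(weights, features, target, base_gain=0.5):
    """One normalized gradient-descent step on E = 0.5 * residual**2.

    Assumes a residual that is linear in the critic weights:
    residual = weights . features - target.
    Returns the updated weights and the residual before the update.
    """
    residual = sum(w * f for w, f in zip(weights, features)) - target
    norm = 1.0 + sum(f * f for f in features)
    gain = variable_gain(residual, base_gain)
    new_weights = [w - gain * residual * f / norm
                   for w, f in zip(weights, features)]
    return new_weights, residual


def residual_of(weights, features, target):
    """Residual of the toy critic for the given sample."""
    return sum(w * f for w, f in zip(weights, features)) - target


if __name__ == "__main__":
    w = [0.0, 0.0]
    phi, y = [1.0, 2.0], 3.0
    for _ in range(5):
        w, e = critic_step(w, phi, y)
        print(f"residual before step: {e:+.4f}")
```

In this toy setting, a large residual produces a larger step and a small residual falls back to the base gain, which mirrors the trade-off the abstract describes between fast weight convergence and small steady-state oscillation.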
Pages: 2195 - 2214
Number of pages: 20
Related papers
50 in total
  • [1] Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints
    Amardeep Mishra
    Satadal Ghosh
    Nonlinear Dynamics, 2022, 107 : 2195 - 2214
  • [2] H∞ tracking control via variable gain gradient descent-based integral reinforcement learning for unknown continuous time non-linear system
    Mishra, Amardeep
    Ghosh, Satadal
    IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (20) : 3476 - 3489
  • [3] Integral reinforcement learning-based optimal tracking control for uncertain nonlinear systems under input constraint and specified performance constraints
    Chang, Ru
    Liu, Zhi-Meng
    Li, Xiao-Bin
    Sun, Chang-Yin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8802 - 8824
  • [4] Event-Triggered Optimal Tracking Control for Uncertain Nonlinear System Based on Reinforcement Learning
    Wang, Yuanhao
    Bai, Weiwei
    2024 14th International Conference on Information Science and Technology, ICIST 2024, 2024 : 619 - 625
  • [5] Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning
    Luo A.
    Xiao W.-B.
    Zhou Q.
    Lu R.-Q.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (01) : 154 - 164
  • [6] An iterative gradient descent-based reinforcement learning policy for active control of structural vibrations
    Panda, Jagajyoti
    Chopra, Mudit
    Matsagar, Vasant
    Chakraborty, Souvik
    COMPUTERS & STRUCTURES, 2024, 290
  • [7] Safe optimal robust control of nonlinear systems with asymmetric input constraints using reinforcement learning
    Zhang, Dehua
    Wang, Yuchen
    Jiang, Kaijun
    Liang, Linlin
    APPLIED INTELLIGENCE, 2024, 54 (01) : 1 - 13
  • [8] Safe optimal robust control of nonlinear systems with asymmetric input constraints using reinforcement learning
    Dehua Zhang
    Yuchen Wang
    Kaijun Jiang
    Linlin Liang
    Applied Intelligence, 2024, 54 : 1 - 13
  • [9] Robust relatively optimal trajectory tracking control for a class of uncertain nonlinear control affine systems with state and input constraints
    Nidya, M. V.
    Mija, S. J.
    Jeevamma, Jacob
    NONLINEAR DYNAMICS, 2022, 110 (04) : 3513 - 3534
  • [10] Robust relatively optimal trajectory tracking control for a class of uncertain nonlinear control affine systems with state and input constraints
    M. V. Nidya
    S. J. Mija
    Jacob Jeevamma
    Nonlinear Dynamics, 2022, 110 : 3513 - 3534