Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints

Cited by: 11

Authors:

Mishra, Amardeep [1]

Ghosh, Satadal [1]

Affiliations:

[1] Indian Inst Technol, Dept Aerosp Engn, Chennai 600036, Tamil Nadu, India

Source:

NONLINEAR DYNAMICS | 2022 / Vol. 107 / Issue 03

Keywords:

Adaptive dynamic programming; Variable gain gradient descent; Optimal tracking control; Actuator constraints; APPROXIMATE OPTIMAL-CONTROL; ADAPTIVE OPTIMAL-CONTROL; NEURAL-NETWORK; STABILIZATION; ALGORITHM;

DOI:

10.1007/s11071-021-06908-z

Chinese Library Classification (CLC) number:

TH [Machinery and instrument industry]

Discipline classification code:

0802

Abstract:

In recent times, a variety of reinforcement learning (RL) algorithms have been proposed for the optimal tracking problem of continuous-time nonlinear systems with input constraints. Most of these algorithms are based on the notion of uniform ultimate boundedness (UUB) stability, in which higher learning rates are normally avoided in order to restrict oscillations in the state error to smaller values. However, this comes at the cost of a longer convergence time for the critic neural network weights. This paper addresses that problem by proposing a novel tuning law containing a variable gain gradient descent for the critic neural network, which adjusts the learning rate based on the Hamilton-Jacobi-Bellman (HJB) approximation error. By allowing a high learning rate, the proposed variable gain gradient descent tuning law can shorten the convergence time of the critic neural network weights. At the same time, it yields a tighter residual set to which the trajectories of the augmented system converge, leading to smaller oscillations in the state error. A tighter bound for UUB stability of the proposed update mechanism is proved. Numerical studies on a continuous-time nonlinear system are then presented to validate the variable gain gradient descent-based update law.
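
The idea of scaling the critic learning rate by the size of the HJB approximation error can be illustrated with a small sketch. This is not the tuning law from the paper, which the abstract does not state. It is a toy discrete-time step that assumes a critic residual linear in the weights, e = w·sigma + r, a normalized gradient, and a gain of the form base_rate * |e|^k clipped at max_gain. The names critic_step, base_rate, k and max_gain are all hypothetical.

```python
def hjb_residual(w, sigma, r):
    """Toy HJB-style residual, assumed linear in the critic weights: e = w . sigma + r."""
    return sum(wi * si for wi, si in zip(w, sigma)) + r


def critic_step(w, sigma, r, base_rate=0.5, k=1.0, max_gain=4.0):
    """One normalized gradient-descent step on 0.5 * e**2 with an error-dependent gain.

    Assumption (not taken from the paper): gain = base_rate * |e|**k, clipped at
    max_gain. Setting k = 0 recovers an ordinary fixed-gain step.
    """
    e = hjb_residual(w, sigma, r)
    gain = min(base_rate * abs(e) ** k, max_gain)
    # Normalization term (1 + sigma . sigma)**2 keeps the step bounded for large sigma.
    norm = (1.0 + sum(s * s for s in sigma)) ** 2
    new_w = [wi - gain * e * si / norm for wi, si in zip(w, sigma)]
    return new_w, e


if __name__ == "__main__":
    w0, sigma, r = [0.0, 0.0], [1.0, 2.0], -3.0
    for k in (0.0, 1.0):
        w1, e0 = critic_step(w0, sigma, r, k=k)
        e1 = hjb_residual(w1, sigma, r)
        print(f"k={k}: residual before={e0:.4f}, after={e1:.4f}")
```

In this toy setting the variable gain takes a larger step than the fixed gain whenever |e| > 1 and a smaller one once |e| < 1, which mirrors the abstract's claim of faster convergence together with smaller steady-state oscillation. The clip at max_gain is a choice made here so that a single step on the same sample never increases the residual magnitude.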


Pages: 2195-2214

Page count: 20

