Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints

Cited by: 169
Authors
Yang, Xiong [1]
Liu, Derong [1]
Wang, Ding [1]
Affiliations
[1] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Inst Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
adaptive control; input constraints; neural networks; optimal control; reinforcement learning; ARCHITECTURE; ALGORITHM; DESIGN;
DOI
10.1080/00207179.2013.848292
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
In this paper, an adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem of constrained-input continuous-time nonlinear systems in the presence of nonlinearities with unknown structures. Two different types of neural networks (NNs) are employed to approximate the solution of the Hamilton-Jacobi-Bellman equation. That is, a recurrent NN is constructed to identify the unknown dynamical system, and two feedforward NNs are used as the actor and the critic to approximate the optimal control and the optimal cost, respectively. Based on this framework, the action NN and the critic NN are tuned simultaneously, without requiring knowledge of the system drift dynamics. Moreover, by using Lyapunov's direct method, the weights of the action NN and the critic NN are guaranteed to be uniformly ultimately bounded, while keeping the closed-loop system stable. Simulation results are presented to demonstrate the effectiveness of the approach.
Pages: 553-566
Number of pages: 14