Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints (vol 87, pg 553, 2014)

被引:0
作者
Yang, Xiong
Liu, Derong
Wang, Ding
机构
[1] State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
基金
中国国家自然科学基金;
关键词
adaptive control; input constraints; neural networks; optimal control; reinforcement learning;
D O I
10.1080/00207179.2013.862419
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem of constrained-input continuous-time nonlinear systems in the presence of nonlinearities with unknown structures. Two different types of neural networks (NNs) are employed to approximate the Hamilton-Jacobi-Bellman equation. That is, an recurrent NN is constructed to identify the unknown dynamical system, and two feedforward NNs are used as the actor and the critic to approximate the optimal control and the optimal cost, respectively. Based on this framework, the action NN and the critic NN are tuned simultaneously, without the requirement for the knowledge of system drift dynamics. Moreover, by using Lyapunov's direct method, the weights of the action NN and the critic NN are guaranteed to be uniformly ultimately bounded, while keeping the closed-loop system stable. To demonstrate the effectiveness of the present approach, simulation results are illustrated.
引用
收藏
页码:I / I
页数:1
相关论文
共 50 条
[31]   Adaptive output feedback reinforcement learning control for continuous time switched stochastic nonlinear systems with unknown control coefficients and full-state constraints [J].
Li, Hongyao ;
Wang, Fuli .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (02) :332-354
[32]   Data-Based Self-Learning Optimal Control for Continuous-Time Unknown Nonlinear Systems With Disturbance [J].
Wei, Qinglai ;
Liu, Derong ;
Song, Ruizhuo ;
Yan, Pengfei .
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, :6633-6638
[33]   Constrained Reinforcement Learning-Based Closed-Loop Reference Model for Optimal Tracking Control of Unknown Continuous-Time Systems [J].
Zhang, Haoran ;
Zhao, Chunhui ;
Ding, Jinliang .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) :7312-7324
[34]   General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems [J].
Xiao, Geyang ;
Zhang, Huaguang ;
Luo, Yanhong ;
Qu, Qiuxia .
NEUROCOMPUTING, 2017, 245 :114-123
[35]   Reinforcement learning-based event-triggered optimal control for unknown nonlinear systems with input delay [J].
Chen, Xiangyu ;
Sun, Weiwei ;
Gao, Xinci ;
Li, Yongshu .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (07) :4844-4863
[36]   Integral reinforcement learning-based optimal output feedback control for linear continuous-time systems with input delay [J].
Wang, Gao ;
Luo, Biao ;
Xue, Shan .
NEUROCOMPUTING, 2021, 460 :31-38
[37]   Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach [J].
Liu, Derong ;
Wang, Ding ;
Li, Hongliang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) :418-428
[38]   Distributed optimal control for continuous-time nonaffine nonlinear interconnected systems [J].
Farzanegan, Behzad ;
Suratgar, Amir Abolfazl ;
Menhaj, Mohammad Bagher ;
Zamani, Mohsen .
INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (12) :3462-3476
[39]   Event-Triggered Optimal Adaptive Control of Partially Unknown Linear Continuous-Time Systems With State Delay [J].
Moghadam, Rohollah ;
Narayanan, Vignesh ;
Jagannathan, Sarangapani .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (06) :3324-3337
[40]   Data-Driven Adaptive Dynamic Programming for Optimal Control of Continuous-Time Multicontroller Systems With Unknown Dynamics [J].
Zhao, Jingang .
IEEE ACCESS, 2022, 10 :41503-41511