Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints

被引：169

作者：

Yang, Xiong ^{[1
]}

Liu, Derong ^{[1
]}

Wang, Ding ^{[1
]}

机构：

[1] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Inst Automat, Beijing 100190, Peoples R China

来源：

INTERNATIONAL JOURNAL OF CONTROL | 2014年 / 87卷 / 03期

基金：

中国国家自然科学基金;

关键词：

adaptive control; input constraints; neural networks; optimal control; reinforcement learning; ARCHITECTURE; ALGORITHM; DESIGN;

D O I：

10.1080/00207179.2013.848292

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem of constrained-input continuous-time nonlinear systems in the presence of nonlinearities with unknown structures. Two different types of neural networks (NNs) are employed to approximate the Hamilton-Jacobi-Bellman equation. That is, an recurrent NN is constructed to identify the unknown dynamical system, and two feedforward NNs are used as the actor and the critic to approximate the optimal control and the optimal cost, respectively. Based on this framework, the action NN and the critic NN are tuned simultaneously, without the requirement for the knowledge of system drift dynamics. Moreover, by using Lyapunov's direct method, the weights of the action NN and the critic NN are guaranteed to be uniformly ultimately bounded, while keeping the closed-loop system stable. To demonstrate the effectiveness of the present approach, simulation results are illustrated.

引用

页码：553 / 566

页数：14

共 50 条

[31] Adaptive output feedback reinforcement learning control for continuous time switched stochastic nonlinear systems with unknown control coefficients and full-state constraints [J].

Li, Hongyao ;

Wang, Fuli .

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (02) :332-354

[32] Optimal Adaptive Control of Nonlinear Continuous-time Systems in Strict Feedback Form with Unknown Internal Dynamics [J].

Zargarzadeh, H. ;

Dierks, T. ;

Jagannathan, S. .

2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, :4127-4132

[33] Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems [J].

Vrabie, Draguna ;

Lewis, Frank .

NEURAL NETWORKS, 2009, 22 (03) :237-246

[34] General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems [J].

Xiao, Geyang ;

Zhang, Huaguang ;

Luo, Yanhong ;

Qu, Qiuxia .

NEUROCOMPUTING, 2017, 245 :114-123

[35] Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach [J].

Liu, Derong ;

Wang, Ding ;

Li, Hongliang .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) :418-428

[36] Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances [J].

Yang, Xiong ;

He, Haibo .

NEURAL NETWORKS, 2018, 99 :19-30

[37] Distributed optimal control for continuous-time nonaffine nonlinear interconnected systems [J].

Farzanegan, Behzad ;

Suratgar, Amir Abolfazl ;

Menhaj, Mohammad Bagher ;

Zamani, Mohsen .

INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (12) :3462-3476

[38] Observer-based Adaptive Optimal Control for Unknown Singularly Perturbed Nonlinear Systems With Input Constraints [J].

Fu, Zhijun ;

Xie, Wenfang ;

Rakheja, Subhash ;

Na, Jing .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2017, 4 (01) :48-57

[39] Adaptive optimal control of unknown nonlinear systems with different time scales [J].

Fu, Zhi-Jun ;

Xie, Wen-Fang ;

Rakheja, Subhash ;

Zheng, Dong-Dong .

NEUROCOMPUTING, 2017, 238 :179-190

[40] Adaptive Fixed-Time Optimal Formation Control for Uncertain Nonlinear Multiagent Systems Using Reinforcement Learning [J].

Wang, Ping ;

Yu, Chengpu ;

Lv, Maolong ;

Cao, Jinde .

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02) :1729-1743

← 1 2 3 4 5 →