Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning

被引:75
作者
Yang, Xiong [1 ]
Liu, Derong [2 ]
Luo, Biao [3 ]
Li, Chao [3 ]
机构
[1] Tianjin Univ, Sch Elect Engn & Automat, Tianjin 300072, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; Input constraint; Neural networks; Optimal control; Reinforcement learning; Robust control; DYNAMIC-PROGRAMMING ALGORITHM; APPROXIMATE OPTIMAL-CONTROL; ZERO-SUM GAME; TIME-SYSTEMS; EXPERIENCE REPLAY; TRACKING CONTROL; CONTROL SCHEME; DESIGN; ARCHITECTURE;
D O I
10.1016/j.ins.2016.07.051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a data-based robust adaptive control methodology for a class of nonlinear constrained-input systems with completely unknown dynamics. By introducing a value function for the nominal system, the robust control problem is transformed into a constrained optimal control problem. Due to the unavailability of system dynamics, a data-based integral reinforcement learning (RL) algorithm is developed to solve the constrained optimal control problem. Based on the present algorithm, the value function and the control policy*can be updated simultaneously using only system data. The convergence of the developed algorithm is proved via an established equivalence relationship. To implement the integral RL algorithm, an actor neural network (NN) and a critic NN are separately utilized to approximate the control policy and the value function, and the least squares method is employed to estimate the unknown parameters. By using Lyapunov's direct method, the obtained approximate optimal control is verified to guarantee the unknown nonlinear system to be stable in the sense of uniform ultimate boundedness. Two examples are provided to demonstrate the effectiveness and applicability of the theoretical results. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:731 / 747
页数:17
相关论文
共 75 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Bounded robust control of nonlinear systems using neural network-based HJB solution [J].
Adhyaru, Dipak M. ;
Kar, I. N. ;
Gopal, M. .
NEURAL COMPUTING & APPLICATIONS, 2011, 20 (01) :91-103
[3]  
[Anonymous], 1999, Neural network control of robot manipulators and nonlinear systems
[4]  
[Anonymous], 1974, Ph.D. Thesis
[5]  
Bernhard P., 1995, H-optimal control and related minimax design problems, V2nd
[6]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[7]   Neural-approximation-based robust adaptive control of flexible air-breathing hypersonic vehicles with parametric uncertainties and control input constraints [J].
Bu, Xiangwei ;
Wu, Xiaoyan ;
Wei, Daozhi ;
Huang, Jiaqi .
INFORMATION SCIENCES, 2016, 346 :29-43
[8]   Guaranteed transient performance based control with input saturation for near space vehicles [J].
Chen Mou ;
Wu QinXian ;
Jiang ChangSheng ;
Jiang Bin .
SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (05) :1-12
[9]   Online finite-horizon optimal learning algorithm for nonzero-sum games with partially unknown dynamics and constrained inputs [J].
Cui, Xiaohong ;
Zhang, Huaguang ;
Luo, Yanhong ;
Zu, Peifu .
NEUROCOMPUTING, 2016, 185 :37-44
[10]   Adaptive Control of Uncertain Nonaffine Nonlinear Systems With Input Saturation Using Neural Networks [J].
Esfandiari, Kasra ;
Abdollahi, Farzaneh ;
Talebi, Heidar Ali .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) :2311-2322