Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming

被引:118
作者
Liu, Derong [1 ]
Huang, Yuzhu [1 ]
Wang, Ding [1 ]
Wei, Qinglai [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
nonlinear observer; adaptive dynamic programming; neural network; uniformly ultimately bounded; nonlinear system; OPTIMAL TRACKING CONTROL; DISCRETE-TIME-SYSTEMS; CONTROL SCHEME; DESIGN;
D O I
10.1080/00207179.2013.790562
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an observer-based optimal control scheme is developed for unknown nonlinear systems using adaptive dynamic programming (ADP) algorithm. First, a neural-network (NN) observer is designed to estimate system states. Then, based on the observed states, a neuro-controller is constructed via ADP method to obtain the optimal control. In this design, two NN structures are used: a three-layer NN is used to construct the observer which can be applied to systems with higher degrees of nonlinearity and without a priori knowledge of system dynamics, and a critic NN is employed to approximate the value function. The optimal control law is computed using the critic NN and the observer NN. Uniform ultimate boundedness of the closed-loop system is guaranteed. The actor, critic, and observer structures are all implemented in real-time, continuously and simultaneously. Finally, simulation results are presented to demonstrate the effectiveness of the proposed control scheme.
引用
收藏
页码:1554 / 1566
页数:13
相关论文
共 44 条
[1]   A stable neural network-based observer with application to flexible-joint manipulators [J].
Abdollahi, F ;
Talebi, HA ;
Patel, RV .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (01) :118-129
[2]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[3]  
Ahmed MS, 2000, J INTELL FUZZY SYST, V9, P113
[4]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank .
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, :38-+
[5]  
[Anonymous], 1999, Neural network control of robot manipulators and nonlinear systems
[6]  
[Anonymous], 1996, Neuro-dynamic programming
[7]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[8]  
Bernard A. A., 1970, SIAM J APPL MATH, V18, P407
[9]   ADAPTIVE-CONTROL OF A CLASS OF NONLINEAR DISCRETE-TIME-SYSTEMS USING NEURAL NETWORKS [J].
CHEN, FC ;
KHALIL, HK .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1995, 40 (05) :791-801
[10]   Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence [J].
Dierks, Travis ;
Thumati, Balaje T. ;
Jagannathan, S. .
NEURAL NETWORKS, 2009, 22 (5-6) :851-860