Finite-horizon optimal control of unknown nonlinear time-delay systems

被引:10
作者
Cui, Xiaohong [1 ,2 ]
Zhang, Huaguang [1 ]
Luo, Yanhong [1 ]
Jiang, He [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Mudanjiang Normal Univ, Inst Math Sci, Mudanjiang 157011, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal control; Neural network; Time-delay; Finite-horizon; HJB equation; ADAPTIVE OPTIMAL-CONTROL; OPTIMAL-CONTROL DESIGN; POLICY ITERATION; LINEAR-SYSTEMS; NEURAL-CONTROL; DYNAMICS;
D O I
10.1016/j.neucom.2017.01.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a neural-network (NN)-based online off-policy algorithm to optimize a class of nonlinear continuous-time time-delay systems during finite time horizon. The online off-policy algorithm is used to learn the two-stage solution to the time-varying Hamilton-Jacobi-Bellman (HJB) equation without requiring the knowledge of the time-delay system dynamics. The algorithm is implemented by using an actor-critic NN structure with time-varying activation functions. The weights of the two NNs are tuned simultaneously in real-time by considering both the residual error and the terminal error. Two simulation examples demonstrate the applicability of the proposed algorithm. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:277 / 285
页数:9
相关论文
共 45 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
Bokov GV., 2011, J. Math. Sci, V172, P623, DOI [10.1007/s10958-011-0208-y, DOI 10.1007/S10958-011-0208-Y]
[3]   Stable direct adaptive neural controller of nonlinear systems based on single auto-tuning neuron [J].
Chang, WD ;
Hwang, RC ;
Hsieh, JG .
NEUROCOMPUTING, 2002, 48 :541-554
[4]   ANALYSIS AND PARAMETER-IDENTIFICATION OF TIME-DELAY SYSTEMS VIA POLYNOMIAL SERIES [J].
CHEN, CK ;
YANG, CY .
INTERNATIONAL JOURNAL OF CONTROL, 1987, 46 (01) :111-127
[5]   A neural network solution for fixed-final time optimal control of nonlinear systems [J].
Cheng, Tao ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
AUTOMATICA, 2007, 43 (03) :482-490
[6]   Adaptive neural control of stochastic nonlinear systems with multiple time-varying delays and input saturation [J].
Cui, Guozeng ;
Jiao, Ticao ;
Wei, Yunliang ;
Song, Gongfei ;
Chu, Yuming .
NEURAL COMPUTING & APPLICATIONS, 2014, 25 (3-4) :779-791
[7]   Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update [J].
Dierks, Travis ;
Jagannathan, Sarangapani .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1118-1129
[8]   NN-based output feedback adaptive variable structure control for a class of non-affine nonlinear systems: A nonseparation principle design [J].
Du, Hongbin ;
Chen, Xiaochuan .
NEUROCOMPUTING, 2009, 72 (7-9) :2009-2016
[9]   Adaptive neural control with intercepted adaptation for time-delay saturated nonlinear systems [J].
Gao, Shigen ;
Ning, Bin ;
Dong, Hairong .
NEURAL COMPUTING & APPLICATIONS, 2015, 26 (08) :1849-1857
[10]   Adaptive neural network control of nonlinear systems with unknown time delays [J].
Ge, SS ;
Hong, F ;
Lee, TH .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) :2004-2010