Finite-horizon optimal control of unknown nonlinear time-delay systems

被引：10

作者：

Cui, Xiaohong ^{[1
,2
]}

Zhang, Huaguang ^{[1
]}

Luo, Yanhong ^{[1
]}

Jiang, He ^{[1
]}

机构：

[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China

[2] Mudanjiang Normal Univ, Inst Math Sci, Mudanjiang 157011, Heilongjiang, Peoples R China

来源：

NEUROCOMPUTING | 2017年 / 238卷

基金：

中国国家自然科学基金;

关键词：

Optimal control; Neural network; Time-delay; Finite-horizon; HJB equation; ADAPTIVE OPTIMAL-CONTROL; OPTIMAL-CONTROL DESIGN; POLICY ITERATION; LINEAR-SYSTEMS; NEURAL-CONTROL; DYNAMICS;

D O I：

10.1016/j.neucom.2017.01.063

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a neural-network (NN)-based online off-policy algorithm to optimize a class of nonlinear continuous-time time-delay systems during finite time horizon. The online off-policy algorithm is used to learn the two-stage solution to the time-varying Hamilton-Jacobi-Bellman (HJB) equation without requiring the knowledge of the time-delay system dynamics. The algorithm is implemented by using an actor-critic NN structure with time-varying activation functions. The weights of the two NNs are tuned simultaneously in real-time by considering both the residual error and the terminal error. Two simulation examples demonstrate the applicability of the proposed algorithm. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：277 / 285

页数：9

共 45 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].

Abu-Khalaf, M ;

Lewis, FL .

AUTOMATICA, 2005, 41 (05) :779-791

[2]

Bokov GV., 2011, J. Math. Sci, V172, P623, DOI [10.1007/s10958-011-0208-y, DOI 10.1007/S10958-011-0208-Y]

[3] Stable direct adaptive neural controller of nonlinear systems based on single auto-tuning neuron [J].

Chang, WD ;

Hwang, RC ;

Hsieh, JG .

NEUROCOMPUTING, 2002, 48 :541-554

[4] ANALYSIS AND PARAMETER-IDENTIFICATION OF TIME-DELAY SYSTEMS VIA POLYNOMIAL SERIES [J].

CHEN, CK ;

YANG, CY .

INTERNATIONAL JOURNAL OF CONTROL, 1987, 46 (01) :111-127

[5] A neural network solution for fixed-final time optimal control of nonlinear systems [J].

Cheng, Tao ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

AUTOMATICA, 2007, 43 (03) :482-490

[6] Adaptive neural control of stochastic nonlinear systems with multiple time-varying delays and input saturation [J].

Cui, Guozeng ;

Jiao, Ticao ;

Wei, Yunliang ;

Song, Gongfei ;

Chu, Yuming .

NEURAL COMPUTING & APPLICATIONS, 2014, 25 (3-4) :779-791

[7] Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update [J].

Dierks, Travis ;

Jagannathan, Sarangapani .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1118-1129

[8] NN-based output feedback adaptive variable structure control for a class of non-affine nonlinear systems: A nonseparation principle design [J].

Du, Hongbin ;

Chen, Xiaochuan .

NEUROCOMPUTING, 2009, 72 (7-9) :2009-2016

[9] Adaptive neural control with intercepted adaptation for time-delay saturated nonlinear systems [J].

Gao, Shigen ;

Ning, Bin ;

Dong, Hairong .

NEURAL COMPUTING & APPLICATIONS, 2015, 26 (08) :1849-1857

[10] Adaptive neural network control of nonlinear systems with unknown time delays [J].

Ge, SS ;

Hong, F ;

Lee, TH .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) :2004-2010

← 1 2 3 4 5 →