Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints

被引：38

作者：

Mohammadi, Mehdi ^{[1
]}

Arefi, Mohammad Mehdi ^{[1
]}

Setoodeh, Peyman ^{[1
]}

Kaynak, Okyay ^{[2
]}

机构：

[1] Shiraz Univ, Sch Elect & Comp Engn, Dept Power & Control Engn, Shiraz, Iran

[2] Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey

来源：

INFORMATION SCIENCES | 2021年 / 554卷

关键词：

Reinforcement learning; Tracking control; Mismatched external disturbance; Time delay; Disturbance observer; Critic neural network; Input constraints;

D O I：

10.1016/j.ins.2020.11.057

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article investigates the design of an optimal tracking controller for a class of nonlinear continuous-time systems with time-delay, mismatched external disturbances and input constraints. The technique of integral reinforcement learning (IRL) is utilized for determining the control input that optimizes an objective function. To enable the usage of an estimation of the external disturbances in the recursive objective function, a disturbance observer is designed. For the derivation of the optimal control input, a Hamilton-JacobiBellman (HJB) equation is employed and solved using the iterative IRL algorithm. The proposed approach guarantees that in the presence of mismatched disturbances, the output of the time-delayed nonlinear system tracks the desired trajectory with bounded error. A critic neural network is designed for the implementation of the proposed approach. The efficiency of the method is illustrated by a simulation example. (C) 2020 Elsevier Inc. All rights reserved.

引用

页码：84 / 98

页数：15

共 50 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].

Abu-Khalaf, M ;

Lewis, FL .

AUTOMATICA, 2005, 41 (05) :779-791

[2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].

Bhasin, S. ;

Kamalapurkar, R. ;

Johnson, M. ;

Vamvoudakis, K. G. ;

Lewis, F. L. ;

Dixon, W. E. .

AUTOMATICA, 2013, 49 (01) :82-92

[3] Novel adaptive neural control design for nonlinear MIMO time-delay systems [J].

Chen, Bing ;

Liu, Xiaoping ;

Liu, Kefu ;

Lin, Chong .

AUTOMATICA, 2009, 45 (06) :1554-1560

[4] A nonlinear disturbance observer for robotic manipulators [J].

Chen, WH ;

Ballance, DJ ;

Gawthrop, PJ ;

O'Reilly, J .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2000, 47 (04) :932-938

[5]

Fridman E., 2014, Systems and Control Foundations and Applications, DOI DOI 10.1007/978-3-319-09393-2

[6] Approximation-based control of nonlinear MIMO time-delay systems [J].

Ge, S. S. ;

Tee, K. P. .

AUTOMATICA, 2007, 43 (01) :31-43

[7] Adaptive neural network control of nonlinear systems with unknown time delays [J].

Ge, SS ;

Hong, F ;

Lee, TH .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) :2004-2010

[8] Adaptive tracking of nonlinear systems with non-symmetric dead-zone input [J].

Ibrir, Salim ;

Xie, Wen Fang ;

Su, Chun-Yi .

AUTOMATICA, 2007, 43 (03) :522-530

[9] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].

Jiang, Yu ;

Jiang, Zhong-Ping .

AUTOMATICA, 2012, 48 (10) :2699-2704

[10] Approximate optimal trajectory tracking for continuous-time nonlinear systems [J].

Kamalapurkar, Rushikesh ;

Dinh, Huyen ;

Bhasin, Shubhendu ;

Dixon, Warren E. .

AUTOMATICA, 2015, 51 :40-48

← 1 2 3 4 5 →