Lyapunov stability-based control and identification of nonlinear dynamical systems using adaptive dynamic programming

Cited by: 24

Authors:

Kumar, Rajesh [1]

Srivastava, Smriti [1]

Gupta, J. R. P. [1]

Affiliations:

[1] Netaji Subhas Inst Technol, Div Instrumentat & Control Engn, Sect 3, New Delhi 110078, India

Source:

SOFT COMPUTING | 2017 / Vol. 21 / Issue 15

Keywords:

Adaptive dynamic programming; Nonlinear dynamical systems; Lyapunov stability; Identification and adaptive control; Gradient descent principle; NEURAL-NETWORKS; ALGORITHM; STABILIZATION

DOI:

10.1007/s00500-017-2500-3

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper presents a novel control and identification scheme based on adaptive dynamic programming for nonlinear dynamical systems. The control objective is to make the plant output follow a desired reference trajectory. The plant dynamics are assumed to be unknown; to handle the unknown dynamics, parameter variations, and disturbance signals, a separate neural network-based identification model is set up that operates in parallel with the plant and the control scheme. Weight update equations for all neural networks in the proposed scheme are derived using both the gradient descent (GD) method and the Lyapunov stability (LS) criterion. A stability proof of the LS-based algorithm is also given. Weight update equations derived using the LS criterion ensure the global stability of the system, whereas those obtained through the GD principle do not. Further, an adaptive learning rate is employed in the weight update equations instead of a constant one to speed up learning of the weight vectors. The LS- and GD-based weight update equations are also tested against parameter variation and a disturbance signal. The proposed scheme is applied to three nonlinear dynamical systems of different complexity, including trajectory control of a forced rigid pendulum. The results obtained with the LS method are found to be more accurate than those obtained with the GD-based method.


Pages: 4465-4480

Page count: 16
