Finite-Horizon H∞ Tracking Control for Unknown Nonlinear Systems With Saturating Actuators

Cited by: 85

Authors:

Zhang, Huaguang [1]

Cui, Xiaohong [1,2]

Luo, Yanhong [1]

Jiang, He [1]

Affiliations:

[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China

[2] Mudanjiang Normal Univ, Inst Math Sci, Mudanjiang 157011, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2018, Vol. 29, No. 04

Keywords:

H-infinity tracking; finite-horizon; Hamilton-Jacobi-Isaacs (HJI) equation; model-free; neural network (NN); online learning; TIME-OPTIMAL-CONTROL; POLICY UPDATE ALGORITHM; ITERATION; DESIGN;

DOI:

10.1109/TNNLS.2017.2669099

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, a neural network (NN)-based online model-free integral reinforcement learning algorithm is developed to solve the finite-horizon H-infinity optimal tracking control problem for completely unknown nonlinear continuous-time systems with disturbance and saturating actuators (constrained control input). An augmented system is constructed with the tracking error system and the command generator system. A time-varying Hamilton-Jacobi-Isaacs (HJI) equation is formulated for the augmented problem, which is extremely difficult or impossible to solve due to its time-dependent property and nonlinearity. Then, an actor-critic-disturbance NN structure-based scheme is proposed to learn the time-varying solution to the HJI equation in real time without using the knowledge of system dynamics. Since the solution to the HJI equation is time-dependent, the form of NNs representation with constant weights and time-dependent activation functions is considered. Furthermore, an extra error is incorporated in order to satisfy the terminal constraints in the weight update law. Convergence and stability proofs are given based on the Lyapunov theory for nonautonomous systems. Two simulation examples are provided to demonstrate the effectiveness of the designed algorithm.
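The abstract's idea of representing a time-dependent solution with constant weights and time-dependent activation functions can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the basis `phi` (quadratic terms in the state, repeated with a factor of the time-to-go), the weight vector `w`, the terminal cost `psi`, and the horizon `T` are all hypothetical. The sketch only shows how a fixed weight vector can yield a value estimate that varies with time, and how a terminal error of the kind the abstract mentions might be measured.

```python
import numpy as np

def phi(x, tau):
    """Hypothetical time-dependent basis: quadratic terms in x,
    plus the same terms scaled by the time-to-go tau = T - t."""
    x1, x2 = x
    base = np.array([x1 * x1, x1 * x2, x2 * x2])
    return np.concatenate([base, tau * base])

def value_estimate(w, x, t, T):
    """Approximate value w^T phi(x, T - t) with constant weights w."""
    return float(w @ phi(x, T - t))

def terminal_error(w, x_T, T, psi):
    """Mismatch between the estimate at t = T and the terminal cost psi."""
    return value_estimate(w, x_T, T, T) - psi(x_T)

# Illustrative numbers only (not from the paper).
w = np.array([1.0, 0.0, 1.0, 0.5, 0.0, 0.5])
psi = lambda x: float(x @ x)      # assumed terminal cost ||x||^2
x = np.array([1.0, 2.0])
T = 2.0

v_start = value_estimate(w, x, 0.0, T)   # estimate at the start of the horizon
e_term = terminal_error(w, x, T, psi)    # terminal mismatch at t = T
```

In this toy setup the time dependence enters only through the basis, so the same `w` gives different estimates at different times, and a weight update law could penalize `e_term` alongside the usual residual error.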

Citation:

Pages: 1200 - 1212

Number of pages: 13

References (43 in total):

[1] Successive Galerkin approximation algorithms for nonlinear optimal and robust control
Beard, RW
McLain, TW
[J]. INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) : 717 - 743
[2] Chen Ben M, 2000, ROBUST H CONTROL
[3] Chen BS, 1996, IEEE T FUZZY SYST, V4, P32, DOI 10.1109/91.481843
[4] A neural network solution for fixed-final time optimal control of nonlinear systems
Cheng, Tao
Lewis, Frank L.
Abu-Khalaf, Murad
[J]. AUTOMATICA, 2007, 43 (03) : 482 - 490
[5] Finite horizon H∞ filtering with initial condition
Foo, Yung Kuan
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2006, 53 (11) : 1220 - 1224
[6] Improved scalar parameters approach to design robust H∞ filter for uncertain discrete-time linear systems
Han, Kezhen
Feng, Jian
[J]. SIGNAL PROCESSING, 2015, 113 : 113 - 123
[7] Closed-form solution to finite-horizon suboptimal control of nonlinear systems
Heydari, Ali
Balakrishnan, S. N.
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2015, 25 (15) : 2687 - 2704
[8] Fixed-final-time optimal tracking control of input-affine nonlinear systems
Heydari, Ali
Balakrishnan, S. N.
[J]. NEUROCOMPUTING, 2014, 129 : 528 - 539
[9] Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
Heydari, Ali
Balakrishnan, Sivasubramanya N.
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (01) : 145 - 157
[10] Approximate optimal trajectory tracking for continuous-time nonlinear systems
Kamalapurkar, Rushikesh
Dinh, Huyen
Bhasin, Shubhendu
Dixon, Warren E.
[J]. AUTOMATICA, 2015, 51 : 40 - 48
