Finite-Horizon H∞ Tracking Control for Unknown Nonlinear Systems With Saturating Actuators

被引:85
作者
Zhang, Huaguang [1 ]
Cui, Xiaohong [1 ,2 ]
Luo, Yanhong [1 ]
Jiang, He [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Mudanjiang Normal Univ, Inst Math Sci, Mudanjiang 157011, Peoples R China
关键词
H-infinity tracking; finite-horizon; Hamilton-Jacobi-Isaacs (HJI) equation; model-free; neural network (NN); online learning; TIME-OPTIMAL-CONTROL; POLICY UPDATE ALGORITHM; ITERATION; DESIGN;
D O I
10.1109/TNNLS.2017.2669099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a neural network (NN)-based online model-free integral reinforcement learning algorithm is developed to solve the finite-horizon H-infinity optimal tracking control problem for completely unknown nonlinear continuous-time systems with disturbance and saturating actuators (constrained control input). An augmented system is constructed with the tracking error system and the command generator system. A time-varying Hamilton-Jacobi-Isaacs (HJI) equation is formulated for the augmented problem, which is extremely difficult or impossible to solve due to its time-dependent property and nonlinearity. Then, an actor-critic-disturbance NN structure-based scheme is proposed to learn the time-varying solution to the HJI equation in real time without using the knowledge of system dynamics. Since the solution to the HJI equation is time-dependent, the form of NNs representation with constant weights and time-dependent activation functions is considered. Furthermore, an extra error is incorporated in order to satisfy the terminal constraints in the weight update law. Convergence and stability proofs are given based on the Lyapunov theory for nonautonomous systems. Two simulation examples are provided to demonstrate the effectiveness of the designed algorithm.
引用
收藏
页码:1200 / 1212
页数:13
相关论文
共 43 条
  • [1] Successive Galerkin approximation algorithms for nonlinear optimal and robust control
    Beard, RW
    McLain, TW
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 1998, 71 (05) : 717 - 743
  • [2] Chen Ben M, 2000, ROBUST H CONTROL
  • [3] Chen BS, 1996, IEEE T FUZZY SYST, V4, P32, DOI 10.1109/91.481843
  • [4] A neural network solution for fixed-final time optimal control of nonlinear systems
    Cheng, Tao
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. AUTOMATICA, 2007, 43 (03) : 482 - 490
  • [5] Finite horizon H∞ filtering with initial condition
    Foo, Yung Kuan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2006, 53 (11): : 1220 - 1224
  • [6] Improved scalar parameters approach to design robust H∞ filter for uncertain discrete-time linear systems
    Han, Kezhen
    Feng, Jian
    [J]. SIGNAL PROCESSING, 2015, 113 : 113 - 123
  • [7] Closed-form solution to finite-horizon suboptimal control of nonlinear systems
    Heydari, Ali
    Balakrishnan, S. N.
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2015, 25 (15) : 2687 - 2704
  • [8] Fixed-final-time optimal tracking control of input-affine nonlinear systems
    Heydari, Ali
    Balakrishnan, S. N.
    [J]. NEUROCOMPUTING, 2014, 129 : 528 - 539
  • [9] Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
    Heydari, Ali
    Balakrishnan, Sivasubramanya N.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (01) : 145 - 157
  • [10] Approximate optimal trajectory tracking for continuous-time nonlinear systems
    Kamalapurkar, Rushikesh
    Dinh, Huyen
    Bhasin, Shubhendu
    Dixon, Warren E.
    [J]. AUTOMATICA, 2015, 51 : 40 - 48