Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems

Cited by: 182
Authors
Jiang, Yu [1]
Jiang, Zhong-Ping [2 ]
Affiliations
[1] The MathWorks, Natick, MA 01760 USA
[2] NYU, Polytech Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA
Funding
U.S. National Science Foundation
Keywords
Adaptive dynamic programming; global stabilization; nonlinear systems; optimal control; stabilization; approximations; observability; controllers; design
DOI
10.1109/TAC.2015.2414811
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB) equation to an optimization problem, which is solved via a new policy iteration method. The proposed method differs from previously known nonlinear ADP methods in that the neural network approximation is avoided, giving rise to significant computational improvement. Instead of semiglobally or locally stabilizing, the resultant control policy is globally stabilizing for a general class of nonlinear polynomial systems. Furthermore, in the absence of a priori knowledge of the system dynamics, an online learning method is devised to implement the proposed policy iteration technique by generalizing the current ADP theory. Finally, three numerical examples are provided to validate the effectiveness of the proposed method.
Pages: 2917-2929
Number of pages: 13