Constrained Online Optimal Control for Continuous-Time Nonlinear Systems Using Neuro-Dynamic Programming

被引:0
作者
Yang Xiong [1 ]
Liu Derong [1 ]
Wang Ding [1 ]
Ma Hongwen [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
来源
2014 33RD CHINESE CONTROL CONFERENCE (CCC) | 2014年
关键词
Constrained input; Neuro-dynamic programming; Nonlinear systems; Online control; Optimal control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops an online adaptive optimal control scheme to solve the infinite-horizon optimal control problem of continuous-time nonlinear systems with control constraints. A novel architecture is presented to approximate the Hamilton-Jacobi-Bellman equation. That is, only a critic neural network is used to derive the optimal control instead of typical actioncritic dual networks employed in neuro-dynamic programming methods. Meanwhile, unlike existing tuning laws for the critic, the newly developed critic update rule not only ensures convergence of the critic to the optimal control but also guarantees the closed-loop system to be uniformly ultimately bounded. In addition, no initial stabilizing control is required. Finally, an example is provided to verify the effectiveness of the present approach.
引用
收藏
页码:8717 / 8722
页数:6
相关论文
共 20 条
  • [11] Reinforcement Learning and Feedback Control USING NATURAL DECISION METHODS TO DESIGN OPTIMAL ADAPTIVE CONTROLLERS
    Lewis, Frank L.
    Vrabie, Draguna
    Vamvoudakis, Kyriakos G.
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 2012, 32 (06): : 76 - 105
  • [12] Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control
    Lewis, Frank L.
    Vrabie, Draguna
    [J]. IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2009, 9 (03) : 32 - 50
  • [13] Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
    Liu, Derong
    Yang, Xiong
    Li, Hongliang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) : 1843 - 1850
  • [14] Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems
    Liu, Derong
    Wei, Qinglai
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (02) : 779 - 789
  • [15] Online solution of nonquadratic two-player zero-sum games arising in the H∞ control of constrained input systems
    Modares, Hamidreza
    Lewis, Frank L.
    Sistani, Mohammad-Bagher Naghibi
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2014, 28 (3-5) : 232 - 254
  • [16] Si J., 2004, HDB LEARNING APPROXI
  • [17] Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    Vamvoudakis, Kyriakos G.
    Lewis, Frank L.
    [J]. AUTOMATICA, 2010, 46 (05) : 878 - 888
  • [18] Adaptive Dynamic Programming: An Introduction
    Wang, Fei-Yue
    Zhang, Huaguang
    Liu, Derong
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2009, 4 (02) : 39 - 47
  • [19] Yang X, 2014, INT J CONTROL, V87, P553, DOI 10.1080/00207179.2013.848292
  • [20] Zhang H, 2013, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-4757-2