Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks

Cited by: 361
Authors
Modares, Hamidreza [1]
Lewis, Frank L. [2]
Naghibi-Sistani, Mohammad-Bagher [1]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran
[2] Univ Texas Arlington, Res Inst, Ft Worth, TX 76118 USA
Funding
National Science Foundation (USA);
Keywords
Input constraints; neural networks; optimal control; reinforcement learning; unknown dynamics; CONTINUOUS-TIME;
DOI
10.1109/TNNLS.2013.2276571
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems. The proposed PI algorithm is implemented on an actor-critic structure where two neural networks (NNs) are tuned online and simultaneously to generate the optimal bounded control policy. The requirement of complete knowledge of the system dynamics is obviated by employing a novel NN identifier in conjunction with the actor and critic NNs. It is shown how the identifier weights estimation error affects the convergence of the critic NN. A novel learning rule is developed to guarantee that the identifier weights converge to small neighborhoods of their ideal values exponentially fast. To provide an easy-to-check persistence of excitation condition, the experience replay technique is used. That is, recorded past experiences are used simultaneously with current data for the adaptation of the identifier weights. Stability of the whole system consisting of the actor, critic, system state, and system identifier is guaranteed while all three networks undergo adaptation. Convergence to a near-optimal control law is also shown. The effectiveness of the proposed method is illustrated with a simulation example.
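The experience replay idea in the abstract (recorded past data used together with current data to adapt the identifier weights) can be illustrated with a minimal sketch. This is not the paper's learning rule: the polynomial regressor `features`, the normalized gradient step `replay_step`, the learning rate, and the rank test on the recorded regressors are all assumptions chosen for illustration, with a linear-in-parameters model standing in for the NN identifier.

```python
def features(x):
    # Hypothetical regressor; the paper's identifier uses NN basis functions.
    return [1.0, x, x * x]

def predict(w, phi):
    return sum(wi * pi for wi, pi in zip(w, phi))

def replay_step(w, current, memory, lr=0.1):
    """One normalized gradient step on the current sample plus all recorded samples."""
    grad = [0.0] * len(w)
    for phi, y in [current] + memory:
        err = predict(w, phi) - y
        norm = 1.0 + sum(p * p for p in phi)  # normalization keeps the step bounded
        for i, p in enumerate(phi):
            grad[i] += err * p / norm
    return [wi - lr * gi for wi, gi in zip(w, grad)]

def rank(rows, tol=1e-9):
    """Matrix rank by Gaussian elimination with partial pivoting."""
    m = [list(r) for r in rows]
    rk = 0
    for c in range(len(m[0]) if m else 0):
        if rk == len(m):
            break
        pivot = max(range(rk, len(m)), key=lambda r: abs(m[r][c]))
        if abs(m[pivot][c]) < tol:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(rk + 1, len(m)):
            f = m[r][c] / m[rk][c]
            m[r] = [a - f * b for a, b in zip(m[r], m[rk])]
        rk += 1
    return rk

true_w = [0.5, -1.0, 2.0]  # "ideal" weights, known here only to generate data

# Recorded past experiences (regressor, target) at three distinct states.
memory = [(features(x), predict(true_w, features(x))) for x in (-1.0, 0.0, 1.0)]

# Assumed easy-to-check richness condition: the recorded regressors span
# the whole weight space, i.e. their rank equals the number of weights.
memory_rank = rank([phi for phi, _ in memory])

# The current data alone is not exciting: the state sits at x = 0.5.
current = (features(0.5), predict(true_w, features(0.5)))

w_replay = [0.0, 0.0, 0.0]
w_plain = [0.0, 0.0, 0.0]
for _ in range(2000):
    w_replay = replay_step(w_replay, current, memory)
    w_plain = replay_step(w_plain, current, [])  # no recorded data

err_replay = max(abs(a - b) for a, b in zip(w_replay, true_w))
err_plain = max(abs(a - b) for a, b in zip(w_plain, true_w))
print(memory_rank, err_replay, err_plain)
```

In this toy setting the weights adapted with recorded data approach `true_w`, while the weights adapted from the single non-exciting current sample do not, which is the motivation for replacing a persistence of excitation requirement on the live signal with a condition that can be checked on stored data.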
Pages: 1513-1525
Page count: 13
Related papers
50 in total
  • [21] Optimal Learning Control for Discrete-Time Nonlinear Systems Using Generalized Policy Iteration Based Adaptive Dynamic Programming
    Wei, Qinglai
    Liu, Derong
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1781 - 1786
  • [22] Extended adaptive optimal control of linear systems with unknown dynamics using adaptive dynamic programming
    Gan, Minggang
    Zhao, Jingang
    Zhang, Chi
    ASIAN JOURNAL OF CONTROL, 2021, 23 (02) : 1097 - 1106
  • [23] Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    Liu, Xi
    Luo, Fangchao
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3650 - 3655
  • [24] Model-free Nearly Optimal Control of Constrained-Input Nonlinear Systems Based on Synchronous Reinforcement Learning
    Zhao, Han
    Guo, Lei
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2162 - 2167
  • [25] Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks
    Zhu, Liao
    Wei, Qinglai
    Guo, Ping
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (08) : 5112 - 5122
  • [26] Inverse Optimal Adaptive Neural Control for State-Constrained Nonlinear Systems
    Lu, Kaixin
    Liu, Zhi
    Yu, Haoyong
    Chen, C. L. Philip
    Zhang, Yun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10617 - 10628
  • [27] Adaptive control for input-constrained linear systems
    Bong Seok Park
    Jae Young Lee
    Jin Bae Park
    Yoon Ho Choi
    International Journal of Control, Automation and Systems, 2012, 10 : 890 - 896
  • [28] Adaptive Optimal Control for Unknown Constrained Nonlinear Systems With a Novel Quasi-Model Network
    Han, Xiumei
    Zhao, Xudong
    Karimi, Hamid Reza
    Wang, Ding
    Zong, Guangdeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2867 - 2878
  • [29] Adaptive Control for Input-Constrained Linear Systems
    Park, Bong Seok
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2012, 10 (05) : 890 - 896
  • [30] Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints (vol 87, pg 553, 2014)
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (03) : I - I