Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach

被引:86
|
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [1 ]
机构
[1] NYU, Control & Networks Lab, Tandon Sch Engn, Dept Elect & Comp Engn, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Nonlinear systems; Optimal control; Adaptive systems; Dynamical systems; Mathematical model; Heuristic algorithms; Linear systems; Adaptive optimal control; nonlinear systems; value iteration (VI); INTERCONNECTED SYSTEMS; STABILIZATION;
D O I
10.1109/TNNLS.2020.3045087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article studies the adaptive optimal control problem for continuous-time nonlinear systems described by differential equations. A key strategy is to exploit the value iteration (VI) method proposed initially by Bellman in 1957 as a fundamental tool to solve dynamic programming problems. However, previous VI methods are all exclusively devoted to the Markov decision processes and discrete-time dynamical systems. In this article, we aim to fill up the gap by developing a new continuous-time VI method that will be applied to address the adaptive or nonadaptive optimal control problems for continuous-time systems described by differential equations. Like the traditional VI, the continuous-time VI algorithm retains the nice feature that there is no need to assume the knowledge of an initial admissible control policy. As a direct application of the proposed VI method, a new class of adaptive optimal controllers is obtained for nonlinear systems with totally unknown dynamics. A learning-based control algorithm is proposed to show how to learn robust optimal controllers directly from real-time data. Finally, two examples are given to illustrate the efficacy of the proposed methodology.
引用
收藏
页码:2781 / 2790
页数:10
相关论文
共 50 条
  • [1] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [2] General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    Qu, Qiuxia
    NEUROCOMPUTING, 2017, 245 : 114 - 123
  • [3] Adaptive optimal tracking control for nonlinear continuous-time systems with time delay using value iteration algorithm
    Shi, Jing
    Yue, Dong
    Xie, Xiangpeng
    NEUROCOMPUTING, 2020, 396 : 172 - 178
  • [4] Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (03) : 553 - 566
  • [5] Value iteration based integral reinforcement learning approach for H∞ controller design of continuous-time nonlinear systems
    Xiao, Geyang
    Zhang, Huaguang
    Zhang, Kun
    Wen, Yinlei
    NEUROCOMPUTING, 2018, 285 : 51 - 59
  • [6] Adaptive optimal control of continuous-time nonlinear affine systems via hybrid iteration
    Qasem, Omar
    Gao, Weinan
    Vamvoudakis, Kyriakos G.
    AUTOMATICA, 2023, 157
  • [7] Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    Vrabie, D.
    Lewis, F. L.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 73 - 79
  • [8] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118
  • [9] Online reinforcement learning for a class of partially unknown continuous-time nonlinear systems via value iteration
    Su, Hanguang
    Zhang, Huaguang
    Zhang, Kun
    Gao, Wenzhong
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2018, 39 (02): : 1011 - 1028
  • [10] Adaptive Optimal Control of Continuous-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Bian, Tao
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,