Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design

被引:201
|
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [1 ]
机构
[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab,Metrotech Ctr 5, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Value iteration; Adaptive dynamic programming; Optimal control; Adaptive control; Stochastic approximation; CONTINUOUS-TIME SYSTEMS; STOCHASTIC-APPROXIMATION; NONLINEAR-SYSTEMS; LINEAR-SYSTEMS; ROBUST STABILIZATION; STABILITY; DELAYS;
D O I
10.1016/j.automatica.2016.05.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel non-model-based, data-driven adaptive optimal controller design for linear continuous-time systems with completely unknown dynamics. Inspired by the stochastic approximation theory, a continuous-time version of the traditional value iteration (VI) algorithm is presented with rigorous convergence analysis. This VI method is crucial for developing new adaptive dynamic programming methods to solve the adaptive optimal control problem and the stochastic robust optimal control problem for linear continuous-time systems. Fundamentally different from existing results, the a priori knowledge of an initial admissible control policy is no longer required. The efficacy of the proposed methodology is illustrated by two examples and a brief comparative study between VI and earlier policy iteration methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:348 / 360
页数:13
相关论文
共 50 条
  • [1] Distributed adaptive dynamic programming for data-driven optimal control
    Tang, Wentao
    Daoutidis, Prodromos
    SYSTEMS & CONTROL LETTERS, 2018, 120 : 36 - 43
  • [2] Value Iteration, Adaptive Dynamic Programming, and Optimal Control of Nonlinear Systems
    Bian, Tao
    Jiang, Zhong-Ping
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 3375 - 3380
  • [3] Adaptive Dynamic Programming and Data-Driven Cooperative Optimal Output Regulation with Adaptive Observers
    Qasem, Omar
    Jebari, Khalid
    Gao, Weinan
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2538 - 2543
  • [4] Integrated adaptive dynamic programming for data -driven optimal controller design
    Li, Guoqiang
    Goerges, Daniel
    Mu, Chaoxu
    NEUROCOMPUTING, 2020, 403 : 143 - 152
  • [5] Data-Driven Neuro-Optimal Tracking Control of Ozone production Based on Adaptive Dynamic Programming
    Dong Zhe
    Liu Wenjuan
    Li Yueheng
    Han Jie
    Chen MengJiao
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 9731 - 9736
  • [6] Data-Driven Iterative Adaptive Dynamic Programming Algorithm for Approximate Optimal Control of Unknown Nonlinear Systems
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    Li, Chao
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3265 - 3271
  • [7] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    Lin, Hanquan
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853
  • [8] DATA-DRIVEN CONTROL OF HYDRAULIC SERVO ACTUATOR BASED ON ADAPTIVE DYNAMIC PROGRAMMING
    Djordjevic, Vladimir
    Stojanovic, Vladimir
    Tao, Hongfeng
    Song, Xiaona
    He, Shuping
    Gao, Weinan
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES S, 2022, 15 (07): : 1633 - 1650
  • [9] Data-Driven Adaptive Dynamic Programming for Optimal Control of Continuous-Time Multicontroller Systems With Unknown Dynamics
    Zhao, Jingang
    IEEE ACCESS, 2022, 10 : 41503 - 41511
  • [10] Data-driven Neuro-optimal Tracking Control of Ozone Generation Process Based on Adaptive Dynamic Programming
    Dong, Zhe
    Liu, Wenjuan
    Li, Yueheng
    Han, Jie
    Chen, Mengjiao
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 650 - 655