The role of identification in data-driven policy iteration: A system theoretic study

被引:0
|
作者
Song, Bowen [1 ]
Iannelli, Andrea [1 ]
机构
[1] Univ Stuttgart, Inst Syst Theory & Automat Control, Pfeffenwaldring 9, D-70569 Stuttgart, Germany
关键词
data-driven control; nonlinear systems; policy iteration; robustness; system identification; RICCATI EQUATION; CONVERGENCE; STABILITY;
D O I
10.1002/rnc.7475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of this article is to study fundamental mechanisms behind so-called indirect and direct data-driven control for unknown systems. Specifically, we consider policy iteration applied to the linear quadratic regulator problem. Two iterative procedures, where data collected from the system are repeatedly used to compute new estimates of the desired optimal controller, are considered. In indirect policy iteration, data are used to obtain an updated model estimate through a recursive identification scheme, which is used in a certainty-equivalent fashion to perform the classic policy iteration update. By casting the concurrent model identification and control design as a feedback interconnection between two algorithmic systems, we provide a closed-loop analysis that shows convergence and robustness properties for arbitrary levels of excitation in the data. In direct policy iteration, data are used to approximate the value function and design the associated controller without requiring the intermediate identification step. After proposing an extension to a recently proposed scheme that overcomes potential identifiability issues, we establish under which conditions this procedure is guaranteed to deliver the optimal controller. Based on these analyses we are able to compare the strengths and limitations of the two approaches, highlighting aspects such as the required samples, convergence properties, and excitation requirement. Simulations are also provided to illustrate the results.
引用
收藏
页数:32
相关论文
共 50 条
  • [41] Data-Driven Robust Backward Reachable Sets for Set-Theoretic Model Predictive Control
    Attar, Mehran
    Lucia, Walter
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 2305 - 2310
  • [42] Data-driven system identification and control of home facilities for comfortable, energy cost effective and safe home
    Yang, Zhenyi
    Zhang, Long
    2024 29TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING, ICAC 2024, 2024, : 81 - 86
  • [43] Data-Driven Vehicle Dynamics: Neural Network Modeling for System Identification and Prediction in Driver Assistance Control
    Song, Pan
    Zheng, Ling
    Tian, Guannan
    Zhang, Linbo
    AUTOMOTIVE INNOVATION, 2025, 8 (01) : 46 - 58
  • [44] Composed Physics- and Data-driven System Identification for Non-autonomous Systems in Control Engineering
    Goette, Ricarda-Samantha
    Timmermann, Julia
    2022 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, ROBOTICS AND CONTROL, AIRC, 2022, : 67 - 76
  • [45] Data-driven structural identification of nonlinear assemblies: Structures with bolted joints
    Safari, S.
    Monsalve, J. M. Londono
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 195
  • [46] Data-driven model identification of guided wave propagation in composite structures
    da Silva, Samuel
    JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2018, 40 (11)
  • [47] Rapid identification of switched systems: A data-driven method in variational framework
    ChunJiang Li
    ZhiLong Huang
    Yong Wang
    HanQing Jiang
    Science China Technological Sciences, 2021, 64 : 148 - 156
  • [48] Data-driven stochastic subspace identification of flutter derivatives of bridge decks
    Boonyapinyo, Virote
    Janesupasaeree, Tharach
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2010, 98 (12) : 784 - 799
  • [49] Data-driven Buck converter model identification method with missing outputs
    Hou, Jie
    Zhang, Xinhua
    Wang, Huiming
    Wang, Shiwei
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (14): : 1825 - 1835
  • [50] Data-Driven Topology and Parameter Identification in Distribution Systems With Limited Measurements
    de Jongh, Steven
    Mueller, Felicitas
    Osterberg, Fabian
    Canizares, Claudio A.
    Leibfried, Thomas
    Bhattacharya, Kankar
    IEEE TRANSACTIONS ON POWER DELIVERY, 2025, 40 (01) : 249 - 260