Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems☆

被引:0
|
作者
Zhang, Han [1 ,2 ]
Ringh, Axel [3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
关键词
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;
D O I
10.1016/j.automatica.2024.111705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finitehorizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistical consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Weak detectability and the linear-quadratic control problem of discrete-time Markov jump linear systems
    Costa, EF
    Do Val, JBR
    INTERNATIONAL JOURNAL OF CONTROL, 2002, 75 (16-17) : 1282 - 1292
  • [32] Discrete-Time Linear-Quadratic Dynamic Games
    Pachter, M.
    Pham, K. D.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2010, 146 (01) : 151 - 179
  • [33] Discrete-Time Linear-Quadratic Dynamic Games
    M. Pachter
    K. D. Pham
    Journal of Optimization Theory and Applications, 2010, 146 : 151 - 179
  • [34] Linear-quadratic optimal control problems for discrete descriptor systems
    Kurina, G
    SYSTEM STRUCTURE AND CONTROL 2001, VOLS 1 AND 2, 2001, : 261 - 266
  • [35] Distributed linear-quadratic regulator control for discrete-time multi-agent systems
    Wang, Wei
    Zhang, Fangfang
    Han, Chunyan
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (14): : 2279 - 2287
  • [36] Linear-Quadratic Near-Optimal Controls For Discrete-Time Mean-Field Systems
    Qin, Xilin
    Hou, Ting
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 945 - 950
  • [37] Pathwise turnpike and dissipativity results for discrete-time stochastic linear-quadratic optimal control problems
    Schiessl, Jonas
    Ou, Ruchuan
    Faulwasser, Timm
    Baumann, Michael Heinrich
    Gruene, Lars
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2790 - 2795
  • [38] Optimal nominal dual control for discrete-time linear-quadratic Gaussian problems with unknown parameters
    Li, Duan
    Qian, Fucai
    Fu, Peilin
    AUTOMATICA, 2008, 44 (01) : 119 - 127
  • [39] Inverse linear-quadratic discrete-time finite-horizon optimal control for indistinguishable homogeneous agents: A convex optimization approach
    Zhang, Han
    Ringh, Axel
    AUTOMATICA, 2023, 148
  • [40] Inverse optimal control for discrete-time finite-horizon Linear Quadratic Regulators
    Zhang, Han
    Umenberger, Jack
    Hu, Xiaoming
    AUTOMATICA, 2019, 110