Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems☆

被引:0
|
作者
Zhang, Han [1 ,2 ]
Ringh, Axel [3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
关键词
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;
D O I
10.1016/j.automatica.2024.111705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finitehorizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistical consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Inverse optimal neural control for a class of discrete-time nonlinear positive systems
    Leon, Blanca S.
    Alanis, Alma Y.
    Sanchez, Edgar N.
    Ruiz-Velazquez, Eduardo
    Ornelas-Tellez, Fernando
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2012, 26 (07) : 614 - 629
  • [22] INDEFINITE LQ OPTIMAL CONTROL WITH PROCESS STATE INEQUALITY CONSTRAINTS FOR DISCRETE-TIME UNCERTAIN SYSTEMS
    Chen, Yuefen
    Zhu, Yuanguo
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2018, 14 (03) : 913 - 930
  • [23] Discrete-time inverse optimal neural control for synchronous generators
    Alanis, Alma Y.
    Ornelas-Tellez, Fernando
    Sanchez, Edgar N.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (02) : 697 - 705
  • [24] Robust inverse optimal control for discrete-time nonlinear system stabilization
    Ornelas-Tellez, Fernando
    Sanchez, Edgar N.
    Loukianov, Alexander G.
    Jesus Rico, J.
    EUROPEAN JOURNAL OF CONTROL, 2014, 20 (01) : 38 - 44
  • [25] Linear Quadratic Regulation and Stabilization of Discrete-Time Systems With Delay and Multiplicative Noise
    Zhang, Huanshui
    Li, Lin
    Xu, Juanjuan
    Fu, Minyue
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (10) : 2599 - 2613
  • [26] Turnpike Properties for Stochastic Linear-Quadratic Optimal Control Problems
    Sun, Jiangrui
    Wang, Hanxiao
    Yong, Jiongmin
    CHINESE ANNALS OF MATHEMATICS SERIES B, 2022, 43 (06) : 999 - 1022
  • [27] Optimal control for both forward and backward discrete-time systems
    Chen, Xin
    Yuan, Yue
    Yuan, Dongmei
    Ge, Xiao
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2024, 221 : 298 - 314
  • [28] Discrete-Time Fractional Optimal Control
    Chiranjeevi, Tirumalasetty
    Biswas, Raj Kumar
    MATHEMATICS, 2017, 5 (02)
  • [29] Robust Optimal Control of Biobjective Linear-Quadratic System With Noisy Observation
    Wang, Guangchen
    Xing, Zhuangzhuang
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) : 303 - 308
  • [30] Inverse optimal control for asymptotic trajectory tracking of discrete-time stochastic nonlinear systems in block controllable form
    Elvira-Ceja, Santiago
    Sanchez, Edgar N.
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2018, 39 (05) : 1702 - 1715