Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems

Authors
Zhang, Han [1, 2]
Ringh, Axel [3, 4]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
Keywords
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;
DOI
10.1016/j.automatica.2024.111705
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finite-horizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; there is both process noise and observation noise; the dynamics can have a drift term; and the objective can have a linear cost term. In this setting, we first formulate necessary and sufficient conditions for the forward optimal control problem to be solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistically consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. © 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
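To make the forward problem in the abstract concrete, the following is a minimal sketch of a finite-horizon discrete-time LQ problem whose quadratic cost need not be positive semi-definite. It is an illustration under simplifying assumptions, not the paper's method: it is deterministic (no process or observation noise), has a fixed horizon, no drift term, and no linear cost term. The function name `backward_riccati` and the scalar example values are hypothetical. The solvability test used here is the textbook check that `R + B' P B` is positive definite at every stage of the backward Riccati recursion; the paper's own necessary and sufficient conditions may differ.

```python
import numpy as np


def backward_riccati(A, B, Q, R, QN, N):
    """Backward Riccati recursion for x_{t+1} = A x_t + B u_t.

    Stage cost is x'Qx + u'Ru and terminal cost is x'QN x, where Q and R
    may be indefinite. Returns (gains, P0) with u_t = -gains[t] @ x_t.
    Raises ValueError if R + B'PB is not positive definite at some stage
    (assumed textbook condition, not taken from the paper).
    """
    P = QN
    gains = []
    for _ in range(N):
        H = R + B.T @ P @ B
        H = (H + H.T) / 2  # symmetrize against round-off
        if np.min(np.linalg.eigvalsh(H)) <= 1e-9:
            raise ValueError("R + B'PB is not positive definite")
        K = np.linalg.solve(H, B.T @ P @ A)
        P = Q + A.T @ P @ A - A.T @ P @ B @ K
        P = (P + P.T) / 2
        gains.append(K)
    gains.reverse()  # computed backward in time; return in forward order
    return gains, P


# Hypothetical scalar example with a negative (hence not PSD) state cost.
A = np.array([[1.0]])
B = np.array([[1.0]])
Q = np.array([[-0.1]])
R = np.array([[1.0]])
QN = np.array([[1.0]])

gains, P0 = backward_riccati(A, B, Q, R, QN, N=2)
```

In this example the recursion succeeds even though `Q` is negative, because the terminal cost and `R` keep `R + B' P B` positive at both stages. Replacing `R` with a sufficiently negative value makes the check fail and the function raises `ValueError`.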
Pages: 13