Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems☆

被引:0
|
作者
Zhang, Han [1 ,2 ]
Ringh, Axel [3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
关键词
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;
D O I
10.1016/j.automatica.2024.111705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finitehorizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistical consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Indefinite Linear Quadratic Optimal Control and Stabilization Problem for Discrete-Time Rectangular Descriptor Markov Jump Systems With Noise
    Li, Yichun
    Ma, Shuping
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 841 - 853
  • [42] Discrete-time linear-quadratic output regulator for two variable systems
    Gessing, R
    PROCEEDINGS OF THE 2000 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-AIDED CONTROL SYSTEM DESIGN, 2000, : 279 - 284
  • [43] Indefinite Linear Quadratic Optimal Control Problem for Singular Linear Discrete-time System: Krein Space Method
    CUI Peng ZHANG ChengHui School of Control Science and Engineering Shandong University Jinan P R China
    自动化学报, 2007, (06) : 635 - 640
  • [44] Indefinite linear quadratic optimal control problem for singular linear discrete-time system: krein space method
    Cui, Peng
    Zhang, Cheng-Hui
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (06): : 635 - 640
  • [45] On linear quadratic optimal control of discrete-time complex-valued linear systems
    Zhou, Bin
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2020, 41 (02): : 499 - 520
  • [46] New algorithm for discrete-time linear-quadratic control with inequality constraints
    Almutairi, NB
    Hassan, MF
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 (02): : 309 - 336
  • [47] Inverse Reinforcement Learning for Discrete-Time Linear Quadratic Systems
    Yu, Meiling
    Ni, Yuanhua
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1027 - 1032
  • [48] Inverse Stochastic Optimal Control for Linear-Quadratic Gaussian and Linear-Quadratic Sensorimotor Control Models
    Karg, Philipp
    Stoll, Simon
    Rothfuss, Simon
    Hohmann, Soren
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2801 - 2808
  • [49] Indefinite Backward Stochastic Linear-Quadratic Optimal Control Problems
    Sun, Jingrui
    Wu, Zhen
    Xiong, Jie
    ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2023, 29
  • [50] Indefinite linear quadratic optimal control for discrete time-varying linear rectangular descriptor systems
    Li, Yichun
    Ma, Shuping
    ASIAN JOURNAL OF CONTROL, 2023, 25 (02) : 1310 - 1322