Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems☆

被引:0
|
作者
Zhang, Han [1 ,2 ]
Ringh, Axel [3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
关键词
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;
D O I
10.1016/j.automatica.2024.111705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finitehorizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistical consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Optimal Learning Control Scheme for Discrete-Time Systems With Nonuniform Trials
    Liu, Chen
    Ruan, Xiaoe
    Shen, Dong
    Jiang, Hao
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (06) : 3639 - 3650
  • [32] Optimal disturbance attenuation for discrete-time switched and Markovian jump linear systems
    Lee, Ji-Woong
    Dullerud, Geir E.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2006, 45 (04) : 1329 - 1358
  • [33] Inverse optimal controller based on extended Kalman filter for discrete-time nonlinear systems
    Almobaied, Moayed
    Eksin, Ibrahim
    Guzelkaya, Mujde
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2018, 39 (01) : 19 - 34
  • [34] Optimal output regulation for discrete-time switched and Markovian jump linear systems
    Lee, Ji-Woong
    Khargonekar, Pramod P.
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (01) : 40 - 72
  • [35] Constrained minimum variance control for discrete-time stochastic linear systems
    Bakolas, E.
    SYSTEMS & CONTROL LETTERS, 2018, 113 : 109 - 116
  • [36] Maximum turn-off control for discrete-time linear systems
    Iwata, Takumi
    Azuma, Shun-ichi
    Ariizumi, Ryo
    Asai, Toru
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (01) : 23 - 34
  • [37] Discrete-time inverse optimal control for a reaction wheel pendulum: a passivity-based control approach
    Danilo Montoya, Oscar
    Gil-Gonzalez, Walter
    Martin Serra, Federico
    UIS INGENIERIAS, 2020, 19 (04): : 123 - 132
  • [38] A primal-dual interior-point method for robust optimal control of linear discrete-time systems
    Hansson, A
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (09) : 1639 - 1655
  • [39] Direct Data-Driven Optimal Set-Point Tracking Control of Linear Discrete-Time Systems
    Xu, Yao
    Zhou, Linna
    Zhao, Jianguo
    Ma, Lei
    Yang, Chunyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (08) : 3795 - 3799
  • [40] Identifiability and Solvability in Inverse Linear Quadratic Optimal Control Problems
    Li Yibei
    Wahlberg, Bo
    Hu Xiaoming
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (05) : 1840 - 1857