Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems

Authors
Zhang, Han [1, 2]
Ringh, Axel [3, 4]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China
[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden
[4] Univ Gothenburg, S-41296 Gothenburg, Sweden
Keywords
Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL
DOI
10.1016/j.automatica.2024.111705
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finite-horizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and the objective can have a linear cost term. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistically consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. © 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
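To illustrate why solvability of the forward problem needs checking when the quadratic cost is indefinite, here is a minimal sketch for a heavily simplified case. It is not the paper's formulation or its conditions. It assumes a scalar, noise-free system x_{t+1} = a*x_t + b*u_t with no drift and no linear cost term, a fixed horizon, stage cost q*x_t^2 + r*u_t^2 and terminal cost q_terminal*x_N^2. The function name `scalar_riccati` and all parameter names are hypothetical. The sketch runs the standard backward Riccati recursion and reports failure as soon as the curvature in the control, r + b^2 * p, is not strictly positive.

```python
def scalar_riccati(a, b, q, r, q_terminal, horizon):
    """Backward Riccati recursion for a scalar finite-horizon LQ problem.

    Assumed (illustrative) setup: x_{t+1} = a*x_t + b*u_t, stage cost
    q*x_t**2 + r*u_t**2, terminal cost q_terminal*x_N**2. The weight q
    may be negative, i.e. the cost may be indefinite.

    Returns (solvable, gains, p_values), with gains[t] the feedback gain
    at time t (u_t = -gains[t] * x_t) and p_values[t] the cost-to-go
    coefficient at time t.
    """
    p = q_terminal
    gains = []
    p_values = [p]
    for _ in range(horizon):
        # Curvature of the one-step cost in u_t; it must be strictly
        # positive for the minimization over u_t to have a unique solution.
        curvature = r + b * b * p
        if curvature <= 0:
            return False, [], []
        k = a * b * p / curvature
        p = q + a * a * p - a * b * p * k
        gains.append(k)
        p_values.append(p)
    # The recursion runs backward in time; reverse to index by t.
    gains.reverse()
    p_values.reverse()
    return True, gains, p_values


# Mildly negative state weight: the recursion stays well defined.
ok_mild, gains_mild, p_mild = scalar_riccati(1.0, 1.0, -0.1, 1.0, 1.0, 3)

# Strongly negative state weight: the curvature turns negative.
ok_strong, _, _ = scalar_riccati(1.0, 1.0, -5.0, 1.0, 1.0, 3)
```

In this toy setting a small negative state weight still yields a well-defined feedback law, while a large one makes the cost unbounded below after one backward step. The setting summarized in the abstract is more general (matrix-valued, noisy, random horizon), so this only conveys the flavor of such a condition.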
Pages: 13