Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems☆

被引：0

作者：

Zhang, Han ^{[1
,2
]}

Ringh, Axel ^{[3
,4
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China

[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden

[4] Univ Gothenburg, S-41296 Gothenburg, Sweden

来源：

AUTOMATICA | 2024年 / 166卷

关键词：

Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;

D O I：

10.1016/j.automatica.2024.111705

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finitehorizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistical consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

引用

页数：13

共 50 条

[31] Optimal Learning Control Scheme for Discrete-Time Systems With Nonuniform Trials
Liu, Chen
Ruan, Xiaoe
Shen, Dong
Jiang, Hao
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (06) : 3639 - 3650
[32] Optimal disturbance attenuation for discrete-time switched and Markovian jump linear systems
Lee, Ji-Woong
Dullerud, Geir E.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2006, 45 (04) : 1329 - 1358
[33] Inverse optimal controller based on extended Kalman filter for discrete-time nonlinear systems
Almobaied, Moayed
Eksin, Ibrahim
Guzelkaya, Mujde
OPTIMAL CONTROL APPLICATIONS & METHODS, 2018, 39 (01) : 19 - 34
[34] Optimal output regulation for discrete-time switched and Markovian jump linear systems
Lee, Ji-Woong
Khargonekar, Pramod P.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (01) : 40 - 72
[35] Constrained minimum variance control for discrete-time stochastic linear systems
Bakolas, E.
SYSTEMS & CONTROL LETTERS, 2018, 113 : 109 - 116
[36] Maximum turn-off control for discrete-time linear systems
Iwata, Takumi
Azuma, Shun-ichi
Ariizumi, Ryo
Asai, Toru
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (01) : 23 - 34
[37] Discrete-time inverse optimal control for a reaction wheel pendulum: a passivity-based control approach
Danilo Montoya, Oscar
Gil-Gonzalez, Walter
Martin Serra, Federico
UIS INGENIERIAS, 2020, 19 (04): : 123 - 132
[38] A primal-dual interior-point method for robust optimal control of linear discrete-time systems
Hansson, A
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (09) : 1639 - 1655
[39] Direct Data-Driven Optimal Set-Point Tracking Control of Linear Discrete-Time Systems
Xu, Yao
Zhou, Linna
Zhao, Jianguo
Ma, Lei
Yang, Chunyu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (08) : 3795 - 3799
[40] Identifiability and Solvability in Inverse Linear Quadratic Optimal Control Problems
Li Yibei
Wahlberg, Bo
Hu Xiaoming
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (05) : 1840 - 1857

← 1 2 3 4 5 →