Statistically consistent inverse optimal control for discrete-time indefinite linear-quadratic systems

Cited by: 0

Authors:

Zhang, Han [1,2]

Ringh, Axel [3,4]

Affiliations:

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Inst Med Robot, Shanghai, Peoples R China

[3] Chalmers Univ Technol, Dept Math Sci, S-41296 Gothenburg, Sweden

[4] Univ Gothenburg, S-41296 Gothenburg, Sweden

Source:

AUTOMATICA | 2024 / Vol. 166

Keywords:

Inverse optimal control; Inverse reinforcement learning; Indefinite linear quadratic regulator; System identification; Convex optimization; Semidefinite programming; Time-varying system matrices; MODEL;

DOI:

10.1016/j.automatica.2024.111705

Chinese Library Classification (CLC):

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finite-horizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; there is both process noise and observation noise; the dynamics can have a drift term; and the objective can have a linear cost term. In this setting, we first formulate necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistically consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
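
To illustrate what solvability of an indefinite forward problem can look like, here is a minimal sketch of the textbook backward Riccati recursion for a noise-free, drift-free problem with cost sum_t (x_t' Q x_t + u_t' R u_t) + x_T' Q_T x_T and dynamics x_{t+1} = A x_t + B u_t. It is not the paper's condition or estimator: the function name `riccati_solvable`, the time-invariant matrices, and the strict positive-definiteness test on R + B' P B (a classical sufficient check for a unique minimizer) are all assumptions made for illustration.

```python
import numpy as np

def riccati_solvable(A, B, Q, R, Q_T, horizon, tol=1e-9):
    """Backward Riccati recursion with a convexity check (illustrative only).

    Q and Q_T may be indefinite. At each step the recursion continues only
    if H = R + B' P B is positive definite, which guarantees a unique
    minimizing input u_t = -K_t x_t at that step.
    Returns (solvable, gains), with gains ordered from t = 0 to t = horizon - 1.
    """
    P = np.asarray(Q_T, dtype=float)
    gains = []
    for _ in range(horizon):
        H = R + B.T @ P @ B
        H = (H + H.T) / 2  # symmetrize against round-off
        if np.linalg.eigvalsh(H).min() <= tol:
            return False, []  # cost is not strictly convex in u at this step
        K = np.linalg.solve(H, B.T @ P @ A)
        P = Q + A.T @ P @ A - A.T @ P @ B @ K
        P = (P + P.T) / 2
        gains.append(K)
    gains.reverse()  # computed backward in time, returned forward in time
    return True, gains

# Scalar example with a mildly indefinite state cost (Q < 0).
A = np.array([[1.0]])
B = np.array([[1.0]])
R = np.array([[1.0]])
Q_T = np.array([[1.0]])
ok, gains = riccati_solvable(A, B, np.array([[-0.5]]), R, Q_T, horizon=2)
bad, _ = riccati_solvable(A, B, np.array([[-3.0]]), R, Q_T, horizon=2)
```

In this sketch the first call passes the check at both steps, while the second fails because a strongly negative Q drives R + B' P B below zero after one backward step.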

Pages: 13

50 items in total

[1] Statistically Consistent Inverse Optimal Control for Linear-Quadratic Tracking with Random Time Horizon
Zhang, Han
Ringh, Axel
Jiang, Weihan
Li, Shaoyuan
Hu, Xiaoming
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 1515 - 1522
[2] Inverse linear-quadratic discrete-time finite-horizon optimal control for indistinguishable homogeneous agents: A convex optimization approach
Zhang, Han
Ringh, Axel
AUTOMATICA, 2023, 148
[3] Indefinite linear quadratic optimal control problem for uncertain random discrete-time systems
Chen, Xin
Cao, Jing
Jin, Ting
JOURNAL OF APPLIED MATHEMATICS AND COMPUTING, 2023, 69 (04) : 3533 - 3552
[4] Infinite horizon indefinite stochastic linear quadratic control for discrete-time systems
Zhang W.
Li Y.
Liu X.
Control Theory and Technology, 2015, 13 (03) : 230 - 237
[5] Infinite horizon indefinite stochastic linear quadratic control for discrete-time systems
Weihai ZHANG
Yan LI
Xikui LIU
Control Theory and Technology, 2015, 13 (03) : 230 - 237
[6] Inverse optimal control for discrete-time finite-horizon Linear Quadratic Regulators
Zhang, Han
Umenberger, Jack
Hu, Xiaoming
AUTOMATICA, 2019, 110
[7] Inverse Reinforcement Learning for Discrete-Time Linear Quadratic Systems
Yu, Meiling
Ni, Yuanhua
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1027 - 1032
[8] A discrete-time mean-field stochastic linear-quadratic optimal control problem with financial application
Li, Xun
Tai, Allen H.
Tian, Fei
INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (01) : 175 - 189
[9] Sequential Inverse Optimal Control of Discrete-Time Systems
Cao, Sheng
Luo, Zhiwei
Quan, Changqin
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (03) : 608 - 621
[10] A New Inverse Optimal Control Method for Discrete-time Systems
Almobaied, Moayed
Eksin, Ibrahim
Guzelkaya, Mujde
ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 275 - 280