Inverse optimal control;
Inverse reinforcement learning;
Indefinite linear quadratic regulator;
System identification;
Convex optimization;
Semidefinite programming;
Time-varying system matrices;
MODEL;
DOI:
10.1016/j.automatica.2024.111705
Chinese Library Classification number:
TP [Automation technology, computer technology];
Discipline classification code:
0812;
Abstract:
The Inverse Optimal Control (IOC) problem is a structured system identification problem that aims to identify the underlying objective function based on observed optimal trajectories. This provides a data-driven way to model experts' behavior. In this paper, we consider the case of discrete-time finite-horizon linear-quadratic problems where: the quadratic cost term in the objective is not necessarily positive semi-definite; the planning horizon is a random variable; we have both process noise and observation noise; the dynamics can have a drift term; and where we can have a linear cost term in the objective. In this setting, we first formulate the necessary and sufficient conditions for when the forward optimal control problem is solvable. Next, we show that the corresponding IOC problem is identifiable. Using the conditions for existence of an optimum of the forward problem, we then formulate an estimator for the parameters in the objective function of the forward problem as the globally optimal solution to a convex optimization problem, and prove that the estimator is statistically consistent. Finally, the performance of the algorithm is demonstrated on two numerical examples. (c) 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).