Learning for Control: An Inverse Optimization Approach

被引：0

作者：

Akhtar, Syed Adnan ^{[1
]}

Kolarijani, Arman Sharifi ^{[1
]}

Esfahani, Peyman Mohajerin ^{[1
]}

机构：

[1] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands

来源：

2021 AMERICAN CONTROL CONFERENCE (ACC) | 2021年

关键词：

SYSTEMS;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a learning method to learn the mapping from an input space to an action space, which is particularly suitable when the action is an optimal decision with respect to a certain unknown cost function. We use an inverse optimization approach to retrieve the cost function by introducing a new loss function and a new hypothesis class of mappings. A tractable convex reformulation of the learning problem is also presented. The method is effective for learning input-action mapping in continuous input-action space with input-output constraints, typically present in control systems. The learning approach can be effectively transformed to learn a Model Predictive Control (MPC) behaviour and a case study to mimic an MPC is presented, which is a rather computationally heavy control strategy. Simulation and experimental results show the effectiveness of the proposed approach.

引用

页码：2193 / 2198

页数：6

共 25 条

[1] Abbeel P., 2004, P 21 INT C MACH LEAR, P1, DOI [10.1145/1015330.1015430, DOI 10.1145/1015330.1015430]
[2] Akhtar S. A., 2020, LEARNING CONTROL INV
[3] Alessio A, 2009, LECT NOTES CONTR INF, V384, P345, DOI 10.1007/978-3-642-01094-1_29
[4] Learning movement sequences from demonstration
Amit, R
Mataric, M
[J]. 2ND INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, PROCEEDINGS, 2002, : 203 - 208
[5] Arora S., 2018, A survey of inverse reinforcement learning
[6] Data-driven estimation in equilibrium using inverse optimization
Bertsimas, Dimitris
Gupta, Vishal
Paschalidis, Ioannis Ch.
[J]. MATHEMATICAL PROGRAMMING, 2015, 153 (02) : 595 - 633
[7] Boularias A, 2011, P 14 INT C ART INT S, P182
[8] Reinforcement learning of motor skills using Policy Search and human corrective advice
Celemin, Carlos
Maeda, Guilherme
Ruiz-del-Solar, Javier
Peters, Jan
Kober, Jens
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (14) : 1560 - 1580
[9] Coates A., 2008, 25th International Conference on Machine Learning, P144
[10] Data-driven inverse optimization with imperfect information
Esfahani, Peyman Mohajerin
Shafieezadeh-Abadeh, Soroosh
Hanasusanto, Grani A.
Kuhn, Daniel
[J]. MATHEMATICAL PROGRAMMING, 2018, 167 (01) : 191 - 234

← 1 2 3 →