Learning for Control: An Inverse Optimization Approach

被引:0
作者
Akhtar, Syed Adnan [1 ]
Kolarijani, Arman Sharifi [1 ]
Esfahani, Peyman Mohajerin [1 ]
机构
[1] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
来源
2021 AMERICAN CONTROL CONFERENCE (ACC) | 2021年
关键词
SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a learning method to learn the mapping from an input space to an action space, which is particularly suitable when the action is an optimal decision with respect to a certain unknown cost function. We use an inverse optimization approach to retrieve the cost function by introducing a new loss function and a new hypothesis class of mappings. A tractable convex reformulation of the learning problem is also presented. The method is effective for learning input-action mapping in continuous input-action space with input-output constraints, typically present in control systems. The learning approach can be effectively transformed to learn a Model Predictive Control (MPC) behaviour and a case study to mimic an MPC is presented, which is a rather computationally heavy control strategy. Simulation and experimental results show the effectiveness of the proposed approach.
引用
收藏
页码:2193 / 2198
页数:6
相关论文
共 25 条
  • [1] Abbeel P., 2004, P 21 INT C MACH LEAR, P1, DOI [10.1145/1015330.1015430, DOI 10.1145/1015330.1015430]
  • [2] Akhtar S. A., 2020, LEARNING CONTROL INV
  • [3] Alessio A, 2009, LECT NOTES CONTR INF, V384, P345, DOI 10.1007/978-3-642-01094-1_29
  • [4] Learning movement sequences from demonstration
    Amit, R
    Mataric, M
    [J]. 2ND INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, PROCEEDINGS, 2002, : 203 - 208
  • [5] Arora S., 2018, A survey of inverse reinforcement learning
  • [6] Data-driven estimation in equilibrium using inverse optimization
    Bertsimas, Dimitris
    Gupta, Vishal
    Paschalidis, Ioannis Ch.
    [J]. MATHEMATICAL PROGRAMMING, 2015, 153 (02) : 595 - 633
  • [7] Boularias A, 2011, P 14 INT C ART INT S, P182
  • [8] Reinforcement learning of motor skills using Policy Search and human corrective advice
    Celemin, Carlos
    Maeda, Guilherme
    Ruiz-del-Solar, Javier
    Peters, Jan
    Kober, Jens
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (14) : 1560 - 1580
  • [9] Coates A., 2008, 25th International Conference on Machine Learning, P144
  • [10] Data-driven inverse optimization with imperfect information
    Esfahani, Peyman Mohajerin
    Shafieezadeh-Abadeh, Soroosh
    Hanasusanto, Grani A.
    Kuhn, Daniel
    [J]. MATHEMATICAL PROGRAMMING, 2018, 167 (01) : 191 - 234