Reinforcement Learning for MPC: Fundamentals and Current Challenges

被引:3
作者
Gros, Sebastien [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Cybernet, Oslo, Norway
来源
IFAC PAPERSONLINE | 2023年 / 56卷 / 02期
关键词
MPC; Reinforcement Learning; Learning for MPC; Stability & Safety;
D O I
10.1016/j.ifacol.2023.10.548
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent publications have laid a solid theoretical foundation for the combination of Reinforcement Learning and Model Predictive Control, in view of obtaining high-performance data-driven MPC policies. Early practical results, both in simulation and in experiments, have shown the potential of this combination but have also revealed certain challenges. In addition, the technical complexity of these results makes it difficult for interested readers to gather the fundamental ideas and principles behind this combination. This paper aims to provide a coherent and more accessible picture of these results and to offer significantly deeper and more mature insights into their meaning than has been proposed before. It also aims at identifying the current challenges in the field. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页码:5773 / 5780
页数:8
相关论文
共 25 条
  • [1] Anand A.S., 2022, A painless deterministic policy gradient method for MPC
  • [2] On Average Performance and Stability of Economic Model Predictive Control
    Angeli, David
    Amrit, Rishi
    Rawlings, James B.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (07) : 1615 - 1626
  • [3] Cheng R, 2019, AAAI CONF ARTIF INTE, P3387
  • [4] Gros S, 2019, Arxiv, DOI arXiv:1906.04034
  • [5] Learning for MPC with stability & safety guarantees
    Gros, Sebastien
    Zanon, Mario
    [J]. AUTOMATICA, 2022, 146
  • [6] Economic MPC of Markov Decision Processes: Dissipativity in undiscounted infinite-horizon optimal control
    Gros, Sebastien
    Zanon, Mario
    [J]. AUTOMATICA, 2022, 146
  • [7] Reinforcement Learning for mixed-integer problems based on MPC
    Gros, Sebastien
    Zanon, Mario
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 5219 - 5224
  • [8] Data-Driven Economic NMPC Using Reinforcement Learning
    Gros, Sebastien
    Zanon, Mario
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
  • [9] Houska B, 2019, Handbook of Model Predictive Control. Control Engineering, P413, DOI [10.1007/978-3-319-77489-318, DOI 10.1007/978-3-319-77489-318]
  • [10] Kordabad AB, 2023, Arxiv, DOI arXiv:2210.04302