Reinforcement Learning for MPC: Fundamentals and Current Challenges

被引：3

作者：

Gros, Sebastien ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol NTNU, Dept Cybernet, Oslo, Norway

来源：

IFAC PAPERSONLINE | 2023年 / 56卷 / 02期

关键词：

MPC; Reinforcement Learning; Learning for MPC; Stability & Safety;

D O I：

10.1016/j.ifacol.2023.10.548

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent publications have laid a solid theoretical foundation for the combination of Reinforcement Learning and Model Predictive Control, in view of obtaining high-performance data-driven MPC policies. Early practical results, both in simulation and in experiments, have shown the potential of this combination but have also revealed certain challenges. In addition, the technical complexity of these results makes it difficult for interested readers to gather the fundamental ideas and principles behind this combination. This paper aims to provide a coherent and more accessible picture of these results and to offer significantly deeper and more mature insights into their meaning than has been proposed before. It also aims at identifying the current challenges in the field. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)

引用

页码：5773 / 5780

页数：8

共 25 条

[1] Anand A.S., 2022, A painless deterministic policy gradient method for MPC
[2] On Average Performance and Stability of Economic Model Predictive Control
Angeli, David
Amrit, Rishi
Rawlings, James B.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (07) : 1615 - 1626
[3] Cheng R, 2019, AAAI CONF ARTIF INTE, P3387
[4] Gros S, 2019, Arxiv, DOI arXiv:1906.04034
[5] Learning for MPC with stability & safety guarantees
Gros, Sebastien
Zanon, Mario
[J]. AUTOMATICA, 2022, 146
[6] Economic MPC of Markov Decision Processes: Dissipativity in undiscounted infinite-horizon optimal control
Gros, Sebastien
Zanon, Mario
[J]. AUTOMATICA, 2022, 146
[7] Reinforcement Learning for mixed-integer problems based on MPC
Gros, Sebastien
Zanon, Mario
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 5219 - 5224
[8] Data-Driven Economic NMPC Using Reinforcement Learning
Gros, Sebastien
Zanon, Mario
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
[9] Houska B, 2019, Handbook of Model Predictive Control. Control Engineering, P413, DOI [10.1007/978-3-319-77489-318, DOI 10.1007/978-3-319-77489-318]
[10] Kordabad AB, 2023, Arxiv, DOI arXiv:2210.04302

← 1 2 3 →