Model-free Data-driven Predictive Control Using Reinforcement Learning

被引：1

作者：

Sawant, Shambhuraj ^{[1
]}

Reinhardt, Dirk ^{[1
]}

Kordabad, Arash Bahari ^{[1
]}

Gros, Sebastien ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, Trondheim, Norway

来源：

2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年

关键词：

MPC;

D O I：

10.1109/CDC49753.2023.10383431

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a novel approach for Predictive Control utilizing Reinforcement Learning (RL) and DataDriven techniques to derive optimal control policies for real systems. Using pure input-output multi-step predictors based on Subspace Identification and RL techniques, the resulting predictive control scheme can approximate the optimal control policy of a system with high accuracy, even if the predictor cannot accurately capture the true system dynamics. One of the key contributions of the proposed approach is the extension of the framework connecting Model Predictive Control (MPC) and RL to one that does not require explicit state-space models, nor to define a notion of state at all. The paper demonstrates the efficacy of the proposed approach through an illustrative example, highlighting the ability of our approach to provide an optimal control policy for a real system without requiring any prior knowledge about its internal dynamics.

引用

页码：4046 / 4052

页数：7

共 21 条

[1] Büskens C, 2001, ONLINE OPTIMIZATION OF LARGE SCALE SYSTEMS, P3
[2] MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles
Cai, Wenqi
Kordabad, Arash B.
Esfahani, Hossein N.
Lekkas, Anastasios M.
Gros, Sebastien
[J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2990 - 2995
[3] Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics
Esfahani, Hossein Nejatbakhsh
Kordabad, Arash Bahari
Gros, Sebastien
[J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 2121 - 2126
[4] Favoreel W., 1999, Proceedings of the 14th World Congress. International Federation of Automatic Control, P235
[5] Reinforcement Learning for mixed-integer problems based on MPC
Gros, Sebastien
Zanon, Mario
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 5219 - 5224
[6] Data-Driven Economic NMPC Using Reinforcement Learning
Gros, Sebastien
Zanon, Mario
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
[7] Learning-Based Model Predictive Control: Toward Safe Learning in Control
Hewing, Lukas
Wabersich, Kim P.
Menner, Marcel
Zeilinger, Melanie N.
[J]. ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
[8] Efficient Representation and Approximation of Model Predictive Control Laws via Deep Learning
Karg, Benjamin
Lucia, Sergio
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3866 - 3878
[9] Safe Reinforcement Learning Using Wasserstein Distributionally Robust MPC and Chance Constraint
Kordabad, Arash Bahari
Wisniewski, Rafael
Gros, Sebastien
[J]. IEEE ACCESS, 2022, 10 : 130058 - 130067
[10] Kordabad AB, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P2573, DOI 10.23919/ECC54610.2021.9654852

← 1 2 3 →