Model-free Data-driven Predictive Control Using Reinforcement Learning

被引:1
作者
Sawant, Shambhuraj [1 ]
Reinhardt, Dirk [1 ]
Kordabad, Arash Bahari [1 ]
Gros, Sebastien [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, Trondheim, Norway
来源
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年
关键词
MPC;
D O I
10.1109/CDC49753.2023.10383431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a novel approach for Predictive Control utilizing Reinforcement Learning (RL) and DataDriven techniques to derive optimal control policies for real systems. Using pure input-output multi-step predictors based on Subspace Identification and RL techniques, the resulting predictive control scheme can approximate the optimal control policy of a system with high accuracy, even if the predictor cannot accurately capture the true system dynamics. One of the key contributions of the proposed approach is the extension of the framework connecting Model Predictive Control (MPC) and RL to one that does not require explicit state-space models, nor to define a notion of state at all. The paper demonstrates the efficacy of the proposed approach through an illustrative example, highlighting the ability of our approach to provide an optimal control policy for a real system without requiring any prior knowledge about its internal dynamics.
引用
收藏
页码:4046 / 4052
页数:7
相关论文
共 21 条
  • [1] Büskens C, 2001, ONLINE OPTIMIZATION OF LARGE SCALE SYSTEMS, P3
  • [2] MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles
    Cai, Wenqi
    Kordabad, Arash B.
    Esfahani, Hossein N.
    Lekkas, Anastasios M.
    Gros, Sebastien
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2990 - 2995
  • [3] Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics
    Esfahani, Hossein Nejatbakhsh
    Kordabad, Arash Bahari
    Gros, Sebastien
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 2121 - 2126
  • [4] Favoreel W., 1999, Proceedings of the 14th World Congress. International Federation of Automatic Control, P235
  • [5] Reinforcement Learning for mixed-integer problems based on MPC
    Gros, Sebastien
    Zanon, Mario
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 5219 - 5224
  • [6] Data-Driven Economic NMPC Using Reinforcement Learning
    Gros, Sebastien
    Zanon, Mario
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (02) : 636 - 648
  • [7] Learning-Based Model Predictive Control: Toward Safe Learning in Control
    Hewing, Lukas
    Wabersich, Kim P.
    Menner, Marcel
    Zeilinger, Melanie N.
    [J]. ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
  • [8] Efficient Representation and Approximation of Model Predictive Control Laws via Deep Learning
    Karg, Benjamin
    Lucia, Sergio
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3866 - 3878
  • [9] Safe Reinforcement Learning Using Wasserstein Distributionally Robust MPC and Chance Constraint
    Kordabad, Arash Bahari
    Wisniewski, Rafael
    Gros, Sebastien
    [J]. IEEE ACCESS, 2022, 10 : 130058 - 130067
  • [10] Kordabad AB, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P2573, DOI 10.23919/ECC54610.2021.9654852