Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

被引:0
作者
Kamthe, Sanket [1 ]
Deisenroth, Marc Peter [1 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84 | 2018年 / 84卷
关键词
STABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach does not only achieve state-of-the-art data efficiency, but also is a principled way for RL in constrained environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Data Driven Economic Model Predictive Control
    Kheradmandi, Masoud
    Mhaskar, Prashant
    MATHEMATICS, 2018, 6 (04):
  • [22] Learning-Based Model Predictive Control: Toward Safe Learning in Control
    Hewing, Lukas
    Wabersich, Kim P.
    Menner, Marcel
    Zeilinger, Melanie N.
    ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
  • [23] An Efficient and Stabilizing Model Predictive Control of Switched Systems
    Hariprasad, K.
    Bhartiya, Sharad
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (07) : 3401 - 3407
  • [24] Learning Lyapunov terminal costs from data for complexity reduction in nonlinear model predictive control
    Abdufattokhov, Shokhjakhon
    Zanon, Mario
    Bemporad, Alberto
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8676 - 8691
  • [25] Robust Model Predictive Control Using Iterative Learning
    HosseinNia, S. Hassan
    2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 3514 - 3519
  • [26] A probabilistic validation approach for penalty function design in Stochastic Model Predictive Control
    Mammarella, Martina
    Alamo, Teodoro
    Lucia, Sergio
    Dabbene, Fabrizio
    IFAC PAPERSONLINE, 2020, 53 (02): : 11271 - 11276
  • [27] Distributed Learning Model Predictive Control for Linear Systems
    Sturz, Yvonne R.
    Zhu, Edward L.
    Rosolia, Ugo
    Johansson, Karl H.
    Borrelli, Francesco
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 4366 - 4373
  • [28] Efficient Nonlinear Model Predictive Control for Discrete System with Disturbances
    Chacko, Keerthi
    Janardhanan, S.
    Kar, Indra Narayan
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 2032 - 2037
  • [29] Reinforcement Learning for Multi-Agent Systems with an Application to Distributed Predictive Cruise Control
    Mynuddin, Mohammed
    Gao, Weinan
    Jiang, Zhong-Ping
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 315 - 320
  • [30] Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems
    Drgona, Jan
    Kis, Karol
    Tuor, Aaron
    Vrabie, Draguna
    Klauco, Martin
    JOURNAL OF PROCESS CONTROL, 2022, 116 : 80 - 92