Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

被引：0

作者：

Kamthe, Sanket ^{[1
]}

Deisenroth, Marc Peter ^{[1
]}

机构：

[1] Imperial Coll London, Dept Comp, London, England

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84 | 2018年 / 84卷

关键词：

STABILITY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach does not only achieve state-of-the-art data efficiency, but also is a principled way for RL in constrained environments.

引用

页数：10

共 50 条

[21] Data Driven Economic Model Predictive Control
Kheradmandi, Masoud
Mhaskar, Prashant
MATHEMATICS, 2018, 6 (04):
[22] Learning-Based Model Predictive Control: Toward Safe Learning in Control
Hewing, Lukas
Wabersich, Kim P.
Menner, Marcel
Zeilinger, Melanie N.
ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296
[23] An Efficient and Stabilizing Model Predictive Control of Switched Systems
Hariprasad, K.
Bhartiya, Sharad
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (07) : 3401 - 3407
[24] Learning Lyapunov terminal costs from data for complexity reduction in nonlinear model predictive control
Abdufattokhov, Shokhjakhon
Zanon, Mario
Bemporad, Alberto
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8676 - 8691
[25] Robust Model Predictive Control Using Iterative Learning
HosseinNia, S. Hassan
2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 3514 - 3519
[26] A probabilistic validation approach for penalty function design in Stochastic Model Predictive Control
Mammarella, Martina
Alamo, Teodoro
Lucia, Sergio
Dabbene, Fabrizio
IFAC PAPERSONLINE, 2020, 53 (02): : 11271 - 11276
[27] Distributed Learning Model Predictive Control for Linear Systems
Sturz, Yvonne R.
Zhu, Edward L.
Rosolia, Ugo
Johansson, Karl H.
Borrelli, Francesco
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 4366 - 4373
[28] Efficient Nonlinear Model Predictive Control for Discrete System with Disturbances
Chacko, Keerthi
Janardhanan, S.
Kar, Indra Narayan
2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 2032 - 2037
[29] Reinforcement Learning for Multi-Agent Systems with an Application to Distributed Predictive Cruise Control
Mynuddin, Mohammed
Gao, Weinan
Jiang, Zhong-Ping
2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 315 - 320
[30] Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems
Drgona, Jan
Kis, Karol
Tuor, Aaron
Vrabie, Draguna
Klauco, Martin
JOURNAL OF PROCESS CONTROL, 2022, 116 : 80 - 92

← 1 2 3 4 5 →