Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

被引：0

作者：

Kamthe, Sanket ^{[1
]}

Deisenroth, Marc Peter ^{[1
]}

机构：

[1] Imperial Coll London, Dept Comp, London, England

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84 | 2018年 / 84卷

关键词：

STABILITY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach does not only achieve state-of-the-art data efficiency, but also is a principled way for RL in constrained environments.

引用

页数：10

共 50 条

[31] Stochastic self-triggered model predictive control for linear systems with probabilistic constraints
Dai, Li
Gao, Yulong
Xie, Lihua
Johansson, Karl Henrik
Xia, Yuanqing
[J]. AUTOMATICA, 2018, 92 : 9 - 17
[32] Recursive Feasibility of Stochastic Model Predictive Control with Mission-Wide Probabilistic Constraints
Wang, Kai
Gros, Sebastien
[J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2312 - 2317
[33] Nontracking type iterative learning control based on economic model predictive control
Long, Yushen
Xie, Lihua
Liu, Shuai
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (18) : 8564 - 8582
[34] Data-Efficient Static Cost Optimization via Extremum-Seeking Control with Kernel-Based Function Approximation
Weekers, Wouter
Saccon, Alessandro
van de Wouw, Nathan
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6761 - 6767
[35] A Computationally Efficient System Level Parametrization for Robust Model Predictive Control
Herold, Thilo
Berkel, Felix
Trachte, Adrian
Specker, Thomas
[J]. 2022 EUROPEAN CONTROL CONFERENCE (ECC), 2022, : 826 - 832
[36] Efficient Load Management in Electric Ships: A Model Predictive Control Approach
Zohrabi, Nasibeh
Zakeri, Hasan
Abdelwahed, Sherif
[J]. THIRTY-FOURTH ANNUAL IEEE APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITION (APEC 2019), 2019, : 3000 - 3006
[37] A stabilizing model predictive control for networked control system with data packet dropout
Zhijun LI
[J]. Journal of Control Theory and Applications, 2009, 7 (03) : 281 - 284
[38] A stabilizing model predictive control for networked control system with data packet dropout
Li Z.
Sun D.
Shi Y.
Wang L.
[J]. Journal of Control Theory and Applications, 2009, 7 (03): : 281 - 284
[39] Data-Driven Model Predictive Control for Redundant Manipulators With Unknown Model
Yan, Jingkun
Jin, Long
Hu, Bin
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (10) : 5901 - 5911
[40] Efficient data-driven predictive control of nonlinear systems: A review and perspectives
Li, Xiaojie
Yan, Mingxue
Zhang, Xuewen
Han, Minghao
Law, Adrian Wing-Keung
Yin, Xunyuan
[J]. DIGITAL CHEMICAL ENGINEERING, 2025, 14

← 1 2 3 4 5 →