Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

被引:0
作者
Kamthe, Sanket [1 ]
Deisenroth, Marc Peter [1 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84 | 2018年 / 84卷
关键词
STABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach does not only achieve state-of-the-art data efficiency, but also is a principled way for RL in constrained environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Stochastic self-triggered model predictive control for linear systems with probabilistic constraints
    Dai, Li
    Gao, Yulong
    Xie, Lihua
    Johansson, Karl Henrik
    Xia, Yuanqing
    [J]. AUTOMATICA, 2018, 92 : 9 - 17
  • [32] Recursive Feasibility of Stochastic Model Predictive Control with Mission-Wide Probabilistic Constraints
    Wang, Kai
    Gros, Sebastien
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2312 - 2317
  • [33] Nontracking type iterative learning control based on economic model predictive control
    Long, Yushen
    Xie, Lihua
    Liu, Shuai
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (18) : 8564 - 8582
  • [34] Data-Efficient Static Cost Optimization via Extremum-Seeking Control with Kernel-Based Function Approximation
    Weekers, Wouter
    Saccon, Alessandro
    van de Wouw, Nathan
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6761 - 6767
  • [35] A Computationally Efficient System Level Parametrization for Robust Model Predictive Control
    Herold, Thilo
    Berkel, Felix
    Trachte, Adrian
    Specker, Thomas
    [J]. 2022 EUROPEAN CONTROL CONFERENCE (ECC), 2022, : 826 - 832
  • [36] Efficient Load Management in Electric Ships: A Model Predictive Control Approach
    Zohrabi, Nasibeh
    Zakeri, Hasan
    Abdelwahed, Sherif
    [J]. THIRTY-FOURTH ANNUAL IEEE APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITION (APEC 2019), 2019, : 3000 - 3006
  • [38] A stabilizing model predictive control for networked control system with data packet dropout
    Li Z.
    Sun D.
    Shi Y.
    Wang L.
    [J]. Journal of Control Theory and Applications, 2009, 7 (03): : 281 - 284
  • [39] Data-Driven Model Predictive Control for Redundant Manipulators With Unknown Model
    Yan, Jingkun
    Jin, Long
    Hu, Bin
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (10) : 5901 - 5911
  • [40] Efficient data-driven predictive control of nonlinear systems: A review and perspectives
    Li, Xiaojie
    Yan, Mingxue
    Zhang, Xuewen
    Han, Minghao
    Law, Adrian Wing-Keung
    Yin, Xunyuan
    [J]. DIGITAL CHEMICAL ENGINEERING, 2025, 14