Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

Cited by: 0

Authors:

Kamthe, Sanket [1]

Deisenroth, Marc Peter [1]

Affiliations:

[1] Imperial College London, Department of Computing, London, England

Source:

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84 | 2018 / Vol. 84

Keywords:

STABILITY;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms require a large number of interactions with the environment. A large number of interactions may be impractical in many real-world applications, such as robotics, and many practical systems have to obey limitations in the form of state-space or control constraints. To reduce the number of system interactions while simultaneously handling constraints, we propose a model-based RL framework based on probabilistic Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainty into long-term predictions, thereby reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. We demonstrate that our approach not only achieves state-of-the-art data efficiency, but is also a principled way for RL in constrained environments.
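The loop described in the abstract (learn a GP transition model, then use MPC to pick controls that minimise an expected cost under control constraints) can be illustrated with a toy sketch. This is not the authors' implementation. The 1-D system `true_step`, the training grid, the quadratic cost, the discrete control set `CONTROLS` and every function name are assumptions made for illustration. In place of the paper's deterministic approximate inference, the sketch simply adds per-step predictive variances and ignores input uncertainty, and it replaces the optimiser with exhaustive search over a few bounded controls.

```python
import itertools
import math

def rbf(a, b, length=1.0):
    """Squared-exponential kernel with unit signal variance."""
    return math.exp(-0.5 * sum((p - q) ** 2 for p, q in zip(a, b)) / length ** 2)

def cholesky(A):
    """Lower-triangular L with L L^T = A, for symmetric positive definite A."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def chol_solve(L, b):
    """Solve (L L^T) x = b by forward, then backward substitution."""
    n = len(b)
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

class GP:
    """Zero-mean GP regression on (state, control) -> state increment."""
    def __init__(self, X, y, noise=1e-4):
        self.X = X
        K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(X)]
             for i, a in enumerate(X)]
        self.L = cholesky(K)
        self.alpha = chol_solve(self.L, y)

    def predict(self, x):
        """Posterior mean and variance of the latent function at input x."""
        k = [rbf(x, xi) for xi in self.X]
        mean = sum(ki * ai for ki, ai in zip(k, self.alpha))
        v = chol_solve(self.L, k)
        var = max(rbf(x, x) - sum(ki * vi for ki, vi in zip(k, v)), 0.0)
        return mean, var

def true_step(x, u):
    """Hypothetical system, unknown to the controller."""
    return x + 0.5 * u

def fit_transition_gp():
    """Fit the GP on a small, assumed grid of observed transitions."""
    X = [(x, u) for x in (-2.0, -1.0, 0.0, 1.0, 2.0)
         for u in (-1.0, -0.5, 0.0, 0.5, 1.0)]
    y = [true_step(x, u) - x for x, u in X]
    return GP(X, y)

CONTROLS = (-1.0, -0.5, 0.0, 0.5, 1.0)  # control constraint |u| <= 1 (assumed)

def expected_cost(gp, x0, controls, target):
    """Sum of E[(x_t - target)^2] = (mean_t - target)^2 + var_t over the horizon.
    Simplification: variances are added per step; input uncertainty is ignored."""
    x, var, cost = x0, 0.0, 0.0
    for u in controls:
        d_mean, d_var = gp.predict((x, u))
        x, var = x + d_mean, var + d_var
        cost += (x - target) ** 2 + var
    return cost

def mpc_action(gp, x0, target, horizon=3):
    """Return the first control of the cheapest admissible control sequence."""
    best = min(itertools.product(CONTROLS, repeat=horizon),
               key=lambda seq: expected_cost(gp, x0, seq, target))
    return best[0]

def run_closed_loop(gp, x0, target, steps=8):
    """Apply MPC in receding-horizon fashion on the true system."""
    traj = [x0]
    for _ in range(steps):
        traj.append(true_step(traj[-1], mpc_action(gp, traj[-1], target)))
    return traj
```

Calling `run_closed_loop(fit_transition_gp(), x0=0.0, target=1.0)` returns the visited states. Because only the first control of each plan is applied before replanning, the controller keeps correcting for errors in the learned model.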


Pages: 10

50 records in total

[1] Data-Efficient Reinforcement Learning for Malaria Control
Zou, Lixin
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
[2] Model-Based Reinforcement Learning With Probabilistic Ensemble Terminal Critics for Data-Efficient Control Applications
Park, Jonghyeok
Jeon, Soo
Han, Soohee
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (08) : 9470 - 9479
[3] DATA-EFFICIENT MODEL-BASED REINFORCEMENT LEARNING FOR ROBOT CONTROL
Sun, Ming
Gao, Yue
Liu, Wei
Li, Shaoyuan
INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (04) : 211 - 218
[4] Data-Efficient Reinforcement Learning for Variable Impedance Control
Anand, Akhil S.
Kaushik, Rituraj
Gravdahl, Jan Tommy
Abu-Dakka, Fares J.
IEEE ACCESS, 2024, 12 : 15631 - 15641
[5] Data-Efficient Task Generalization via Probabilistic Model-Based Meta Reinforcement Learning
Bhardwaj, Arjun
Rothfuss, Jonas
Sukhija, Bhavya
As, Yarden
Hutter, Marco
Coros, Stelian
Krause, Andreas
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3918 - 3925
[6] Data-Efficient Hierarchical Reinforcement Learning
Nachum, Ofir
Gu, Shixiang
Lee, Honglak
Levine, Sergey
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[7] Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Frauenknecht, Bernd
Ehlgen, Tobias
Trimpe, Sebastian
2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 894 - 901
[8] A Safe and Data-Efficient Model-Based Reinforcement Learning System for HVAC Control
Ding, Xianzhong
An, Zhiyu
Rathee, Arya
Du, Wan
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (07) : 8014 - 8032
[9] Data Based Optimal Control with Neural Networks and Data-Efficient Reinforcement Learning
Runkler, Thomas A.
Udluft, Steffen
Duell, Siegmund
AT-AUTOMATISIERUNGSTECHNIK, 2012, 60 (10) : 641 - 647
[10] Pretraining Representations for Data-Efficient Reinforcement Learning
Schwarzer, Max
Rajkumar, Nitarshan
Noukhovitch, Michael
Anand, Ankesh
Charlin, Laurent
Hjelm, Devon
Bachman, Philip
Courville, Aaron
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
