Asynchronous Methods for Model-Based Reinforcement Learning

Cited by: 0
Authors
Zhang, Yunzhi [1]
Clavera, Ignasi [1]
Tsai, Boren [1]
Abbeel, Pieter [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
Source
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019
Keywords
Reinforcement Learning; Model-Based; Asynchronous Learning
DOI
Not available
CLC Number (Chinese Library Classification)
TP39 [Computer Applications]
Discipline Codes
081203; 0835
Abstract
Significant progress has been made in model-based reinforcement learning: state-of-the-art algorithms now match the asymptotic performance of model-free methods while being significantly more data efficient. However, this success has come at a price: state-of-the-art model-based methods require substantial computation interleaved with data collection, resulting in run times of days, even when the agent interaction itself amounts to only hours or minutes. For the goal of learning in real time on real robots, these state-of-the-art model-based algorithms therefore remain impractical. In this work, we propose an asynchronous framework for model-based reinforcement learning that brings the run time of these algorithms down to the data collection time alone. We evaluate our asynchronous framework on a range of standard MuJoCo benchmarks and on three real-world robotic manipulation tasks. We show that asynchronous learning not only speeds up learning with respect to wall-clock time through parallelization, but also further reduces the sample complexity of model-based approaches by improving exploration and by preventing the policy from overfitting to the deficiencies of the learned dynamics models.
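The framework the abstract describes amounts to decoupling data collection, model learning, and policy improvement so that the slower learning steps no longer block the robot. Below is a minimal sketch of that decomposition, assuming three threads that communicate through shared, lock-protected state; the toy environment, dynamics "model", and update rules are illustrative placeholders, not the paper's implementation.

    import random
    import threading
    import time

    # Shared state guarded by one lock; "stop" shuts all workers down.
    stop = threading.Event()
    lock = threading.Lock()
    replay_buffer = []      # transitions gathered by the collector
    model_params = [0.0]    # stand-in for learned dynamics-model weights
    policy_params = [0.0]   # stand-in for policy weights

    def collect_data():
        """Roll out the current policy; never blocks on learning."""
        while not stop.is_set():
            with lock:
                theta = policy_params[0]
            transition = (random.random(), theta)  # toy (state, action) pair
            with lock:
                replay_buffer.append(transition)
            time.sleep(0.01)  # stands in for real-robot interaction time

    def learn_model():
        """Continuously refit the dynamics model to the latest data."""
        while not stop.is_set():
            with lock:
                data = list(replay_buffer)
            if data:
                fit = sum(s for s, _ in data) / len(data)  # toy "fit"
                with lock:
                    model_params[0] = fit
            time.sleep(0.05)  # stands in for a model-training step

    def improve_policy():
        """Optimize the policy against the current learned model."""
        while not stop.is_set():
            with lock:
                policy_params[0] = 0.9 * policy_params[0] + 0.1 * model_params[0]
            time.sleep(0.02)  # stands in for model-based policy optimization

    if __name__ == "__main__":
        workers = [threading.Thread(target=f)
                   for f in (collect_data, learn_model, improve_policy)]
        for w in workers:
            w.start()
        time.sleep(1.0)  # wall-clock budget ~= data collection time
        stop.set()
        for w in workers:
            w.join()
        print(f"collected {len(replay_buffer)} transitions; "
              f"policy = {policy_params[0]:.3f}")

Because the collector always reads the latest policy while the learners consume the latest data, the wall-clock time is bounded by the interaction loop rather than by the sum of the three loops run sequentially; this is the property the abstract credits for reducing run time to data collection time.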
Pages: 10
Related Papers
50 in total (entries [41] through [50] shown)
  • [41] Weighted model estimation for offline model-based reinforcement learning
    Hishinuma, Toru
    Senda, Kei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021
  • [42] Latent Causal Dynamics Model for Model-Based Reinforcement Learning
    Hao, Zhifeng
    Zhu, Haipeng
    Chen, Wei
    Cai, Ruichu
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 219 - 230
  • [43] Model-based reinforcement learning with model error and its application
    Tajima, Yoshiyuki
    Onisawa, Takehisa
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 1333 - 1336
  • [44] Model-based reinforcement learning: a computational model and an fMRI study
    Yoshida, W
    Ishii, S
    NEUROCOMPUTING, 2005, 63 : 253 - 269
  • [45] Reinforcement Twinning: From digital twins to model-based reinforcement learning
    Schena, Lorenzo
    Marques, Pedro A.
    Poletti, Romain
    Van den Berghe, Jan
    Mendez, Miguel A.
    JOURNAL OF COMPUTATIONAL SCIENCE, 2024, 82
  • [46] Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal
    Agarwal, Alekh
    Kakade, Sham
    Yang, Lin F.
    CONFERENCE ON LEARNING THEORY, VOL 125, 2020, 125
  • [47] Model-based reinforcement learning under concurrent schedules of reinforcement in rodents
    Huh, Namjung
    Jo, Suhyun
    Kim, Hoseok
    Sul, Jung Hoon
    Jung, Min Whan
    LEARNING & MEMORY, 2009, 16 (05) : 315 - 323
  • [48] Reward Shaping for Model-Based Bayesian Reinforcement Learning
    Kim, Hyeoneun
    Lim, Woosang
    Lee, Kanghoon
    Noh, Yung-Kyun
    Kim, Kee-Eung
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3548 - 3555
  • [49] On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning
    Zhang, Baohe
    Rajan, Raghu
    Pineda, Luis
    Lambert, Nathan
    Biedenkapp, Andre
    Chua, Kurtland
    Hutter, Frank
    Calandra, Roberto
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [50] Model-based Adversarial Meta-Reinforcement Learning
    Lin, Zichuan
    Thomas, Garrett
    Yang, Guangwen
    Ma, Tengyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33