Model-based Lifelong Reinforcement Learning with Bayesian Exploration

Cited by: 0

Authors:

Fu, Haotian [1]

Yu, Shangqun [1]

Littman, Michael [1]

Konidaris, George [1]

Affiliations:

[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022

Keywords:

ENTROPY;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a model-based lifelong reinforcement-learning approach that estimates a hierarchical Bayesian posterior distilling the common structure shared across different tasks. The learned posterior combined with a sample-based Bayesian exploration procedure increases the sample efficiency of learning across a family of related tasks. We first derive an analysis of the relationship between the sample complexity and the initialization quality of the posterior in the finite MDP setting. We next scale the approach to continuous-state domains by introducing a Variational Bayesian Lifelong Reinforcement Learning algorithm that can be combined with recent model-based deep RL methods, and that exhibits backward transfer. Experimental results on several challenging domains show that our algorithms achieve both better forward and backward transfer performance than state-of-the-art lifelong RL methods.

Pages: 14

50 items in total

[1] Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning
Wu, Chenyang
Li, Tianci
Zhang, Zongzhang
Yu, Yang
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
[2] Model-based Bayesian Reinforcement Learning for Dialogue Management
Lison, Pierre
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 475 - 479
[3] Smarter Sampling in Model-Based Bayesian Reinforcement Learning
Castro, Pablo Samuel
Precup, Doina
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 200 - 214
[4] A Model-based Factored Bayesian Reinforcement Learning Approach
Wu, Bo
Feng, Yanpeng
Zheng, Hongyan
APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1092 - 1095
[5] Reward Shaping for Model-Based Bayesian Reinforcement Learning
Kim, Hyeoneun
Lim, Woosang
Lee, Kanghoon
Noh, Yung-Kyun
Kim, Kee-Eung
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3548 - 3555
[6] Exploration in Relational Domains for Model-based Reinforcement Learning
Lang, Tobias
Toussaint, Marc
Kersting, Kristian
JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 3725 - 3768
[7] Variational Inference MPC for Bayesian Model-based Reinforcement Learning
Okada, Masashi
Taniguchi, Tadahiro
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[8] Bayesian Model-Based Offline Reinforcement Learning for Product Allocation
Jenkins, Porter
Wei, Hua
Jenkins, J. Stockton
Li, Zhenhui
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12531 - 12537
[9] Robust and Explorative Behavior in Model-based Bayesian Reinforcement Learning
Hishinuma, Toru
Senda, Kei
PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016
[10] Cloud Reasoning Model-based Exploration for Deep Reinforcement Learning
Li Chenxi
Cao Lei
Chen Xiliang
Zhang Yongliang
Xu Zhixiong
Peng Hui
Duan Liwen
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (01) : 244 - 248