Ensemble algorithms in reinforcement learning

Cited by: 112

Authors:

Wiering, Marco A. [1]

van Hasselt, Hado [2]

Affiliations:

[1] Univ Groningen, Dept Artificial Intelligence, NL-9400 AK Groningen, Netherlands

[2] Univ Utrecht, Dept Informat & Comp Sci, Intelligent Syst Grp, NL-3508 TB Utrecht, Netherlands

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2008 / Vol. 38 / No. 4

Keywords:

dynamic mazes; ensemble algorithms; partially observable environments; reinforcement learning (RL)

DOI:

10.1109/TSMCB.2008.920231

Chinese Library Classification (CLC):

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and final performance by combining the chosen actions or action probabilities of different RL algorithms. We designed and implemented four different ensemble methods combining the following five different RL algorithms: Q-learning, Sarsa, actor-critic (AC), QV-learning, and AC learning automaton. The intuitively designed ensemble methods, namely, majority voting (MV), rank voting, Boltzmann multiplication (BM), and Boltzmann addition, combine the policies derived from the value functions of the different RL algorithms, in contrast to previous work where ensemble methods have been used in RL for representing and learning a single value function. We show experiments on five maze problems of varying complexity; the first problem is simple, but the other four maze tasks are of a dynamic or partially observable nature. The results indicate that the BM and MV ensembles significantly outperform the single RL algorithms.
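The abstract names the ensemble methods without giving their formulas, so the following is only a minimal sketch of two of them under stated assumptions, not the authors' implementation. It assumes each RL algorithm exposes a list of action probabilities for the current state, that majority voting counts each algorithm's most probable action as its vote, and that Boltzmann multiplication multiplies the per-action probabilities and renormalizes. The function names and the example numbers are hypothetical.

```python
from math import prod


def majority_voting(policies):
    """Count, per action, how many algorithms rank it as their top action.

    Assumption: an algorithm's vote is its most probable action.
    """
    n_actions = len(policies[0])
    votes = [0] * n_actions
    for probs in policies:
        best = max(range(n_actions), key=lambda a: probs[a])
        votes[best] += 1
    return votes


def boltzmann_multiplication(policies):
    """Multiply the action probabilities of all algorithms and renormalize.

    Assumption: the combined preference for an action is the product of
    the probabilities that the individual algorithms assign to it.
    """
    n_actions = len(policies[0])
    products = [prod(p[a] for p in policies) for a in range(n_actions)]
    total = sum(products)
    if total == 0:
        # Every action was vetoed by some algorithm; fall back to uniform.
        return [1.0 / n_actions] * n_actions
    return [x / total for x in products]


# Hypothetical action probabilities from three algorithms over three actions.
policies = [
    [0.6, 0.3, 0.1],
    [0.5, 0.4, 0.1],
    [0.2, 0.7, 0.1],
]

votes = majority_voting(policies)
combined = boltzmann_multiplication(policies)
print(votes)
print([round(p, 3) for p in combined])
```

In this made-up example the two schemes disagree: two of the three algorithms vote for the first action, while the product of probabilities favors the second action because one algorithm rates the first action low. This illustrates how combining chosen actions can differ from combining action probabilities.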


Pages: 930 - 936

Page count: 7
