Ensemble algorithms in reinforcement learning

被引:112
|
作者
Wiering, Marco A. [1 ]
van Hasselt, Hado [2 ]
机构
[1] Univ Groningen, Dept Artificial Intelligence, NL-9400 AK Groningen, Netherlands
[2] Univ Utrecht, Dept Informat & Comp Sci, Intelligent Syst Grp, NL-3508 TB Utrecht, Netherlands
关键词
dynamic mazes; ensemble algorithms; partially observable environments; reinforcement learning (RL);
D O I
10.1109/TSMCB.2008.920231
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and final performance by combining the chosen actions or action probabilities of different RL algorithms. We designed and implemented four different ensemble methods combining the following five different RL algorithms: Q-learning, Sarsa, actor-critic (AC), QV-learning, and AC learning automaton. The intuitively designed ensemble methods, namely, majority voting (MV), rank voting, Boltzmann multiplication (BM), and Boltzmann addition, combine the policies derived from the value functions of the different RL algorithms, in contrast to previous work where ensemble methods have been used in RL for representing and learning a single value function. We show experiments on five maze problems of varying complexity; the first problem is simple, but the other four maze tasks are of a dynamic or partially observable nature. The results indicate that the BM and MV ensembles significantly outperform the single RL algorithms.
引用
收藏
页码:930 / 936
页数:7
相关论文
共 50 条
  • [1] ENSEMBLE LEARNING ALGORITHMS
    Turan, Selin Ceren
    Cengiz, Mehmet Ali
    JOURNAL OF SCIENCE AND ARTS, 2022, (02): : 459 - 470
  • [2] Ensemble reinforcement learning: A survey
    Song, Yanjie
    Suganthan, Ponnuthurai Nagaratnam
    Pedrycz, Witold
    Ou, Junwei
    He, Yongming
    Chen, Yingwu
    Wu, Yutong
    APPLIED SOFT COMPUTING, 2023, 149
  • [3] Dynamic Ensemble Selection with Reinforcement Learning
    Liu, Lihua
    Wu, Jibing
    Li, Xuan
    Huang, Hongbin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 629 - 640
  • [4] An ensemble method for inverse reinforcement learning
    Lin, Jin-Ling
    Hwang, Kao-Shing
    Shi, Haobin
    Pan, Wei
    INFORMATION SCIENCES, 2020, 512 (512) : 518 - 532
  • [5] Ensemble pruning using reinforcement learning
    Partalas, Ioannis
    Tsoumakas, Grigorios
    Katakis, Ioannis
    Vlahavas, Ioannis
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 301 - 310
  • [6] Reinforcement Learning of Protein Conformational Ensemble
    Feng, Jiangyan
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 184A - 184A
  • [7] Evolutionary algorithms for reinforcement learning
    Moriarty, DE
    Schultz, AC
    Grefenstette, JJ
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 241 - 276
  • [8] Evolutionary Algorithms for Reinforcement Learning
    Moriarty, David E.
    Schultz, Alan C.
    Grefenstette, John J.
    Journal of Artificial Intelligence Research, 1999, 11 (00): : 241 - 276
  • [9] REINFORCEMENT LEARNING ALGORITHMS IN ROBOTICS
    Bocsi, Botond
    Csato, Lehel
    KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 131 - 142
  • [10] REINFORCEMENT LEARNING - ARCHITECTURES AND ALGORITHMS
    KOKAR, MM
    REVELIOTIS, SA
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1993, 8 (08) : 875 - 894