Kernel-based direct policy search reinforcement learning based on variational Bayesian inference

Cited by: 0
Authors
Yamaguchi, Nobuhiko [1]
Fukuda, Osamu [1]
Okumura, Hiroshi [1]
Affiliations
[1] Saga Univ, Grad Sch Sci & Engn, 1 Honjo Machi, Saga, Saga 8408502, Japan
Keywords
reinforcement learning; direct policy search; variational Bayesian inference; kernel methods
DOI
10.1109/CANDARW.2019.00040
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Direct policy search is a promising reinforcement learning framework, particularly for controlling continuous, high-dimensional systems. One direct policy search method is direct policy search reinforcement learning based on variational Bayesian inference (VBRL). The VBRL algorithm estimates the policy parameter by variational Bayesian inference and therefore avoids the overfitting problem. In this paper, we propose an extension of the VBRL model using kernel methods, which we call K-VBRL. The performance of the proposed K-VBRL is assessed in two experiments on the mountain car task. These experiments show that K-VBRL produces a higher average return and outperforms the conventional VBRL.
Pages: 184 - 187
Number of pages: 4
Related papers
50 in total
  • [1] Direct policy search reinforcement learning based on variational Bayesian inference
    Yamaguchi, Nobuhiko
    Ihara, Kazuya
    Fukuda, Osamu
    Okumura, Hiroshi
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 1009 - 1014
  • [2] Direct Policy Search Reinforcement Learning Based on Variational Bayesian Inference
    Yamaguchi, Nobuhiko
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2020, 24 (06) : 711 - 718
  • [3] KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING
    Mowakeaa, Rami
    Kim, Seung-Jun
    Emge, Darren K.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3500 - 3504
  • [4] Kernel-Based Decentralized Policy Evaluation for Reinforcement Learning
    Liu, Jiamin
    Lian, Heng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [5] Kernel-Based Reinforcement Learning
    Hu, Guanghua
    Qiu, Yuqin
    Xiang, Liming
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 757 - 766
  • [6] Kernel-Based Reinforcement Learning
    Dirk Ormoneit
    Śaunak Sen
    Machine Learning, 2002, 49 : 161 - 178
  • [7] Kernel-based reinforcement learning
    Ormoneit, D
    Sen, S
    MACHINE LEARNING, 2002, 49 (2-3) : 161 - 178
  • [8] Kernel-based least squares policy iteration for reinforcement learning
    Xu, Xin
    Hu, Dewen
    Lu, Xicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (04) : 973 - 992
  • [9] Practical Kernel-Based Reinforcement Learning
    Barreto, Andre M. S.
    Precup, Doina
    Pineau, Joelle
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [10] Variational Inference MPC for Bayesian Model-based Reinforcement Learning
    Okada, Masashi
    Taniguchi, Tadahiro
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100