Kernel-based direct policy search reinforcement learning based on variational Bayesian inference

被引:0
|
作者
Yamaguchi, Nobuhiko [1 ]
Fukuda, Osamu [1 ]
Okumura, Hiroshi [1 ]
机构
[1] Saga Univ, Grad Sch Sci & Engn, 1 Honjo Machi, Saga, Saga 8408502, Japan
关键词
reinforcement learning; direct policy search; variational Bayesian inference; kernel methods;
D O I
10.1109/CANDARW.2019.00040
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Direct policy search is a promising reinforcement learning framework in particular for controlling continuous, high-dimensional systems. As one of direct policy search, direct policy search reinforcement learning based on variational Bayesian inference (VBRL) was proposed. The VBRL algorithm estimates the policy parameter based on variational Bayesian inference and is therefore avoid overfitting problem. In this paper, we propose an extension of the VBRL model using techniques of kernel methods, which we call K-VBRL. The performance of the proposed K-VBRL is assessed in two experiments with mountain car task. These experiments highlight the K-VBRL produces higher average return and outperforms the conventional VBRL.
引用
收藏
页码:184 / 187
页数:4
相关论文
共 50 条
  • [41] Online Kernel-Based Mode Learning
    Wang, Tao
    Yao, Weixin
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2025,
  • [42] SEQUENTIAL SAMPLING WITH KERNEL-BASED BAYESIAN NETWORK CLASSIFIERS
    Shahan, David
    Seepersad, Carolyn C.
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2011, VOL 5, PTS A AND B, 2012, : 877 - 890
  • [43] A Kernel-based Approach to Direct Action Perception
    Kroemer, O.
    Ugur, E.
    Oztop, E.
    Peters, J.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 2605 - 2610
  • [44] Approximate Bayesian methods for kernel-based object tracking
    Zivkovic, Zoran
    Cemgil, Ali Taylan
    Krose, Ben
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (06) : 743 - 749
  • [45] Advanced search algorithms for information-theoretic learning with kernel-based estimators
    Morejon, RA
    Principe, JC
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (04): : 874 - 884
  • [46] A Kernel-Based Bayesian Classifier for Fault Detection and Classification
    Yu, ChunMei
    Pan, Quan
    Cheng, YongMei
    Zhang, HongCai
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 124 - 128
  • [47] A sparse kernel-based least-squares temporal difference algorithm for reinforcement learning
    Xu, Xin
    ADVANCES IN NATURAL COMPUTATION, PT 1, 2006, 4221 : 47 - 56
  • [48] Online Attentive Kernel-Based Off-Policy Temporal Difference Learning
    Yang, Shangdong
    Zhang, Shuaiqiang
    Chen, Xingguo
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [49] Variational Bayesian Sparse Kernel-Based Blind Image Deconvolution With Student's-t Priors
    Tzikas, Dimitris G.
    Likas, Aristidis C.
    Galatsanos, Nikolaos P.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (04) : 753 - 764
  • [50] Learning rates for kernel-based expectile regression
    Farooq, Muhammad
    Steinwart, Ingo
    MACHINE LEARNING, 2019, 108 (02) : 203 - 227