Empirical Evaluation of Contextual Policy Search with a Comparison-based Surrogate Model and Active Covariance Matrix Adaptation

被引:1
|
作者
Fabisch, Alexander [1 ]
机构
[1] DFKI GmbH, Robot Innovat Ctr, Kaiserslautern, Germany
来源
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION) | 2019年
基金
欧盟地平线“2020”;
关键词
multi-task learning; policy search; black-box optimization;
D O I
10.1145/3319619.3321935
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Contextual policy search (CPS) is a class of multi-task reinforcement learning algorithms that is particularly useful for robotic applications. A recent state-of-the-art method is Contextual Covariance Matrix Adaptation Evolution Strategies (C-CMA-ES). It is based on the standard black-box optimization algorithm CMA-ES. There are two useful extensions of CMA-ES that we will transfer to C-CMA-ES and evaluate empirically: ACM-ES, which uses a comparison-based surrogate model, and aCMA-ES, which uses an active update of the covariance matrix. We will show that improvements with these methods can be impressive in terms of sample-efficiency, although this is not relevant any more for the robotic domain.
引用
收藏
页码:251 / 252
页数:2
相关论文
共 2 条
  • [1] Contextual Direct Policy Search With Regularized Covariance Matrix Estimation
    Abdolmaleki, Abbas
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    Neumann, Gerhard
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2019, 96 (02) : 141 - 157
  • [2] Overview of Surrogate-model Versions of Covariance Matrix Adaptation Evolution Strategy
    Pitra, Zbynek
    Bajer, Lukas
    Repicky, Jakub
    Holena, Martin
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1622 - 1629