Noisy Derivative-Free Optimization with Value Suppression

被引:0
|
作者
Wang, Hong [1 ]
Qian, Hong [1 ]
Yu, Yang [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
来源
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2018年
关键词
EVOLUTIONARY; ENVIRONMENTS; STRATEGY; TIME;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Derivative-free optimization has shown advantage in solving sophisticated problems such as policy search, when the environment is noise-free. Many real-world environments are noisy, where solution evaluations are inaccurate due to the noise. Noisy evaluation can badly injure derivative-free optimization, as it may make a worse solution looks better. Sampling is a straightforward way to reduce noise, while previous studies have shown that delay the noise handling to the comparison time point (i.e., threshold selection) can be helpful for derivative-free optimization. This work further delays the noise handling, and proposes a simple noise handling mechanism, i.e., value suppression. By value suppression, we do nothing about noise until the best-so-far solution has not been improved for a period, and then suppress the value of the best-so-far solution and continue the optimization. On synthetic problems as well as reinforcement learning tasks, experiments verify that value suppression can be significantly more effective than the previous methods.
引用
收藏
页码:1447 / 1454
页数:8
相关论文
共 50 条
  • [31] DERIVATIVE-FREE OPTIMIZATION OF HEARING AID PARAMETERS
    Chu, Shu-Hsien
    Hong, Mingyi
    Luo, Zhi-Quan
    Fitz, Kelly
    McKinney, Martin
    Zhang, Tao
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 393 - 397
  • [32] Curvature-Aware Derivative-Free Optimization
    Kim, Bumsu
    Mckenzie, Daniel
    Cai, Hanqin
    Yin, Wotao
    JOURNAL OF SCIENTIFIC COMPUTING, 2025, 103 (02)
  • [33] Inexact Derivative-Free Optimization for Bilevel Learning
    Matthias J. Ehrhardt
    Lindon Roberts
    Journal of Mathematical Imaging and Vision, 2021, 63 : 580 - 600
  • [34] A Discussion on Variational Analysis in Derivative-Free Optimization
    Hare, Warren
    SET-VALUED AND VARIATIONAL ANALYSIS, 2020, 28 (04) : 643 - 659
  • [35] A Derivative-Free Geometric Algorithm for Optimization on a Sphere
    Chen, Yannan
    Xi, Min
    Zhang, Hongchao
    CSIAM TRANSACTIONS ON APPLIED MATHEMATICS, 2020, 1 (04): : 766 - 801
  • [36] Constrained derivative-free optimization on thin domains
    J. M. Martínez
    F. N. C. Sobral
    Journal of Global Optimization, 2013, 56 : 1217 - 1232
  • [37] A derivative-free algorithm for spherically constrained optimization
    Min Xi
    Wenyu Sun
    Yannan Chen
    Hailin Sun
    Journal of Global Optimization, 2020, 76 : 841 - 861
  • [38] Derivative-free robust optimization by outer approximations
    Menickelly, Matt
    Wild, Stefan M.
    MATHEMATICAL PROGRAMMING, 2020, 179 (1-2) : 157 - 193
  • [39] Constrained derivative-free optimization on thin domains
    Martinez, J. M.
    Sobral, F. N. C.
    JOURNAL OF GLOBAL OPTIMIZATION, 2013, 56 (03) : 1217 - 1232
  • [40] A derivative-free comirror algorithm for convex optimization
    Bauschke, Heinz H.
    Hare, Warren L.
    Moursi, Walaa M.
    OPTIMIZATION METHODS & SOFTWARE, 2015, 30 (04): : 706 - 726