Robust Black-Box Optimization for Stochastic Search and Episodic Reinforcement Learning

被引:0
作者
Huttenrauch, Maximilian [1 ]
Neumann, Gerhard [1 ]
机构
[1] Karlsruhe Inst Technol, Dept Comp Sci, Karlsruhe, Germany
关键词
black-box optimization; stochastic search; derivative-free optimization; evolution strategies; episodic reinforcement learning; EVOLUTIONARY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Black -box optimization is a versatile approach to solve complex problems where the objective function is not explicitly known and no higher order information is available. Due to its general nature, it finds widespread applications in function optimization as well as machine learning, especially episodic reinforcement learning tasks. While traditional black -box optimizers like CMA-ES may falter in noisy scenarios due to their reliance on ranking -based transformations, a promising alternative emerges in the form of the Model -based Relative Entropy Stochastic Search (MORE) algorithm. MORE can be derived from natural policy gradients and compatible function approximation and directly optimizes the expected fitness without resorting to rankings. However, in its original formulation, MORE often cannot achieve state of the art performance. In this paper, we improve MORE by decoupling the update of the search distribution's mean and covariance and an improved entropy scheduling technique based on an evolution path resulting in faster convergence, and a simplified model learning approach in comparison to the original paper. We show that our algorithm performs comparable to state-of-the-art black -box optimizers on standard benchmark functions. Further, it clearly outperforms ranking -based methods and other policy -gradient based black -box algorithms as well as state of the art deep reinforcement learning algorithms when used for episodic reinforcement learning tasks.
引用
收藏
页码:1 / 44
页数:44
相关论文
共 50 条
[31]   Bayesian Performance Analysis for Black-Box Optimization Benchmarking [J].
Calvo, Borja ;
Shir, Ofer M. ;
Ceberio, Josu ;
Doerr, Carola ;
Wang, Hao ;
Back, Thomas ;
Lozano, Jose A. .
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, :1789-1797
[32]   An Evolution Strategy for Black-box Optimization on Matrix Manifold [J].
He X.-Y. ;
Zhou Y.-R. ;
Chen Z.-F. .
Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (09) :1604-1623
[33]   Tuple leading differential evolution for black-box optimization [J].
Ma, Guang-Chuan ;
Yang, Qiang ;
Li, Jian-Yu ;
Zhao, Hong ;
Gao, Xu-Dong ;
Lu, Zhen-Yu ;
Zhang, Jun .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 287
[34]   A method for convex black-box integer global optimization [J].
Larson, Jeffrey ;
Leyffer, Sven ;
Palkar, Prashant ;
Wild, Stefan M. .
JOURNAL OF GLOBAL OPTIMIZATION, 2021, 80 (02) :439-477
[35]   A method for convex black-box integer global optimization [J].
Jeffrey Larson ;
Sven Leyffer ;
Prashant Palkar ;
Stefan M. Wild .
Journal of Global Optimization, 2021, 80 :439-477
[36]   An evolutionary approach to black-box optimization on matrix manifolds? [J].
He, Xiaoyu ;
Zhou, Yuren ;
Chen, Zefeng ;
Jiang, Siyu .
APPLIED SOFT COMPUTING, 2020, 97
[37]   Black-Box Optimization by Fourier Analysis and Swarm Intelligence [J].
Lim, Eldin Wee Chuan ;
New, Jin Rou .
JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 2012, 45 (06) :417-428
[38]   The "Black-Box" Optimization Problem: Zero-Order Accelerated Stochastic Method via Kernel Approximation [J].
Lobanov, Aleksandr ;
Bashirov, Nail ;
Gasnikov, Alexander .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2024, 203 (03) :2451-2486
[39]   Black-Box Optimization Benchmarking the IPOP-CMA-ES on the Noisy Testbed [J].
Ros, Raymond .
GECCO-2010 COMPANION PUBLICATION: PROCEEDINGS OF THE 12TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2010, :1511-1517
[40]   SMGO-Δ: Balancing caution and reward in global optimization with black-box constraints [J].
Sabug, Lorenzo, Jr. ;
Ruiz, Fredy ;
Fagiano, Lorenzo .
INFORMATION SCIENCES, 2022, 605 :15-42