Hyper-Parameter Optimization for Deep Learning by Surrogate-based Model with Weighted Distance Exploration

被引:0
作者
Li, Zhenhua [1 ]
Shoemaker, Christine A. [2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Comp Sci & Technol, Nanjing, Peoples R China
[2] Natl Univ Singapore, Dept Ind Syst Engn, Singapore, Singapore
来源
2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021) | 2021年
关键词
hyper-parameter optimization; deep neural networks; surrogate optimization; radial basis function; GLOBAL OPTIMIZATION; SEARCH;
D O I
10.1109/CEC45853.2021.9504777
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve deep neural net hyper-parameter optimization we develop a deterministic surrogate optimization algorithm as an efficient alternative to Bayesian optimization. A deterministic Radial Basis Function (RBF) surrogate model is built to interpolate previously evaluated points, and this surrogate model is incrementally updated in each iteration. The stochastic algorithm CMA-ES is used to search the acquisition function based on the surrogate. The acquisition function at a point is based on a weighted average of the surrogate at x and the minimum distance from x to a previously evaluated point. We evaluate the proposed algorithm RBF-CMA on hyper-parameter optimization tasks for deep convolutional neural networks on datasets of CIFAR-10, SVHN, and CIFAR-100. We show that RBF-CMA achieves a promising performance especially when the search space dimension is high in comparison to other algorithms including GP-EI, GP-LCB, and SMBO.
引用
收藏
页码:917 / 925
页数:9
相关论文
共 28 条
[21]   A stochastic radial basis function method for the global optimization of expensive functions [J].
Regis, Rommel G. ;
Shoemaker, Christine A. .
INFORMS JOURNAL ON COMPUTING, 2007, 19 (04) :497-509
[22]   Combining radial basis function surrogates and dynamic coordinate search in high-dimensional expensive black-box optimization [J].
Regis, Rommel G. ;
Shoemaker, Christine A. .
ENGINEERING OPTIMIZATION, 2013, 45 (05) :529-555
[23]   Parallel Stochastic Global Optimization Using Radial Basis Functions [J].
Regis, Rommel G. ;
Shoemaker, Christine A. .
INFORMS JOURNAL ON COMPUTING, 2009, 21 (03) :411-426
[24]  
Seeger Matthias, 2004, Int J Neural Syst, V14, P69, DOI 10.1142/S0129065704001899
[25]   Taking the Human Out of the Loop: A Review of Bayesian Optimization [J].
Shahriari, Bobak ;
Swersky, Kevin ;
Wang, Ziyu ;
Adams, Ryan P. ;
de Freitas, Nando .
PROCEEDINGS OF THE IEEE, 2016, 104 (01) :148-175
[26]  
Snoek J., 2012, P 25 INT C NEURIPS, V25, P2951
[27]  
Snoek J, 2015, PR MACH LEARN RES, V37, P2171
[28]  
Springenberg JT, 2016, ADV NEUR IN, V29