DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks

Cited by: 93
Authors
Balaprakash, Prasanna [1]
Salim, Michael
Uram, Thomas D.
Vishwanath, Venkat
Wild, Stefan M.
Affiliations
[1] Argonne Natl Lab, Math & Comp Sci Div, Lemont, IL 60439 USA
Source
2018 IEEE 25th International Conference on High Performance Computing (HiPC) | 2018
Keywords
DOI
10.1109/HiPC.2018.00014
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Hyperparameters employed by deep learning (DL) methods play a substantial role in the performance and reliability of these methods in practice. Unfortunately, finding performance-optimizing hyperparameter settings is a notoriously difficult task. Hyperparameter search methods typically have limited production-strength implementations or do not target scalability within a highly parallel machine, portability across different machines, experimental comparison between different methods, and tighter integration with workflow systems. In this paper, we present DeepHyper, a Python package that provides a common interface for the implementation and study of scalable hyperparameter search methods. It adopts the Balsam workflow system to hide the complexities of running large numbers of hyperparameter configurations in parallel on high-performance computing (HPC) systems. We implement and study asynchronous model-based search methods that consist of sampling a small number of input hyperparameter configurations and progressively fitting surrogate models over the input-output space until exhausting a user-defined budget of evaluations. We evaluate the efficacy of these methods relative to approaches such as random search, genetic algorithms, Bayesian optimization, and hyperband on DL benchmarks on CPU- and GPU-based HPC systems.
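The asynchronous model-based search described in the abstract can be illustrated with a minimal sketch. It is not DeepHyper's API and does not use Balsam. The search space `SPACE`, the toy `objective`, and every function name below are hypothetical, and a simple nearest-neighbour predictor stands in for the surrogate model, which the abstract does not specify. A thread pool plays the role of the parallel workers: whenever one evaluation finishes, the surrogate is consulted again and the free worker is refilled without waiting for the others, until the evaluation budget is exhausted.

```python
import math
import random
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait

# Hypothetical search space: (low, high) bounds per hyperparameter.
SPACE = {"log10_lr": (-5.0, -1.0), "dropout": (0.0, 0.5)}


def objective(config):
    """Toy stand-in for training a network; returns a loss to minimize."""
    return (config["log10_lr"] + 3.0) ** 2 + (config["dropout"] - 0.2) ** 2


def sample(rng):
    """Draw one configuration uniformly at random from SPACE."""
    return {k: rng.uniform(lo, hi) for k, (lo, hi) in SPACE.items()}


def distance(a, b):
    """Euclidean distance with each dimension scaled to [0, 1]."""
    return math.sqrt(sum(((a[k] - b[k]) / (hi - lo)) ** 2
                         for k, (lo, hi) in SPACE.items()))


def surrogate_score(candidate, history, kappa=0.5):
    """Nearest-neighbour surrogate (an assumption, not the paper's model):
    loss of the closest evaluated point minus an exploration bonus."""
    near_cfg, near_loss = min(history, key=lambda h: distance(candidate, h[0]))
    return near_loss - kappa * distance(candidate, near_cfg)


def propose(history, rng, n_candidates=200):
    """Return the random candidate that the surrogate ranks best."""
    candidates = [sample(rng) for _ in range(n_candidates)]
    return min(candidates, key=lambda c: surrogate_score(c, history))


def async_search(budget=40, n_initial=8, n_workers=4, seed=0):
    """Run the search; returns (best (config, loss) pair, full history)."""
    rng = random.Random(seed)
    history = []   # completed (config, loss) pairs
    pending = {}   # future -> config for evaluations still running
    submitted = 0
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # Seed the search with a small random sample of configurations.
        while submitted < min(n_initial, budget):
            config = sample(rng)
            pending[pool.submit(objective, config)] = config
            submitted += 1
        while pending:
            done, _ = wait(list(pending), return_when=FIRST_COMPLETED)
            for future in done:
                history.append((pending.pop(future), future.result()))
                # Refill the free worker immediately; there is no barrier.
                if submitted < budget:
                    config = propose(history, rng)
                    pending[pool.submit(objective, config)] = config
                    submitted += 1
    return min(history, key=lambda h: h[1]), history


if __name__ == "__main__":
    best, history = async_search()
    print(len(history), "evaluations; best:", best)
```

Because completion order depends on thread scheduling, the exact sequence of proposals can differ between runs even with a fixed seed; only the total number of evaluations is guaranteed to equal the budget.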
Pages: 42-51
Number of pages: 10