Automatic tuning of hyperparameters using Bayesian optimization

被引:331
作者
Victoria, A. Helen [1 ]
Maragatham, G. [1 ]
机构
[1] SRM Inst Sci & Technol, Dept Informat Technol, Chennai, Tamil Nadu, India
关键词
Hyperparameters; Optimization; CIFAR-10; Black box function;
D O I
10.1007/s12530-020-09345-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning is a field in artificial intelligence that works well in computer vision, natural language processing and audio recognition. Deep neural network architectures has number of layers to conceive the features well, by itself. The hyperparameter tuning plays a major role in every dataset which has major effect in the performance of the training model. Due to the large dimensionality of data it is impossible to tune the parameters by human expertise. In this paper, we have used the CIFAR-10 Dataset and applied the Bayesian hyperparameter optimization algorithm to enhance the performance of the model. Bayesian optimization can be used for any noisy black box function for hyperparameter tuning. In this work Bayesian optimization clearly obtains optimized values for all hyperparameters which saves time and improves performance. The results also show that the error has been reduced in graphical processing unit than in CPU by 6.2% in the validation. Achieving global optimization in the trained model helps transfer learning across domains as well.
引用
收藏
页码:217 / 223
页数:7
相关论文
共 27 条
[1]   A GENERALIZED-APPROACH TO FUZZY OPTIMIZATION [J].
ANGELOV, P .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1994, 9 (03) :261-268
[2]   An Approach to Automatic Real-Time Novelty Detection, Object Identification, and Tracking in Video Streams Based on Recursive Density Estimation and Evolving Takagi-Sugeno Fuzzy Systems [J].
Angelov, Plamen ;
Sadeghi-Tehran, Pouria ;
Ramezani, Ramin .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (03) :189-205
[3]  
[Anonymous], 2018, LETS KEEP IT SIMPLE
[4]  
[Anonymous], LION
[5]  
[Anonymous], 2014, CORRELATION BUDGET C
[6]  
[Anonymous], 2017, 31 C NEUR INF PROC S
[7]   DEC: Dynamically Evolving Clustering and Its Application to Structure Identification of Evolving Fuzzy Models [J].
Baruah, Rashmi Dutta ;
Angelov, Plamen .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (09) :1619-1631
[8]  
Bergstra J., 2013, PMLR, P115, DOI DOI 10.5555/3042817.3042832
[9]  
Bergstra J., 2011, P 25 INT C NEUR INF, P24, DOI DOI 10.5555/2986459.2986743
[10]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281