Hyperparameters of Multilayer Perceptron with Normal Distributed Weights

Cited by: 3
Authors
Karaki, Y. [1]
Ivanov, N. [2]
Affiliations
[1] Arts Sci & Technol Univ Lebanon, Fac Sci & Fine Arts, Dept Comp Sci, Beirut 146495, Lebanon
[2] Belarusian State Univ Informat & Radioelect, Fac Comp Syst & Networks, Dept Comp Machinery, Minsk 220013, Belarus
Keywords
neural networks; hyperparameters; Gaussian distribution; Bayesian optimization; multivariate skewness; kurtosis
DOI
10.1134/S1054661820020054
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Multilayer perceptrons, recurrent neural networks, convolutional networks, and other types of neural networks are now widespread. Neural networks have hyperparameters such as the number of hidden layers, the number of units in each hidden layer, the learning rate, and the activation function. Bayesian optimization is one of the methods used for tuning hyperparameters. This technique usually treats the values of neurons in the network as stochastic Gaussian processes. This article reports experimental results of multivariate normality tests and shows that the neuron vectors are considerably far from a Gaussian distribution.
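The abstract does not say which multivariate normality test was used. Since the keywords list multivariate skewness and kurtosis, the sketch below assumes Mardia's skewness and kurtosis statistics as one plausible choice; it is an illustration, not the authors' procedure. The function names `invert` and `mardia` are hypothetical, and the rectified Gaussian vectors are only a stand-in for neuron vectors, not data from the article.

```python
import math
import random


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [list(row) + [float(i == j) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def mardia(samples):
    """Mardia's multivariate skewness and kurtosis for a list of d-dimensional vectors."""
    n, d = len(samples), len(samples[0])
    mean = [sum(x[j] for x in samples) / n for j in range(d)]
    centered = [[x[j] - mean[j] for j in range(d)] for x in samples]
    # Biased (maximum-likelihood) covariance, as in Mardia's definition.
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(d)]
           for i in range(d)]
    inv = invert(cov)
    # Row i of `whitened` is centered[i] multiplied by the inverse covariance.
    whitened = [[sum(c[k] * inv[k][j] for k in range(d)) for j in range(d)]
                for c in centered]
    b1 = 0.0  # skewness: mean of cubed cross Mahalanobis products
    b2 = 0.0  # kurtosis: mean of squared Mahalanobis distances squared
    for i in range(n):
        for j in range(n):
            g = sum(a * b for a, b in zip(whitened[i], centered[j]))
            b1 += g ** 3
            if i == j:
                b2 += g ** 2
    b1 /= n * n
    b2 /= n
    # Under normality, n*b1/6 is approximately chi-square with skew_df degrees
    # of freedom, and kurt_z is approximately standard normal.
    kurt_z = (b2 - d * (d + 2)) / math.sqrt(8 * d * (d + 2) / n)
    return {
        "skew_stat": n * b1 / 6,
        "skew_df": d * (d + 1) * (d + 2) // 6,
        "kurt_z": kurt_z,
        "kurt_p": math.erfc(abs(kurt_z) / math.sqrt(2)),  # two-sided p-value
    }


rng = random.Random(0)
gaussian = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(300)]
# Hypothetical stand-in for neuron vectors: Gaussian pre-activations through ReLU.
rectified = [[max(0.0, v) for v in row] for row in gaussian]
print("gaussian :", mardia(gaussian))
print("rectified:", mardia(rectified))
```

A skewness statistic far above its degrees of freedom, or a kurtosis z-score far from zero, is evidence against multivariate normality. The double loop is quadratic in the sample size, so this pure-Python version suits only small samples.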
Pages: 170-173
Number of pages: 4