Design of experiments and response surface methodology to tune machine learning hyperparameters, with a random forest case-study

被引:105
作者
Lujan-Moreno, Gustavo A. [1 ,2 ]
Howard, Phillip R. [3 ]
Rojas, Omar G. [1 ]
Montgomery, Douglas C. [4 ]
机构
[1] Univ Panamer, Escuela Ciencias Econ & Empresariales, Prolongac Calzada Circunvalac Poniente 49, Zapopan 45010, Jalisco, Mexico
[2] Intel Corp, Ave Bosque 1001, Zapopan 45019, Jalisco, Mexico
[3] Intel Corp, 5000 W Chandler Blvd, Chandler, AZ 85226 USA
[4] Arizona State Univ, Sch Comp Informat & Decis Syst, Brickyard Engn, 699 S Mill Ave, Tempe, AZ 85281 USA
关键词
Design of experiments; Hyperparameters; Machine learning; Random forest; Response surface methodology; Tuning; PARAMETERS; SEARCH;
D O I
10.1016/j.eswa.2018.05.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most machine learning algorithms possess hyperparameters. For example, an artificial neural network requires the determination of the number of hidden layers, nodes, and many other parameters related to the model fitting process. Despite this, there is still no clear consensus on how to tune them. The most popular methodology is an exhaustive grid search, which can be highly inefficient and sometimes infeasible. Another common solution is to change one hyperparameter at a time and measure its effect on the model's performance. However, this can also be inefficient and does not guarantee optimal results since it ignores interactions between the hyperparameters. In this paper, we propose to use the Design of Experiments (DOE) methodology (factorial designs) for screening and Response Surface Methodology (RSM) to tune a machine learning algorithm's hyperparameters. An application of our methodology is presented with a detailed discussion of the results of a random forest case-study using a publicly available dataset. Benefits include fewer training runs, better parameter selection, and a disciplined approach based on statistical theory. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:195 / 205
页数:11
相关论文
共 24 条
[1]  
[Anonymous], ARXIV14077969
[2]  
[Anonymous], 2006, Introduction to Data Mining
[3]  
[Anonymous], 2015, DESIGN ANAL SIMULATI
[4]  
Bardenet R., 2013, JMLR WORKSH C P, P199, DOI DOI 10.5555/3042817.3042916
[5]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[6]  
Bergstra JS., 2011, ADV NEURAL INFORM PR, P2546
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Extending Morris method: identification of the interaction graph using cycle-equitable designs [J].
Fedou, J-M. ;
Rendas, M-J. .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (07) :1398-1419
[10]   RIDGE ANALYSIS 25 YEARS LATER [J].
HOERL, RW .
AMERICAN STATISTICIAN, 1985, 39 (03) :186-192