VisEvol: Visual Analytics to Support Hyperparameter Search through Evolutionary Optimization

被引:17
作者
Chatzimparmpas, A. [1 ]
Martins, R. M. [1 ]
Kucher, K. [1 ]
Kerren, A. [1 ,2 ]
机构
[1] Linnaeus Univ, Dept Comp Sci & Media Technol, Vaxjo, Sweden
[2] Linkoping Univ, Dept Sci & Technol, Linkoping, Sweden
关键词
PERFORMANCE; ALGORITHMS; MODELS;
D O I
10.1111/cgf.14300
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
During the training phase of machine learning (ML) models, it is usually necessary to configure several hyperparameters. This process is computationally intensive and requires an extensive search to infer the best hyperparameter set for the given problem. The challenge is exacerbated by the fact that most ML models are complex internally, and training involves trial-and-error processes that could remarkably affect the predictive result. Moreover, each hyperparameter of an ML algorithm is potentially intertwined with the others, and changing it might result in unforeseeable impacts on the remaining hyperparameters. Evolutionary optimization is a promising method to try and address those issues. According to this method, performant models are stored, while the remainder are improved through crossover and mutation processes inspired by genetic algorithms. We present VisEvol, a visual analytics tool that supports interactive exploration of hyperparameters and intervention in this evolutionary procedure. In summary, our proposed tool helps the user to generate new models through evolution and eventually explore powerful hyperparameter combinations in diverse regions of the extensive hyperparameter space. The outcome is a voting ensemble (with equal rights) that boosts the final predictive performance. The utility and applicability of VisEvol are demonstrated with two use cases and interviews with ML experts who evaluated the effectiveness of the tool.
引用
收藏
页码:201 / 214
页数:14
相关论文
共 95 条
[1]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[2]  
[Anonymous], 2017, ARXIV171109846
[3]  
[Anonymous], 2014, ARXIV14121114
[4]  
[Anonymous], MACH LEARN, DOI [10.1023/A:1010933404324, DOI 10.1023/A:1010933404324]
[5]  
[Anonymous], 2010, FLASK MICRO WEB FRAM
[6]  
Bardenet R., 2013, PMLR, P199, DOI DOI 10.5555/3042817.3042916
[7]  
Bekkar M., 2013, Journal of Information Engineering and Applications, V3, P5
[8]   Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets [J].
Belkina, Anna C. ;
Ciccolella, Christopher O. ;
Anno, Rina ;
Halpert, Richard ;
Spidlen, Josef ;
Snyder-Cappione, Jennifer E. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[9]  
Bergstra James, 2015, Computational Science and Discovery, V8, DOI 10.1088/1749-4699/8/1/014008
[10]  
Bergstra J., 2011, ADV NEURAL INFORM PR, V24