A New Automatic Hyperparameter Recommendation Approach Under Low-Rank Tensor Completion e Framework

被引:12
作者
Deng, Liping [1 ]
Xiao, Mingqing [1 ]
机构
[1] Southern Illinois Univ, Sch Math & Stat Sci, Carbondale, IL 62901 USA
关键词
Tensors; Task analysis; Search problems; Support vector machines; Optimization; Correlation; Testing; Automatic hyperparameter recommendation; classification; meta-learning; tensor completion; kernel function; CLASSIFICATION ALGORITHMS; SELECTION; SEARCH;
D O I
10.1109/TPAMI.2022.3195658
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperparameter optimization (HPO), characterized by hyperparameter tuning, is not only a critical step for effective modeling but also is the most time-consuming process in machine learning. Traditional search-based algorithms tend to require extensive configuration evaluations for each round to select the desirable hyperparameters during the process, and they are often very inefficient for the implementations on large-scale tasks. In this paper, we study the HPO problem via meta-learning (MtL) approach under the low-rank tensor completion (LRTC) framework. Our proposed approach predicts the performance for hyperparameters of new problems based on their previous performance so that the underlying suitable hyperparameters with better efficiency can be attained. Different from existing approaches, the hyperparameter performance space is instantiated under tensor framework that can preserve the spatial structure and reflect the correlations among the adjacent hyperparameters. When some partial evaluations are available for a new problem, the task of estimating the performance of the unevaluated hyperparameters can be formulated as a tensor completion (TC) problem. Toward the completion purpose, we develop an LRTC algorithm utilizing the sum of nuclear norm (SNN) model. A kernelized version is further developed to capture the nonlinear structure of the performance space. In addition, a corresponding coupled matrix factorization (CMF) algorithm is established to render the predictions solely depend on the meta-features to avoid additional hyperparameter evaluations. Finally, a strategy integrating LRTC and CMF is provided to further enhance the recommendation capacity. We test recommendation performance with our proposed methods for classical SVM and the state-of-the-art deep neural networks such as vision transformer (ViT) and residual network (ResNet), and the obtained results demonstrate the effectiveness of our approaches under various evaluation metrics by comparing with the baselines commonly used for MtL.
引用
收藏
页码:4038 / 4050
页数:13
相关论文
共 44 条
[1]  
Alcobaça E, 2020, J MACH LEARN RES, V21
[2]   On learning algorithm selection for classification [J].
Ali, S ;
Smith, KA .
APPLIED SOFT COMPUTING, 2006, 6 (02) :119-138
[3]  
[Anonymous], 2007, Machine learning: ECML 2001, DOI DOI 10.1007/3-540-44795-43
[4]  
[Anonymous], 2012, ADV NEURAL INF PROCE
[5]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[6]  
Chen M., 2021, 2021 IEEECVF INT C C, p12 270
[7]  
Chen Minghao, 2021, Advances in Neural Information Processing Systems, V34
[8]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]  
Dyrmishi Salijona, 2019, 2019 International Conference on Data Mining Workshops (ICDMW). Proceedings, P97, DOI 10.1109/ICDMW.2019.00025
[10]  
Fusi N, 2018, ADV NEUR IN, V31