Transfer Learning based Search Space Design for Hyperparameter Tuning

被引:3
作者
Li, Yang [1 ]
Shen, Yu [2 ]
Jiang, Huaijun [2 ]
Bai, Tianyi [3 ]
Zhang, Wentao [1 ]
Zhang, Ce [4 ]
Cui, Bin [5 ]
机构
[1] Peking Univ Tencent Data Platform, Tencent Inc, Sch CS, Technol & Engn Grp, Beijing, Peoples R China
[2] Peking Univ, Sch CS, Beijing, Peoples R China
[3] Beijing Inst Technol, Sch Math & Stat, Beijing, Peoples R China
[4] Swiss Fed Inst Technol, Syst Grp, DS3Lab, Dept Comp Sci, Zurich, Switzerland
[5] Peking Univ Qingdao, Sch CS, Peking Univ Inst Computat Social Sci, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
基金
中国国家自然科学基金;
关键词
hyperparameter optimization; search space design; bayesian optimization; transfer learning;
D O I
10.1145/3534678.3539369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The tuning of hyperparameters becomes increasingly important as machine learning (ML) models have been extensively applied in data mining applications. Among various approaches, Bayesian optimization (BO) is a successful methodology to tune hyperparameters automatically. While traditional methods optimize each tuning task in isolation, there has been recent interest in speeding up BO by transferring knowledge across previous tasks. In this work, we introduce an automatic method to design the BO search space with the aid of tuning history from past tasks. This simple yet effective approach can be used to endow many existing BO methods with transfer learning capabilities. In addition, it enjoys the three advantages: universality, generality, and safeness. The extensive experiments show that our approach considerably boosts BO by designing a promising and compact search space instead of using the entire space, and outperforms the state-of-the-arts on a wide range of benchmarks, including machine learning and deep learning tuning tasks, and neural architecture search.
引用
收藏
页码:967 / 977
页数:11
相关论文
共 49 条
[1]  
[Anonymous], 2018, PROCEEDINGS OF THE A
[2]  
[Anonymous], 2015, ECML PKDD, DOI DOI 10.1007/978-3-319-23525-7_7
[3]  
Bardenet R., 2013, INT C MACHINE LEARNI, P199
[4]  
Bergstra J.S., 2011, ADV NEURAL INFORM PR
[5]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[6]  
Bischl Bernd, 2021, ARXIV210705847
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]   Random and Coherent Noise Suppression in DAS-VSP Data by Using a Supervised Deep Learning Method [J].
Dong, Xintong ;
Li, Yue ;
Zhong, Tie ;
Wu, Ning ;
Wang, Hongzhou .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[9]  
Falkner S, 2018, PR MACH LEARN RES, V80
[10]  
Feurer M, 2015, ADV NEUR IN, V28