Hyperparameter optimization through context-based meta-reinforcement learning with task-aware representation

被引:6
作者
Wu, Jia [1 ]
Liu, Xiyuan [1 ]
Chen, Senpeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperparameter optimization; Reinforcement learning; Meta; -learning; Deep learning;
D O I
10.1016/j.knosys.2022.110160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we combine context-based Meta-Reinforcement Learning with task-aware representation to efficiently overcome data-inefficiency and limited generalization in the hyperparameter optimiza-tion problem. First, we propose a new context-based meta-RL model that disentangles task inference and control, which improves the meta-training efficiency and accelerates the learning process for unseen tasks. Second, the task properties are inferred on-line, which includes not only the dataset representation but also the task-solving experience, thus encouraging the agent to explore in a much smarter fashion. Third, we employ amortized meta-learning to meta-train the agent, which is simple and runs faster than the gradient-based meta-training method. Experimental results suggest that our method can search for the optimal hyperparameter configuration with limited computational cost in a reasonable time.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 38 条
[1]  
Bay S.D., 2000, ACM SIGKDD Explor. Newsl., P81, DOI DOI 10.1145/380995.381030
[2]  
Bergstra J, 2011, P 24 INT C NEURAL IN, V24
[3]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[6]  
Duan Y, 2016, Arxiv, DOI arXiv:1611.02779
[7]  
Edwards H., 2017, P INT C LEARNING REP
[8]  
Fakoor R., 2020, P INT C LEARN REPR I
[9]  
Feurer M, 2021, Arxiv, DOI [arXiv:2007.04074, 10.48550/arXiv.2007.04074]
[10]  
Feurer M, 2015, AAAI CONF ARTIF INTE, P1128