Hyperparameter optimization through context-based meta-reinforcement learning with task-aware representation

被引：6

作者：

Wu, Jia ^{[1
]}

Liu, Xiyuan ^{[1
]}

Chen, Senpeng ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 260卷

基金：

中国国家自然科学基金;

关键词：

Hyperparameter optimization; Reinforcement learning; Meta; -learning; Deep learning;

D O I：

10.1016/j.knosys.2022.110160

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we combine context-based Meta-Reinforcement Learning with task-aware representation to efficiently overcome data-inefficiency and limited generalization in the hyperparameter optimiza-tion problem. First, we propose a new context-based meta-RL model that disentangles task inference and control, which improves the meta-training efficiency and accelerates the learning process for unseen tasks. Second, the task properties are inferred on-line, which includes not only the dataset representation but also the task-solving experience, thus encouraging the agent to explore in a much smarter fashion. Third, we employ amortized meta-learning to meta-train the agent, which is simple and runs faster than the gradient-based meta-training method. Experimental results suggest that our method can search for the optimal hyperparameter configuration with limited computational cost in a reasonable time.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：11

共 38 条

[1]

Bay S.D., 2000, ACM SIGKDD Explor. Newsl., P81, DOI DOI 10.1145/380995.381030

[2]

Bergstra J, 2011, P 24 INT C NEURAL IN, V24

[3]

Bergstra J, 2012, J MACH LEARN RES, V13, P281

[4] XGBoost: A Scalable Tree Boosting System [J].

Chen, Tianqi ;

Guestrin, Carlos .

KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794

[5]

Demsar J, 2006, J MACH LEARN RES, V7, P1

[6]

Duan Y, 2016, Arxiv, DOI arXiv:1611.02779

[7]

Edwards H., 2017, P INT C LEARNING REP

[8]

Fakoor R., 2020, P INT C LEARN REPR I

[9]

Feurer M, 2021, Arxiv, DOI [arXiv:2007.04074, 10.48550/arXiv.2007.04074]

[10]

Feurer M, 2015, AAAI CONF ARTIF INTE, P1128

← 1 2 3 4 →