Efficient hyperparameters optimization through model-based reinforcement learning with experience exploiting and meta-learning

被引：5

作者：

Liu, Xiyuan ^{[1
]}

Wu, Jia ^{[1
]}

Chen, Senpeng ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

来源：

SOFT COMPUTING | 2023年 / 27卷 / 13期

基金：

美国国家科学基金会;

关键词：

Hyperparameters optimization; Reinforcement learning; Meta-learning; Deep learning; CLASSIFIERS;

D O I：

10.1007/s00500-023-08050-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hyperparameter optimization plays a significant role in the overall performance of machine learning algorithms. However, the computational cost of algorithm evaluation can be extremely high for complex algorithm or large dataset. In this paper, we propose a model-based reinforcement learning with experience variable and meta-learning optimization method to speed up the training process of hyperparameter optimization. Specifically, an RL agent is employed to select hyperparameters and treat the k-fold cross-validation result as a reward signal to update the agent. To guide the agent's policy update, we design an embedding representation called "experience variable" and dynamically update it during the training process. Besides, we employ a predictive model to predict the performance of machine learning algorithm with the selected hyperparameters and limit the model rollout in short horizon to reduce the impact of the inaccuracy of the model. Finally, we use the meta-learning technique to pre-train the model for fast adapting to a new task. To prove the advantages of our method, we conduct experiments on 25 real HPO tasks and the experimental results show that with the limited computational resources, the proposed method outperforms the state-of-the-art Bayesian methods and evolution method.

引用

页码：8661 / 8678

页数：18

共 50 条

[21] Developer recommendation for Topcoder through a meta-learning based policy model
Zhang, Zhenyu
Sun, Hailong
Zhang, Hongyu
EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (01) : 859 - 889
[22] Developer recommendation for Topcoder through a meta-learning based policy model
Zhenyu Zhang
Hailong Sun
Hongyu Zhang
Empirical Software Engineering, 2020, 25 : 859 - 889
[23] Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads
Belkhale, Suneel
Li, Rachel
Kahn, Gregory
McAllister, Rowan
Calandra, Roberto
Levine, Sergey
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1471 - 1478
[24] Efficient Neural Network Pruning Using Model-Based Reinforcement Learning
Bencsik, Blanka
Szemenyei, Marton
2022 INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR), 2022, : 130 - 137
[25] An Optimization-Based Meta-Learning Model for MRI Reconstruction with Diverse Dataset
Bian, Wanyu
Chen, Yunmei
Ye, Xiaojing
Zhang, Qingchao
JOURNAL OF IMAGING, 2021, 7 (11)
[26] MODEL-BASED SECURITY ANALYSIS OF FPGA DESIGNS THROUGH REINFORCEMENT LEARNING
Vetter, Michael
ACTA POLYTECHNICA, 2019, 59 (05) : 518 - 526
[27] Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation
Yang, Chenxi
Anderson, Greg
Chaudhuri, Swarat
IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 233 - 251
[28] Online Optimization Method of Learning Process for Meta-Learning
Xu, Zhixiong
Zhang, Wei
Li, Ailin
Zhao, Feifei
Jing, Yuanyuan
Wan, Zheng
Cao, Lei
Chen, Xiliang
COMPUTER JOURNAL, 2023, 67 (05) : 1645 - 1651
[29] Robustness challenges in Reinforcement Learning based time-critical cloud resource scheduling: A Meta-Learning based solution
Liu, Hongyun
Chen, Peng
Ouyang, Xue
Gao, Hui
Yan, Bing
Grosso, Paola
Zhao, Zhiming
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 146 : 18 - 33
[30] Deep Learning-Based Maximum Temperature Forecasting Assisted with Meta-Learning for Hyperparameter Optimization
Tran, Trang Thi Kieu
Lee, Taesam
Shin, Ju-Young
Kim, Jong-Suk
Kamruzzaman, Mohamad
ATMOSPHERE, 2020, 11 (05)

← 1 2 3 4 5 →