Efficient hyperparameters optimization through model-based reinforcement learning with experience exploiting and meta-learning

被引:5
|
作者
Liu, Xiyuan [1 ]
Wu, Jia [1 ]
Chen, Senpeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
基金
美国国家科学基金会;
关键词
Hyperparameters optimization; Reinforcement learning; Meta-learning; Deep learning; CLASSIFIERS;
D O I
10.1007/s00500-023-08050-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperparameter optimization plays a significant role in the overall performance of machine learning algorithms. However, the computational cost of algorithm evaluation can be extremely high for complex algorithm or large dataset. In this paper, we propose a model-based reinforcement learning with experience variable and meta-learning optimization method to speed up the training process of hyperparameter optimization. Specifically, an RL agent is employed to select hyperparameters and treat the k-fold cross-validation result as a reward signal to update the agent. To guide the agent's policy update, we design an embedding representation called "experience variable" and dynamically update it during the training process. Besides, we employ a predictive model to predict the performance of machine learning algorithm with the selected hyperparameters and limit the model rollout in short horizon to reduce the impact of the inaccuracy of the model. Finally, we use the meta-learning technique to pre-train the model for fast adapting to a new task. To prove the advantages of our method, we conduct experiments on 25 real HPO tasks and the experimental results show that with the limited computational resources, the proposed method outperforms the state-of-the-art Bayesian methods and evolution method.
引用
收藏
页码:8661 / 8678
页数:18
相关论文
共 50 条
  • [21] Developer recommendation for Topcoder through a meta-learning based policy model
    Zhang, Zhenyu
    Sun, Hailong
    Zhang, Hongyu
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (01) : 859 - 889
  • [22] Developer recommendation for Topcoder through a meta-learning based policy model
    Zhenyu Zhang
    Hailong Sun
    Hongyu Zhang
    Empirical Software Engineering, 2020, 25 : 859 - 889
  • [23] Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads
    Belkhale, Suneel
    Li, Rachel
    Kahn, Gregory
    McAllister, Rowan
    Calandra, Roberto
    Levine, Sergey
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1471 - 1478
  • [24] Efficient Neural Network Pruning Using Model-Based Reinforcement Learning
    Bencsik, Blanka
    Szemenyei, Marton
    2022 INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR), 2022, : 130 - 137
  • [25] An Optimization-Based Meta-Learning Model for MRI Reconstruction with Diverse Dataset
    Bian, Wanyu
    Chen, Yunmei
    Ye, Xiaojing
    Zhang, Qingchao
    JOURNAL OF IMAGING, 2021, 7 (11)
  • [26] MODEL-BASED SECURITY ANALYSIS OF FPGA DESIGNS THROUGH REINFORCEMENT LEARNING
    Vetter, Michael
    ACTA POLYTECHNICA, 2019, 59 (05) : 518 - 526
  • [27] Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation
    Yang, Chenxi
    Anderson, Greg
    Chaudhuri, Swarat
    IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 233 - 251
  • [28] Online Optimization Method of Learning Process for Meta-Learning
    Xu, Zhixiong
    Zhang, Wei
    Li, Ailin
    Zhao, Feifei
    Jing, Yuanyuan
    Wan, Zheng
    Cao, Lei
    Chen, Xiliang
    COMPUTER JOURNAL, 2023, 67 (05) : 1645 - 1651
  • [29] Robustness challenges in Reinforcement Learning based time-critical cloud resource scheduling: A Meta-Learning based solution
    Liu, Hongyun
    Chen, Peng
    Ouyang, Xue
    Gao, Hui
    Yan, Bing
    Grosso, Paola
    Zhao, Zhiming
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 146 : 18 - 33
  • [30] Deep Learning-Based Maximum Temperature Forecasting Assisted with Meta-Learning for Hyperparameter Optimization
    Tran, Trang Thi Kieu
    Lee, Taesam
    Shin, Ju-Young
    Kim, Jong-Suk
    Kamruzzaman, Mohamad
    ATMOSPHERE, 2020, 11 (05)