Deep Reinforcement Learning with Model-based Acceleration for Hyperparameter Optimization

被引:9
作者
Chen, SenPeng [1 ]
Wu, Jia [1 ]
Chen, XiuYun [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China
来源
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019) | 2019年
关键词
Hyperparameter optimization; Automated machine learning; Deep Reinforcement learning;
D O I
10.1109/ICTAI.2019.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperparameter optimization is a key part of AutoML. In recent years, there have been successful hyperparameter optimization algorithms. However, these methods still face several challenges, such as high cost of evaluating large models or large datasets. In this paper, we introduce a new deep reinforcement learning architecture with model-based acceleration to optimize hyperparameters for any machine learning model. In this method, an agent constructed by a Long Short-Term Memory Network aims at maximizing the expected accuracy of a machine learning model on a validation set. To speed up training, we employ a model to predict the accuracy on a validation set instead of evaluating a machine learning model. To effectively train the agent and the predictive model, Real-Predictive-Real training process is proposed. Besides, to reduce the variance, we propose a bootstrap pool to guide the exploration in the search space. The experiment was carried out by optimizing hyperparameters of two widely used machine learning models: Random Forests and XGBoost. Experimental results show that the proposed method outperforms random search, Bayesian optimization, and Tree-structured Parzen Estimator in terms of accuracy, time efficiency and stability.
引用
收藏
页码:170 / 177
页数:8
相关论文
共 50 条
[21]   Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks [J].
Iranfar, Arman ;
Zapater, Marina ;
Atienza, David .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (04) :1034-1047
[22]   Evolutionary Reinforcement Learning for Automated Hyperparameter Optimization in EEG Classification [J].
Shin, Dong-Hee ;
Ko, Dong-Hee ;
Han, Ji-Wung ;
Kam, Tae-Eui .
10TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE (BCI2022), 2022,
[23]   Design and Optimization of Hybrid CNN-DT Model-Based Network Intrusion Detection Algorithm Using Deep Reinforcement Learning [J].
Qiu, Lu ;
Xu, Zhiping ;
Lin, Lixiong ;
Zheng, Jiachun ;
Su, Jiahui .
MATHEMATICS, 2025, 13 (09)
[24]   A context-based meta-reinforcement learning approach to efficient hyperparameter optimization [J].
Liu, Xiyuan ;
Wu, Jia ;
Chen, Senpeng .
NEUROCOMPUTING, 2022, 478 :89-103
[25]   Node selection for model quality optimization in hierarchical federated learning based on deep reinforcement learning [J].
Li, Zhuo ;
Dang, Yashi ;
Chen, Xin .
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (03) :1720-1731
[26]   Hyperparameter Optimization of Deep Learning Models for EEG-Based Vigilance Detection [J].
Khessiba, Souhir ;
Blaiech, Ahmed Ghazi ;
Manzanera, Antoine ;
Ben Khalifa, Khaled ;
Ben Abdallah, Asma ;
Bedoui, Mohamed Hedi .
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 1653 :200-210
[27]   Hyperparameter Optimization and Importance Ranking in Deep Learning-Based Crack Segmentation [J].
Canchila, Carlos ;
Zhou, Shanglian ;
Song, Wei .
JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2024, 38 (02)
[28]   BTTackler: A Diagnosis-based Framework for Efficient Deep Learning Hyperparameter Optimization [J].
Pei, Zhongyi ;
Cen, Zhiyao ;
Huang, Yipeng ;
Wang, Chen ;
Liu, Lin ;
Yu, Philip ;
Long, Mingsheng ;
Wang, Jianmin .
PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, :2340-2351
[29]   Container stacking optimization based on Deep Reinforcement Learning [J].
Jin, Xin ;
Duan, Zhentang ;
Song, Wen ;
Li, Qiqiang .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[30]   Reentry trajectory optimization based on Deep Reinforcement Learning [J].
Gao, Jiashi ;
Shi, Xinming ;
Cheng, Zhongtao ;
Xiong, Jizhang ;
Liu, Lei ;
Wang, Yongji ;
Yang, Ye .
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, :2588-2592