Hyperparameter optimization of neural networks based on Q-learning

被引：11

作者：

Qi, Xin ^{[1
]}

Xu, Bing ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Hong Kong, Peoples R China

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2023年 / 17卷 / 04期

关键词：

Hyperparameter optimization; Q-learning; Neural networks; Markov decision process;

D O I：

10.1007/s11760-022-02377-y

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Machine learning algorithms are sensitive to hyperparameters, and hyperparameter optimization techniques are often computationally expensive, especially for complex deep neural networks. In this paper, we use Q-learning algorithm to search for good hyperparameter configurations for neural networks, where the learning agent searches for the optimal hyperparameter configuration by continuously updating the Q-table to optimize hyperparameter tuning strategy. We modify the initial states and termination conditions of Q-learning to improve search efficiency. The experimental results on hyperparameter optimization of a convolutional neural network and a bidirectional long short-term memory network show that our method has higher search efficiency compared with tree of Parzen estimators, random search and genetic algorithm and can find out the optimal or near-optimal hyperparameter configuration of neural network models with minimum number of trials.

引用

页码：1669 / 1676

页数：8

共 36 条

[1] Abreu S., 2019, ARXIV
[2] Awad N, 2021, ARXIV
[3] Baker B, 2016, ARXIV
[4] Bergstra J., 2011, ADV NEURAL INFORM PR, P2546
[5] Bergstra J, 2012, J MACH LEARN RES, V13, P281
[6] EMORL: Effective multi-objective reinforcement learning method for hyperparameter optimization
Chen, SenPeng
Wu, Jia
Liu, XiYuan
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
[7] Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking
Dong, Xingping
Shen, Jianbing
Wang, Wenguan
Shao, Ling
Ling, Haibin
Porikli, Fatih
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1515 - 1529
[8] Falkner S, 2018, PR MACH LEARN RES, V80
[9] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
Graves, A
Schmidhuber, J
[J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
[10] Efficient Hyperparameter Optimization for Convolution Neural Networks in Deep Learning: A Distributed Particle Swarm Optimization Approach
Guo, Yu
Li, Jian-Yu
Zhan, Zhi-Hui
[J]. CYBERNETICS AND SYSTEMS, 2020, 52 (01) : 36 - 57

← 1 2 3 4 →