Hyperparameter optimization of neural networks based on Q-learning

被引：0

作者：

Xin Qi

Bing Xu

机构：

[1] The Hong Kong Polytechnic University,Department of Aeronautical and Aviation Engineering

来源：

Signal, Image and Video Processing | 2023年 / 17卷

关键词：

Hyperparameter optimization; Q-learning; Neural networks; Markov decision process;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Machine learning algorithms are sensitive to hyperparameters, and hyperparameter optimization techniques are often computationally expensive, especially for complex deep neural networks. In this paper, we use Q-learning algorithm to search for good hyperparameter configurations for neural networks, where the learning agent searches for the optimal hyperparameter configuration by continuously updating the Q-table to optimize hyperparameter tuning strategy. We modify the initial states and termination conditions of Q-learning to improve search efficiency. The experimental results on hyperparameter optimization of a convolutional neural network and a bidirectional long short-term memory network show that our method has higher search efficiency compared with tree of Parzen estimators, random search and genetic algorithm and can find out the optimal or near-optimal hyperparameter configuration of neural network models with minimum number of trials.

引用

页码：1669 / 1676

页数：7

共 29 条

[1] Reddy AH(2022)Deep cross feature adaptive network for facial emotion classification SIViP 16 369-376
[2] Kolli K(2016)A review of automatic selection methods for machine learning algorithms and hyper-parameter values Netw. Model. Anal. Health Inform. Bioinform. 5 1-16
[3] Kiran YL(2018)Speeding up the hyperparameter optimization of deep convolutional neural networks Int. J. Comput. Intell. Appl. 17 1850008-57
[4] Luo G(2020)Efficient hyperparameter optimization for convolution neural networks in deep learning: A distributed particle swarm optimization approach Cybern. Syst. 52 36-6816
[5] Hinz T(2017)Hyperband: A novel bandit-based approach to hyperparameter optimization J. Mach. Learn. Res. 18 6765-393
[6] Navarro-Guerrero N(2020)Efficient hyperparameter optimization through model-based reinforcement learning Neurocomputing 409 381-103
[7] Magg S(2021)EMORL: Effective multi-objective reinforcement learning method for hyperparameter optimization Eng. Appl. Artif. Intell. 104 89-551
[8] Guo Y(2022)A context-based meta-reinforcement learning approach to efficient hyperparameter optimization Neurocomputing 478 541-610
[9] Li J-Y(1989)Backpropagation applied to handwritten zip code recognition Neural Comput. 1 602-1780
[10] Zhan Z-H(2005)Framewise phoneme classification with bidirectional LSTM and other neural network architectures Neural Netw. 18 1735-undefined

← 1 2 3 →