Hyperparameter optimization of neural networks based on Q-learning

被引:0
作者
Xin Qi
Bing Xu
机构
[1] The Hong Kong Polytechnic University,Department of Aeronautical and Aviation Engineering
来源
Signal, Image and Video Processing | 2023年 / 17卷
关键词
Hyperparameter optimization; Q-learning; Neural networks; Markov decision process;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning algorithms are sensitive to hyperparameters, and hyperparameter optimization techniques are often computationally expensive, especially for complex deep neural networks. In this paper, we use Q-learning algorithm to search for good hyperparameter configurations for neural networks, where the learning agent searches for the optimal hyperparameter configuration by continuously updating the Q-table to optimize hyperparameter tuning strategy. We modify the initial states and termination conditions of Q-learning to improve search efficiency. The experimental results on hyperparameter optimization of a convolutional neural network and a bidirectional long short-term memory network show that our method has higher search efficiency compared with tree of Parzen estimators, random search and genetic algorithm and can find out the optimal or near-optimal hyperparameter configuration of neural network models with minimum number of trials.
引用
收藏
页码:1669 / 1676
页数:7
相关论文
共 29 条
  • [1] Reddy AH(2022)Deep cross feature adaptive network for facial emotion classification SIViP 16 369-376
  • [2] Kolli K(2016)A review of automatic selection methods for machine learning algorithms and hyper-parameter values Netw. Model. Anal. Health Inform. Bioinform. 5 1-16
  • [3] Kiran YL(2018)Speeding up the hyperparameter optimization of deep convolutional neural networks Int. J. Comput. Intell. Appl. 17 1850008-57
  • [4] Luo G(2020)Efficient hyperparameter optimization for convolution neural networks in deep learning: A distributed particle swarm optimization approach Cybern. Syst. 52 36-6816
  • [5] Hinz T(2017)Hyperband: A novel bandit-based approach to hyperparameter optimization J. Mach. Learn. Res. 18 6765-393
  • [6] Navarro-Guerrero N(2020)Efficient hyperparameter optimization through model-based reinforcement learning Neurocomputing 409 381-103
  • [7] Magg S(2021)EMORL: Effective multi-objective reinforcement learning method for hyperparameter optimization Eng. Appl. Artif. Intell. 104 89-551
  • [8] Guo Y(2022)A context-based meta-reinforcement learning approach to efficient hyperparameter optimization Neurocomputing 478 541-610
  • [9] Li J-Y(1989)Backpropagation applied to handwritten zip code recognition Neural Comput. 1 602-1780
  • [10] Zhan Z-H(2005)Framewise phoneme classification with bidirectional LSTM and other neural network architectures Neural Netw. 18 1735-undefined