Hyperparameter optimization of neural networks based on Q-learning

被引:11
作者
Qi, Xin [1 ]
Xu, Bing [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hung Hom, Hong Kong, Peoples R China
关键词
Hyperparameter optimization; Q-learning; Neural networks; Markov decision process;
D O I
10.1007/s11760-022-02377-y
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning algorithms are sensitive to hyperparameters, and hyperparameter optimization techniques are often computationally expensive, especially for complex deep neural networks. In this paper, we use Q-learning algorithm to search for good hyperparameter configurations for neural networks, where the learning agent searches for the optimal hyperparameter configuration by continuously updating the Q-table to optimize hyperparameter tuning strategy. We modify the initial states and termination conditions of Q-learning to improve search efficiency. The experimental results on hyperparameter optimization of a convolutional neural network and a bidirectional long short-term memory network show that our method has higher search efficiency compared with tree of Parzen estimators, random search and genetic algorithm and can find out the optimal or near-optimal hyperparameter configuration of neural network models with minimum number of trials.
引用
收藏
页码:1669 / 1676
页数:8
相关论文
共 36 条
  • [1] Abreu S., 2019, ARXIV
  • [2] Awad N, 2021, ARXIV
  • [3] Baker B, 2016, ARXIV
  • [4] Bergstra J., 2011, ADV NEURAL INFORM PR, P2546
  • [5] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [6] EMORL: Effective multi-objective reinforcement learning method for hyperparameter optimization
    Chen, SenPeng
    Wu, Jia
    Liu, XiYuan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [7] Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking
    Dong, Xingping
    Shen, Jianbing
    Wang, Wenguan
    Shao, Ling
    Ling, Haibin
    Porikli, Fatih
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1515 - 1529
  • [8] Falkner S, 2018, PR MACH LEARN RES, V80
  • [9] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    Graves, A
    Schmidhuber, J
    [J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
  • [10] Efficient Hyperparameter Optimization for Convolution Neural Networks in Deep Learning: A Distributed Particle Swarm Optimization Approach
    Guo, Yu
    Li, Jian-Yu
    Zhan, Zhi-Hui
    [J]. CYBERNETICS AND SYSTEMS, 2020, 52 (01) : 36 - 57