Efficient Learning Rate Adaptation for Convolutional Neural Network Training

被引:0
作者
Georgakopoulos, Spiros V. [1 ]
Plagianakos, Vassilis P. [1 ]
机构
[1] Univ Thessaly, Dept Comp Sci & Biomed Informat, Lamia 35100, Greece
来源
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2019年
关键词
Convolutional Neural Networks; Adaptive Leaning Rate; Gradient Information; ALGORITHMS;
D O I
10.1109/ijcnn.2019.8852033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) have been established as substantial supervised methods for classification problems in many research fields. However, a large number of parameters have to be tuned to achieve high performance and good classification results. One of the most crucial parameter for the performance of a CNN is the learning rate (step) of the training algorithm. Although the heuristic search to tune the learning rate is a common practice, it is extremely time-consuming, considering the fact that CNNs require a significant amount of time for each training, due to their complex architectures and high number of weights. Approaches that integrate the adaptation of the initial learning rate in the optimization algorithm, manage to converge to high quality solutions and have been embraced by the research community. In this work, we propose an improvement of the recently proposed Adaptive Learning Rate algorithm (AdLR). The proposed learning rate adaptation algorithm (e-AdLR) exhibits excellent convergence properties and classification accuracy, while at the same time is fast and robust.
引用
收藏
页数:8
相关论文
共 28 条
  • [1] Akiba T., 2017, ABS171104325 CORR
  • [2] Almeida L. B., 1998, On-line Learning in Neural Networks, P111
  • [3] [Anonymous], 2017, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2017.322
  • [4] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
  • [5] [Anonymous], ABS180209957 CORR
  • [6] Arvin AM, 2009, LIVE VARIOLA VIRUS: CONSIDERATIONS FOR CONTINUING RESEARCH, P9
  • [7] Ba J., 2015, INT C LEARN REPR
  • [8] Bottou L., 1999, Online Learning, P9
  • [9] Assessing Image Analysis Filters as Augmented Input to Convolutional Neural Networks for Image Classification
    Delibasis, K. d
    Maglogiannis, Ilias
    Georgakopoulos, S.
    Kottari, K.
    Plagianakos, V.
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 188 - 196
  • [10] Deng L., 2018, DEEP LEARNING NATURA, DOI DOI 10.1007/978-981-10-5209-5