Adaptive learning rate algorithms based on the improved Barzilai-Borwein method

被引:0
作者
Wang, Zhi-Jun [1 ,2 ,3 ]
Li, Hong [1 ]
Xu, Zhou-Xiang [1 ]
Zhao, Shuai-Ye [1 ]
Wang, Peng-Jun [4 ]
Gao, He-Bei [2 ]
机构
[1] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Wenzhou 325035, Zhejiang, Peoples R China
[2] Wenzhou Med Univ, Eye Hosp, Oujiang Lab, Zhejiang Lab Regenerat Med Vis & Brain Hlth, Wenzhou 325000, Zhejiang, Peoples R China
[3] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200333, Peoples R China
[4] Wenzhou Univ, Coll Elect & Elect Engn, Wenzhou 325035, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Barzilai-Borwein step size; Momentum method; Unconstrained optimization; Deep learning; GRADIENT; STEP;
D O I
10.1016/j.patcog.2024.111179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The Barzilai-Borwein(BB) method is essential in solving unconstrained optimization problems. The momentum method accelerates optimization algorithms with exponentially weighted moving average. In order to design reliable deep learning optimization algorithms, this paper proposes applying the BB method in four variants to the optimization algorithm of deep learning. Findings: The momentum method generates the BB step size under different step range limits. We also apply the momentum method and its variants to the stochastic gradient descent with the BB step size. Novelty: The algorithm's robustness has been demonstrated through experiments on the initial learning rate and random seeds. The algorithm's sensitivity is tested by choosing different momentum factors until a suitable momentum factor is found. Moreover, we compare our algorithms with popular algorithms in various neural networks. The results show that the new algorithms improve the efficiency of the BB step size in deep learning and provide a variety of optimization algorithm choices.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] A new simple model trust-region method with generalized Barzilai-Borwein parameter for large-scale optimization
    ZHOU QunYan
    SUN WenYu
    ZHANG HongChao
    Science China(Mathematics), 2016, 59 (11) : 2265 - 2280
  • [22] A faster path-based algorithm with Barzilai-Borwein step size for solving stochastic traffic equilibrium models
    Du, Muqing
    Tan, Heqing
    Chen, Anthony
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 290 (03) : 982 - 999
  • [23] A new simple model trust-region method with generalized Barzilai-Borwein parameter for large-scale optimization
    QunYan Zhou
    WenYu Sun
    HongChao Zhang
    Science China Mathematics, 2016, 59 : 2265 - 2280
  • [24] Adaptive Learning Rate Method Based on Nesterov Accelerated Gradient
    Xu, Zhenxing
    Yang, Ping
    Xu, Bing
    Li, Heping
    AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462
  • [25] A novel method based on deep learning algorithms for material deformation rate detection
    Ozdem, Selim
    Orak, Ilhami Muharrem
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, : 3249 - 3270
  • [26] The Improved Training Algorithm of Deep Learning with Self-Adaptive Learning Rate
    Ongart, Sutit
    Jearanaitanakij, Kietikul
    Sangthong, Jirapat
    2018 18TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2018, : 463 - 466
  • [27] Appropriate Learning Rates of Adaptive Learning Rate Optimization Algorithms for Training Deep Neural Networks
    Iiduka, Hideaki
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13250 - 13261
  • [28] An Improved Reinforcement Learning Method Based on Unsupervised Learning
    Chang, Xin
    Li, Yanbin
    Zhang, Guanjie
    Liu, Donghui
    Fu, Changjun
    IEEE ACCESS, 2024, 12 : 12295 - 12307
  • [29] An improved ensemble learning method for exchange rate forecasting based on complementary effect of shallow and deep features
    Wang, Gang
    Tao, Tao
    Ma, Jingling
    Li, Hui
    Fu, Huimin
    Chu, Yan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [30] AdaCB: An Adaptive Gradient Method with Convergence Range Bound of Learning Rate
    Liao, Xuanzhi
    Sahran, Shahnorbanun
    Abdullah, Azizi
    Shukor, Syaimak Abdul
    APPLIED SCIENCES-BASEL, 2022, 12 (18):