Adaptive learning rate algorithms based on the improved Barzilai-Borwein method

被引:1
作者
Wang, Zhi-Jun [1 ,2 ,3 ]
Li, Hong [1 ]
Xu, Zhou-Xiang [1 ]
Zhao, Shuai-Ye [1 ]
Wang, Peng-Jun [4 ]
Gao, He-Bei [2 ]
机构
[1] Wenzhou Univ, Coll Comp Sci & Artificial Intelligence, Wenzhou 325035, Zhejiang, Peoples R China
[2] Wenzhou Med Univ, Eye Hosp, Oujiang Lab, Zhejiang Lab Regenerat Med Vis & Brain Hlth, Wenzhou 325000, Zhejiang, Peoples R China
[3] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200333, Peoples R China
[4] Wenzhou Univ, Coll Elect & Elect Engn, Wenzhou 325035, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Barzilai-Borwein step size; Momentum method; Unconstrained optimization; Deep learning; GRADIENT; STEP;
D O I
10.1016/j.patcog.2024.111179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The Barzilai-Borwein(BB) method is essential in solving unconstrained optimization problems. The momentum method accelerates optimization algorithms with exponentially weighted moving average. In order to design reliable deep learning optimization algorithms, this paper proposes applying the BB method in four variants to the optimization algorithm of deep learning. Findings: The momentum method generates the BB step size under different step range limits. We also apply the momentum method and its variants to the stochastic gradient descent with the BB step size. Novelty: The algorithm's robustness has been demonstrated through experiments on the initial learning rate and random seeds. The algorithm's sensitivity is tested by choosing different momentum factors until a suitable momentum factor is found. Moreover, we compare our algorithms with popular algorithms in various neural networks. The results show that the new algorithms improve the efficiency of the BB step size in deep learning and provide a variety of optimization algorithm choices.
引用
收藏
页数:9
相关论文
共 50 条
[31]   Performance Enhancement of Adaptive Neural Networks Based on Learning Rate [J].
Zubair, Swaleha ;
Singha, Anjani Kumar ;
Pathak, Nitish ;
Sharma, Neelam ;
Urooj, Shabana ;
Larguech, Samia Rabeh .
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01) :2005-2019
[32]   Seismic first-break picking based on BP neural network integrated with momentum method and adaptive learning rate method [J].
Cao X. ;
Liu B. ;
Wang S. ;
Wan X. ;
Zhang T. ;
Zhang H. .
Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (01) :71-79
[33]   An Adaptive Corner Detection Method Based on Deep Learning [J].
Wang, Liang ;
Han, Kesheng ;
Sun, Hongfei .
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, :8478-8482
[34]   The mineral intelligence identification method based on deep learning algorithms [J].
Guo Y. ;
Zhou Z. ;
Lin H. ;
Liu X. ;
Chen D. ;
Zhu J. ;
Wu J. .
Earth Science Frontiers, 2020, 27 (05) :39-47
[35]   A Novel Short-Term Residential Electric Load Forecasting Method Based on Adaptive Load Aggregation and Deep Learning Algorithms [J].
Hou, Tingting ;
Fang, Rengcun ;
Tang, Jinrui ;
Ge, Ganheng ;
Yang, Dongjun ;
Liu, Jianchao ;
Zhang, Wei .
ENERGIES, 2021, 14 (22)
[36]   A Zeroth-Order Adaptive Learning Rate Method to Reduce Cost of Hyperparameter Tuning for Deep Learning [J].
Li, Yanan ;
Ren, Xuebin ;
Zhao, Fangyuan ;
Yang, Shusen .
APPLIED SCIENCES-BASEL, 2021, 11 (21)
[37]   Auto-Ensemble: An Adaptive Learning Rate Scheduling Based Deep Learning Model Ensembling [J].
Yang, Jun ;
Wang, Fei .
IEEE ACCESS, 2020, 8 :217499-217509
[38]   A Method of Small Object Detection Based on Improved Deep Learning [J].
Yu, Changgeng ;
Liu, Kai ;
Zou, Wei .
OPTICAL MEMORY AND NEURAL NETWORKS, 2020, 29 (02) :69-76
[39]   Sports Video Classification Method Based on Improved Deep Learning [J].
Gao, Tianhao ;
Zhang, Meng ;
Zhu, Yifan ;
Zhang, Youjian ;
Pang, Xiangsheng ;
Ying, Jing ;
Liu, Wenming .
APPLIED SCIENCES-BASEL, 2024, 14 (02)
[40]   A Method of Small Object Detection Based on Improved Deep Learning [J].
Kai Changgeng Yu ;
Wei Liu .
Optical Memory and Neural Networks, 2020, 29 :69-76