DiffMoment: an adaptive optimization technique for convolutional neural network

被引：0

作者：

Shubhankar Bhakta

Utpal Nandi

Tapas Si

Sudipta Kr Ghosal

Chiranjit Changdar

Rajat Kumar Pal

机构：

[1] Vidyasagar University,Dept. of Computer Science

[2] Bankura Unnayani Institute of Engineering,Dept. of Computer Science and Engineering

[3] Behala Goverment Polytechnic,Dept. of Computer Science and Technology

[4] Belda College,Dept. of Computer Science

[5] University of Calcutta,Dept. of Computer Science and Engineering

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Neural networks; Optimizer; Gradient descent; Adam; Difference of momentum;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Stochastic Gradient Decent (SGD) is a very popular basic optimizer applied in the learning algorithms of deep neural networks. However, it has fixed-sized steps for every epoch without considering gradient behaviour to determine step size. The improved SGD optimizers like AdaGrad, Adam, AdaDelta, RAdam, and RMSProp make step sizes adaptive in every epoch. However, these optimizers depend on square roots of exponential moving averages (EMA) of squared previous gradients or momentums or both and cannot take the benefit of local change in gradients or momentums or both. To reduce these limitations, a novel optimizer has been presented in this paper where the adjustment of step size is done for each parameter based on changing information between the 1st and the 2nd moment estimate (i.e., diffMoment). The experimental results depict that diffMoment offers better performance than AdaGrad, Adam, AdaDelta, RAdam, and RMSProp optimizers. It is also noticed that diffMoment does uniformly better for training Convolutional Neural Networks (CNN) applying different activation functions.

引用

页码：16844 / 16858

页数：14

共 50 条

[1] DiffMoment: an adaptive optimization technique for convolutional neural network
Bhakta, Shubhankar
Nandi, Utpal
Si, Tapas
Ghosal, Sudipta Kr
Changdar, Chiranjit
Pal, Rajat Kumar
APPLIED INTELLIGENCE, 2023, 53 (13) : 16844 - 16858
[2] Adaptive Protection Scheme for FREEDM Microgrid Based on Convolutional Neural Network and Gorilla Troops Optimization Technique
Hatata, Ahmed Y.
Essa, Mohamed A.
Sedhom, Bishoy E.
IEEE ACCESS, 2022, 10 : 55583 - 55601
[3] Adaptive Deep Learning with Optimization Hybrid Convolutional Neural Network and Recurrent Neural Network for Prediction Lemon Fruit Ripeness
Watnakornbuncha, Darunee
Am-Dee, Noppadol
Sangsongfa, Adisak
PRZEGLAD ELEKTROTECHNICZNY, 2024, 100 (03): : 202 - 211
[4] A Scalable and Adaptive Convolutional Neural Network Accelerator
Pidanic, Jan
Vyas, Arpan
Karki, Rishav
Vij, Prateek
Trivedi, Gaurav
Nemec, Zdenek
2022 32ND INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2022, : 138 - 142
[5] Multiobjective visual evolutionary neural network and related convolutional neural network optimization
Zhang, Zhuhong
Li, Lun
Lu, Jiaxuan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
[6] A novel IoT network intrusion detection approach based on Adaptive Particle Swarm Optimization Convolutional Neural Network
Kan, Xiu
Fan, Yixuan
Fang, Zhijun
Cao, Le
Xiong, Neal N.
Yang, Dan
Li, Xuan
INFORMATION SCIENCES, 2021, 568 : 147 - 162
[7] An evolutionary framework for designing adaptive convolutional neural network
Mishra, Vidyanand
Kane, Lalit
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
[8] The Adaptive Wideband Beamforming using Convolutional Neural Network
Wu, Xun
Zhang, Shurui
Ma, Xiaofeng
Guo, Shanhong
Sheng, Weixing
2022 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT), 2022,
[9] Robust Adaptive Beamforming Based on a Convolutional Neural Network
Liao, Zhipeng
Duan, Keqing
He, Jinjun
Qiu, Zizhou
Li, Binbin
ELECTRONICS, 2023, 12 (12)
[10] Adaptive Modular Convolutional Neural Network for Image Recognition
Wu, Wenbo
Pan, Yun
SENSORS, 2022, 22 (15)

← 1 2 3 4 5 →