DiffMoment: an adaptive optimization technique for convolutional neural network

被引:0
|
作者
Shubhankar Bhakta
Utpal Nandi
Tapas Si
Sudipta Kr Ghosal
Chiranjit Changdar
Rajat Kumar Pal
机构
[1] Vidyasagar University,Dept. of Computer Science
[2] Bankura Unnayani Institute of Engineering,Dept. of Computer Science and Engineering
[3] Behala Goverment Polytechnic,Dept. of Computer Science and Technology
[4] Belda College,Dept. of Computer Science
[5] University of Calcutta,Dept. of Computer Science and Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Neural networks; Optimizer; Gradient descent; Adam; Difference of momentum;
D O I
暂无
中图分类号
学科分类号
摘要
Stochastic Gradient Decent (SGD) is a very popular basic optimizer applied in the learning algorithms of deep neural networks. However, it has fixed-sized steps for every epoch without considering gradient behaviour to determine step size. The improved SGD optimizers like AdaGrad, Adam, AdaDelta, RAdam, and RMSProp make step sizes adaptive in every epoch. However, these optimizers depend on square roots of exponential moving averages (EMA) of squared previous gradients or momentums or both and cannot take the benefit of local change in gradients or momentums or both. To reduce these limitations, a novel optimizer has been presented in this paper where the adjustment of step size is done for each parameter based on changing information between the 1st and the 2nd moment estimate (i.e., diffMoment). The experimental results depict that diffMoment offers better performance than AdaGrad, Adam, AdaDelta, RAdam, and RMSProp optimizers. It is also noticed that diffMoment does uniformly better for training Convolutional Neural Networks (CNN) applying different activation functions.
引用
收藏
页码:16844 / 16858
页数:14
相关论文
共 50 条
  • [1] DiffMoment: an adaptive optimization technique for convolutional neural network
    Bhakta, Shubhankar
    Nandi, Utpal
    Si, Tapas
    Ghosal, Sudipta Kr
    Changdar, Chiranjit
    Pal, Rajat Kumar
    APPLIED INTELLIGENCE, 2023, 53 (13) : 16844 - 16858
  • [2] Adaptive Protection Scheme for FREEDM Microgrid Based on Convolutional Neural Network and Gorilla Troops Optimization Technique
    Hatata, Ahmed Y.
    Essa, Mohamed A.
    Sedhom, Bishoy E.
    IEEE ACCESS, 2022, 10 : 55583 - 55601
  • [3] Adaptive Deep Learning with Optimization Hybrid Convolutional Neural Network and Recurrent Neural Network for Prediction Lemon Fruit Ripeness
    Watnakornbuncha, Darunee
    Am-Dee, Noppadol
    Sangsongfa, Adisak
    PRZEGLAD ELEKTROTECHNICZNY, 2024, 100 (03): : 202 - 211
  • [4] A Scalable and Adaptive Convolutional Neural Network Accelerator
    Pidanic, Jan
    Vyas, Arpan
    Karki, Rishav
    Vij, Prateek
    Trivedi, Gaurav
    Nemec, Zdenek
    2022 32ND INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2022, : 138 - 142
  • [5] Multiobjective visual evolutionary neural network and related convolutional neural network optimization
    Zhang, Zhuhong
    Li, Lun
    Lu, Jiaxuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [6] A novel IoT network intrusion detection approach based on Adaptive Particle Swarm Optimization Convolutional Neural Network
    Kan, Xiu
    Fan, Yixuan
    Fang, Zhijun
    Cao, Le
    Xiong, Neal N.
    Yang, Dan
    Li, Xuan
    INFORMATION SCIENCES, 2021, 568 : 147 - 162
  • [7] An evolutionary framework for designing adaptive convolutional neural network
    Mishra, Vidyanand
    Kane, Lalit
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
  • [8] The Adaptive Wideband Beamforming using Convolutional Neural Network
    Wu, Xun
    Zhang, Shurui
    Ma, Xiaofeng
    Guo, Shanhong
    Sheng, Weixing
    2022 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT), 2022,
  • [9] Robust Adaptive Beamforming Based on a Convolutional Neural Network
    Liao, Zhipeng
    Duan, Keqing
    He, Jinjun
    Qiu, Zizhou
    Li, Binbin
    ELECTRONICS, 2023, 12 (12)
  • [10] Adaptive Modular Convolutional Neural Network for Image Recognition
    Wu, Wenbo
    Pan, Yun
    SENSORS, 2022, 22 (15)