Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks

被引:135
|
作者
Goncalves Dos Santos, Claudio Filipi [1 ,2 ]
Papa, Joao Paulo [3 ]
机构
[1] Fed Inst Sao Carlos UFSCar, Rod Washington Luiz 235, Sao Carlos, SP, Brazil
[2] Eldorados Inst Technol, Av Alan Turing 275, Campinas, SP, Brazil
[3] Sao Paulo State Univ, UNESP, Av Eng Luis Edmundo Carrijo Coube 14-01, Bauru, SP, Brazil
关键词
Regularization; convolutional neural networks;
D O I
10.1145/3510413
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: the first one is called "data augmentation," where all the techniques focus on performing changes in the input data. The second, named "internal changes," aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called "label," concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Automatically Avoiding Overfitting in Deep Neural Networks by Using Hyper-Parameters Optimization Methods
    Kadhim, Zahraa Saddi
    Abdullah, Hasanen S.
    Ghathwan, Khalil I.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (05) : 146 - 162
  • [2] Comparison of Regularization Methods for ImageNet Classification with Deep Convolutional Neural Networks
    Smirnov, Evgeny A.
    Timoshenko, Denis M.
    Andrianov, Serge N.
    2ND AASRI CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, 2014, 6 : 89 - 94
  • [3] Convolutional Neural Networks With Dynamic Regularization
    Wang, Yi
    Bian, Zhen-Peng
    Hou, Junhui
    Chau, Lap-Pui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2299 - 2304
  • [4] Cropout: A General Mechanism for Reducing Overfitting on Convolutional Neural Networks
    Hou, Wenbo
    Wang, Wenhai
    Liu, Ruo-Ze
    Lu, Tong
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [5] The transition module: a method for preventing overfitting in convolutional neural networks
    Akbar, S.
    Peikari, M.
    Salama, S.
    Nofech-Mozes, S.
    Martel, A. L.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2019, 7 (03): : 260 - 265
  • [6] Regularization of Deep Neural Networks for EEG Seizure Detection to Mitigate Overfitting
    Saqib, Mohammed
    Zhu, Yuanda
    Wang, May
    Beaulieu-Jones, Brett
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 664 - 673
  • [7] Benign Overfitting in Two-layer Convolutional Neural Networks
    Cao, Yuan
    Chen, Zixiang
    Belkin, Mikhail
    Gu, Quanquan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [8] HOW CONVOLUTIONAL NEURAL NETWORKS SEE THE WORLD - A SURVEY OF CONVOLUTIONAL NEURAL NETWORK VISUALIZATION METHODS
    Qin, Zhuwei
    Yu, Fuxun
    Liu, Chenchen
    Chen, Xiang
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2018, 1 (02): : 149 - 180
  • [9] Multiscale Conditional Regularization for Convolutional Neural Networks
    Lu, Yao
    Lu, Guangming
    Li, Jinxing
    Xu, Yuanrong
    Zhang, Zheng
    Zhang, David
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 444 - 458
  • [10] LMix: regularization strategy for convolutional neural networks
    Yan, Linyu
    Zheng, Kunpeng
    Xia, Jinyao
    Li, Ke
    Ling, Hefei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1245 - 1253