Revisiting spatial dropout for regularizing convolutional neural networks

被引：36

作者：

Lee, Sanghun ^{[1
]}

Lee, Chulhee ^{[1
]}

机构：

[1] Yonsei Univ, Dept Elect & Elect Engn, 134 Shinchon Dong, Seoul, South Korea

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2020年 / 79卷 / 45-46期

基金：

新加坡国家研究基金会;

关键词：

Network regularization; Convolutional neural network; Spatial dropout; Deep learning;

D O I：

10.1007/s11042-020-09054-7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Overfitting is one of the most challenging problems in deep neural networks with a large number of trainable parameters. To prevent networks from overfitting, the dropout method, which is a strong regularization technique, has been widely used in fully-connected neural networks. In several state-of-the-art convolutional neural network architectures for object classification, however, dropout was partially or not even applied since its accuracy gain was relatively insignificant in most cases. Also, the batch normalization technique reduced the need for the dropout method because of its regularization effect. In this paper, we show that conventional element-wise dropout can be ineffective for convolutional layers. We found that dropout between channels in the CNNs can be functionally similar to dropout in the FCNNs, and spatial dropout can be an effective way to take advantage of the dropout technique for regularizing. To prove our points, we conducted several experiments using the CIFAR-10 and CIFAR-100 databases. For comparison, we only replaced the dropout layers with spatial dropout layers and kept all other hyperparameters and methods intact. DenseNet-BC with spatial dropout showed promising results (3.32% error rates with CIFAR-10, 3.0 M parameters) compared to other existing competitive methods.

引用

页码：34195 / 34207

页数：13

共 27 条

[1]

[Anonymous], 2019, Advances in Neural Information Processing Systems

[2]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[3]

Fortunato M., 2017, arXiv preprint arXiv: 1704.02798

[4]

Gal Y, 2016, ADV NEURAL INFORM PR, P1019, DOI DOI 10.48550/ARXIV.1512.05287

[5]

Ghiasi G, 2018, ADV NEUR IN, V31

[6]

Gross S., 2016, Facebook AI Res.

[7]

He K, 2016, PROC CVPR IEEE, P770, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]

[8]

Hinton G., 2009, Handbook of Systemic Autoimmune Diseases

[9]

Hinton G. E., 2012, Improving neural networks by preventing co-adaptation of feature detectors

[10]

Huang G., 2017, P IEEE C COMP VIS PA, P4700, DOI [10.1109/CVPR.2017.243, DOI 10.1109/CVPR.2017.243]

← 1 2 3 →