ATTENTIONDROP FOR CONVOLUTIONAL NEURAL NETWORKS

被引：6

作者：

Ouyang, Zhihao ^{[1
]}

Feng, Yan ^{[1
,2
]}

He, Zihao ^{[1
]}

Hao, Tianbo ^{[1
]}

Dai, Tao ^{[1
,2
]}

Xia, Shu-Tao ^{[1
,2
]}

机构：

[1] Tsinghua Univ, Grad Sch Shenzhen, Shenzhen, Guangdong, Peoples R China

[2] Peng Cheng Lab, PCL Res Ctr Networks & Commun, Shenzhen, Guangdong, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年

基金：

中国国家自然科学基金;

关键词：

Dropout; Attention; Adaptive; Regularization; Convolutional Neural Networks;

D O I：

10.1109/ICME.2019.00233

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

but becomes less effective for convolutional neural networks (CNNs), since the spatially correlated features still allow dropped information to flow through the network. To make dropout more practical for CNNs, structured dropout methods have been recently proposed by dropping regions with fixed shapes and random positions, which nonetheless may lead to unexpected discarding of information. To address this problem, in this paper, we propose a novel dropout variant based on attention information named AttentionDrop that drops features adaptively. Specifically, it precisely localizes masks that have irregular shapes according to the values of activation units. In addition, the use of soft values in adaptive masks lowers the risk of a complete loss of indispensable information. Experimental results demonstrate the effectiveness of our AttentionDrop on public datasets for image classification. Code is available at https://github.com/Kira0096/smart-drop/.

引用

页码：1342 / 1347

页数：6

共 17 条

[1] Bengio Y., 2013, INT C NEURAL INFORM, P1319
[2] Devries Terrance, 2017, arXiv
[3] Gastaldi Xavier, 2017, ARXIV170507485
[4] Ghiasi G, 2018, ADV NEUR IN, V31
[5] He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[6] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[7] Hinton GE., 2012, ARXIV
[8] Deep Networks with Stochastic Depth
Huang, Gao
Sun, Yu
Liu, Zhuang
Sedra, Daniel
Weinberger, Kilian Q.
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 646 - 661
[9] Krizhevsky A., 2009, Tech. Rep. TR-2009, P1
[10] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90

← 1 2 →