ATTENTIONDROP FOR CONVOLUTIONAL NEURAL NETWORKS

被引:6
作者
Ouyang, Zhihao [1 ]
Feng, Yan [1 ,2 ]
He, Zihao [1 ]
Hao, Tianbo [1 ]
Dai, Tao [1 ,2 ]
Xia, Shu-Tao [1 ,2 ]
机构
[1] Tsinghua Univ, Grad Sch Shenzhen, Shenzhen, Guangdong, Peoples R China
[2] Peng Cheng Lab, PCL Res Ctr Networks & Commun, Shenzhen, Guangdong, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年
基金
中国国家自然科学基金;
关键词
Dropout; Attention; Adaptive; Regularization; Convolutional Neural Networks;
D O I
10.1109/ICME.2019.00233
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
but becomes less effective for convolutional neural networks (CNNs), since the spatially correlated features still allow dropped information to flow through the network. To make dropout more practical for CNNs, structured dropout methods have been recently proposed by dropping regions with fixed shapes and random positions, which nonetheless may lead to unexpected discarding of information. To address this problem, in this paper, we propose a novel dropout variant based on attention information named AttentionDrop that drops features adaptively. Specifically, it precisely localizes masks that have irregular shapes according to the values of activation units. In addition, the use of soft values in adaptive masks lowers the risk of a complete loss of indispensable information. Experimental results demonstrate the effectiveness of our AttentionDrop on public datasets for image classification. Code is available at https://github.com/Kira0096/smart-drop/.
引用
收藏
页码:1342 / 1347
页数:6
相关论文
共 17 条
  • [1] Bengio Y., 2013, INT C NEURAL INFORM, P1319
  • [2] Devries Terrance, 2017, arXiv
  • [3] Gastaldi Xavier, 2017, ARXIV170507485
  • [4] Ghiasi G, 2018, ADV NEUR IN, V31
  • [5] He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
  • [6] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [7] Hinton GE., 2012, ARXIV
  • [8] Deep Networks with Stochastic Depth
    Huang, Gao
    Sun, Yu
    Liu, Zhuang
    Sedra, Daniel
    Weinberger, Kilian Q.
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 646 - 661
  • [9] Krizhevsky A., 2009, Tech. Rep. TR-2009, P1
  • [10] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90