Mutual information-based dropout: Learning deep relevant feature representation architectures

被引:9
|
作者
Chen, Jie [1 ,2 ]
Wu, ZhongCheng [1 ,3 ]
Zhang, Jun [1 ]
Li, Fang [1 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei, Anhui, Peoples R China
[2] Univ Sci & Technol China, Grad Sch Comp Appl Technol, Hefei, Anhui, Peoples R China
[3] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
关键词
Mutual information; Feature representation; Dropout; Overfitting; Autoencoder; Convolutional neural networks; NEURAL-NETWORKS; AUTOENCODERS; SELECTION;
D O I
10.1016/j.neucom.2019.04.090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new regularization strategy called DropMI, which is a generalization of Dropout for the regularization of networks that introduces mutual information (MI) dynamic analysis. The standard Dropout randomly drops a certain proportion of neural units, according to the Bernoulli distribution, thereby resulting in the loss of some important hidden feature information. In DropMI, we first evaluate the importance of each neural unit in the feature representation of the hidden layer based on the MI between it and the target. We then construct a new binary mask matrix based on the sorting distribution of MI, thus developing a dynamic DropMI strategy that highlights the important neural units that are beneficial to the feature representation. The results from the MNIST, NORB, CIFAR-10, CIFAR-100, SVHN, and Multi-PIE datasets indicate that, relative to other state-of-the-art regularization methods based on the benchmark autoencoder and convolutional neural networks, our method has better feature representation performance and effectively reduces the overfitting of the model. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 184
页数:12
相关论文
共 50 条
  • [21] Mutual information-based feature selection for intrusion detection systems
    Amiri, Fatemeh
    Yousefi, MohammadMahdi Rezaei
    Lucas, Caro
    Shakery, Azadeh
    Yazdani, Nasser
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (04) : 1184 - 1199
  • [22] Heterogeneous feature subset selection using mutual information-based feature transformation
    Wei, Min
    Chow, Tommy W. S.
    Chan, Rosa H. M.
    NEUROCOMPUTING, 2015, 168 : 706 - 718
  • [23] Modified Mutual Information-based Feature Selection for Intrusion Detection Systems in Decision Tree Learning
    Song, Jingping
    Zhu, Zhiliang
    Scully, Peter
    Price, Chris
    JOURNAL OF COMPUTERS, 2014, 9 (07) : 1542 - 1546
  • [24] MIFS-ND: A mutual information-based feature selection method
    Hoque, N.
    Bhattacharyya, D. K.
    Kalita, J. K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (14) : 6371 - 6385
  • [25] SHADE: INFORMATION-BASED REGULARIZATION FOR DEEP LEARNING
    Blot, Michael
    Robert, Thomas
    Thome, Nicolas
    Cord, Matthieu
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 813 - 817
  • [26] Conditional Mutual Information-Based Feature Selection Analyzing for Synergy and Redundancy
    Cheng, Hongrong
    Qin, Zhiguang
    Feng, Chaosheng
    Wang, Yong
    Li, Fagen
    ETRI JOURNAL, 2011, 33 (02) : 210 - 218
  • [27] Multivariate Mutual Information-based Feature Selection for Cyber Intrusion Detection
    Mohammadi, Sara
    Desai, Vraj
    Karimipour, Hadis
    2018 IEEE ELECTRICAL POWER AND ENERGY CONFERENCE (EPEC), 2018,
  • [28] Mutual information-based feature extraction on the time-frequency plane
    Grall-Maës, E
    Beauseroy, P
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (04) : 779 - 790
  • [29] Modified Pointwise Mutual Information-Based Feature Selection for Text Classification
    Georgieva-Trifonova, Tsvetanka
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 333 - 353
  • [30] A novel mutual information-based feature selection approach forefficient transfer learning in aerial scene classification
    Devi, Nilakshi
    Borah, Bhogeswar
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (15-16) : 5961 - 5975