Regularizing Deep Neural Networks by Enhancing Diversity in Feature Extraction

Cited by: 59
Authors
Ayinde, Babajide O. [1]
Inanc, Tamer [1]
Zurada, Jacek M. [1, 2]
Affiliations
[1] Univ Louisville, Dept Elect & Comp Engn, Louisville, KY 40292 USA
[2] Univ Social Sci, Informat Technol Inst, PL-90113 Lodz, Poland
Funding
National Science Foundation (USA);
Keywords
Cosine similarity; deep learning; feature clustering; feature correlation; redundancy elimination; regularization;
DOI
10.1109/TNNLS.2018.2885972
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper proposes a new and efficient technique for regularizing deep neural networks using correlations among features. Previous studies have shown that oversized deep neural network models tend to produce many redundant features that are either shifted versions of one another or are very similar and show little or no variation, which results in redundant filtering. We propose a way to address this problem and show that such redundancy can be avoided using regularization and an adaptive feature dropout mechanism. We show that regularizing both negatively and positively correlated features according to their differentiation and based on their relative cosine distances yields a network that extracts dissimilar features, with less overfitting and better generalization. This concept is illustrated with a deep multilayer perceptron, convolutional neural network, sparse autoencoder, gated recurrent unit, and long short-term memory on the MNIST digit recognition, CIFAR-10, ImageNet, and Stanford Natural Language Inference data sets.
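The idea of penalizing features by their pairwise cosine similarity can be illustrated with a minimal sketch. This is not the paper's formulation: the function names, the `threshold` parameter, and the squared-similarity penalty are assumptions chosen for illustration. Squaring the similarity means that strongly negative and strongly positive correlations are both penalized, in the spirit of the abstract.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two weight vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def diversity_penalty(features, threshold=0.5):
    """Hypothetical penalty: sum of squared cosine similarities over all
    feature pairs whose absolute similarity exceeds `threshold`.

    `features` is a list of weight vectors, one per hidden unit or filter.
    """
    penalty = 0.0
    n = len(features)
    for i in range(n):
        for j in range(i + 1, n):
            s = cosine_similarity(features[i], features[j])
            if abs(s) > threshold:  # only near-duplicate (or near-opposite) pairs count
                penalty += s * s
    return penalty


# Toy example: four 2-D "features".
features = [
    [1.0, 0.0],
    [2.0, 0.0],   # same direction as the first (similarity 1)
    [0.0, 3.0],   # orthogonal to the others (similarity 0)
    [-1.0, 0.0],  # opposite direction to the first (similarity -1)
]
print(diversity_penalty(features))  # prints 3.0
```

In a training loop, a term like this would typically be scaled by a small coefficient and added to the task loss, so that gradient descent pushes redundant features apart; that coefficient and the threshold would both be hyperparameters in this sketch.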
Pages: 2650-2661
Page count: 12