AUTOMATIC NODE SELECTION FOR DEEP NEURAL NETWORKS USING GROUP LASSO REGULARIZATION

被引:0
作者
Ochiai, Tsubasa [1 ]
Matsuda, Shigeki [1 ]
Watanabe, Hideyuki [2 ]
Katagiri, Shigeru [1 ]
机构
[1] Doshisha Univ, Grad Sch Sci & Engn, Kyoto, Japan
[2] Adv Telecommun Res Inst Int, Kyoto, Japan
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
关键词
Deep neural networks; Group Lasso regularization; Speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We examine the effect of the Group Lasso (gLasso) regularizer in selecting the salient nodes of Deep Neural Network (DNN) hidden layers by applying a DNN-HMM hybrid speech recognizer to TED Talks speech data. We test two types of gLasso regularization, one for outgoing weight vectors and another for incoming weight vectors, as well as two sizes of DNNs: 2048 hidden layer nodes and 4096 nodes. Furthermore, we compare gLasso and L2 regularizers. Our experiment results demonstrate that our DNN training, in which the gLasso regularizer was embedded, successfully selected the hidden layer nodes that are necessary and sufficient for achieving high classification power.
引用
收藏
页码:5485 / 5489
页数:5
相关论文
共 14 条
[1]  
[Anonymous], 2013, INTERSPEECH
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]  
Hinton G., 2012, SIGNAL PROCESSING MA, V29, P8297
[4]   A fast learning algorithm for deep belief nets [J].
Hinton, Geoffrey E. ;
Osindero, Simon ;
Teh, Yee-Whye .
NEURAL COMPUTATION, 2006, 18 (07) :1527-1554
[5]  
Jian Xue, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6359, DOI 10.1109/ICASSP.2014.6854828
[6]  
Murray K., 2015, P EMNLP
[7]  
Ochiai Tsubasa, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6349, DOI 10.1109/ICASSP.2014.6854826
[8]  
Ochiai T, 2016, INT CONF ACOUST SPEE, P5015, DOI 10.1109/ICASSP.2016.7472632
[9]  
Ochiai T, 2015, INT CONF ACOUST SPEE, P4605, DOI 10.1109/ICASSP.2015.7178843
[10]   Boosted MMI for model and feature-space discriminative training [J].
Povey, Daniel ;
Kanevsky, Dimitri ;
Kingsbury, Brian ;
Ramabhodran, Bhuvana ;
Saon, George ;
Visweswariah, Karthik .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4057-4060