Stochastic gradient descent analysis for the evaluation of a speaker recognition

被引:1
作者
Nasef, Ashrf [1 ]
Marjanovic-Jakovljevic, Marina [1 ]
Njegus, Angelina [2 ]
机构
[1] Singidunum Univ, Fac Elect Engn & Comp, Danijelova 32, Belgrade 11000, Serbia
[2] Singidunum Univ, Fac Informat & Comp, Belgrade 11000, Serbia
关键词
Pattern recognition; Speech analysis; Deep learning Neural Network; Stochastic gradient descent; Learning rate; Dropout rate; NEURAL-NETWORKS; SPEECH;
D O I
10.1007/s10470-016-0918-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Performance optimization in speaker recognition is a challenging task in the field of vocal based human-computer interaction. Many researches have shown that deep learning Neural Network methods have the best performance in comparison with other classifiers. However, those methods with many parameters require a lot of tunings in order to optimize the performance in different supervised learning tasks. In this paper, we show that picking a good combination of parameters can significantly improve the performance of Stochastic Gradient Descent deep learning Neural Network method in automatic speaker recognition even in a noisy environment. Parameters that are analyzed are learning rate, hidden and input layer dropout rate.
引用
收藏
页码:389 / 397
页数:9
相关论文
共 36 条
[1]  
[Anonymous], 2011, Speech and Audio Signal Processing: Processing and Perception of Speech and Music, DOI 10.1002/9781118142882
[2]  
[Anonymous], 1977, DISCRETE TIME SIGNAL
[3]  
[Anonymous], 2013, IEEE INT C AC SPEECH
[4]  
[Anonymous], 2011, ICML
[5]  
Anusuya MA., 2009, INT J COMPUTER SCI I, V6, P181, DOI DOI 10.1109/PROC.1976.10158
[6]  
Biswas A., 2015, IEEE 2 INT C REC TRE
[7]  
Boersma P., 2016, PRAAT SOFTWARE
[8]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[9]   Speaker recognition: A tutorial [J].
Campbell, JP .
PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462
[10]  
CHEN C.-H., 1988, SIGNAL PROCESSING HD