Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System

被引:0
作者
Rakhmanenko, Ivan [1 ]
Meshcheryakov, Roman [1 ]
机构
[1] Tomsk State Univ, Control Syst & Radioelect, Tomsk, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Speaker verification; Gaussian mixture model; GMM-UBM system; Mel frequency cepstral coefficients; Speech features; Small speaker set; Speech processing; RECOGNITION;
D O I
10.1007/978-3-319-43958-7_78
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper overviews the application sphere of speaker verification systems and illustrates the use of the Gaussian mixture model and the universal background model (GMM-UBM) in an automatic text-independent speaker verification task. The experimental evaluation of the GMM-UBM system using different speech features is conducted on a 50 speaker set and a result is presented. Equal error rate (EER) using 256 component Gaussian mixture model and feature vector containing 14 mel frequency cepstral coefficients (MFCC) and the voicing probability is 0,76 %. Comparing to standard 14 MFCC vector 23,7 % of EER improvement was acquired.
引用
收藏
页码:645 / 650
页数:6
相关论文
共 10 条
[1]  
[Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI DOI 10.1145/2502081.2502224
[2]  
[Anonymous], 2012, INFORM PROCESSES
[3]   AUTOMATIC RECOGNITION OF SPEAKERS FROM THEIR VOICES [J].
ATAL, BS .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :460-475
[4]   Speaker recognition: A tutorial [J].
Campbell, JP .
PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462
[5]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[6]  
Jurafsky D., 2009, SPEECH LANGUAGE PROC
[7]  
Reynolds D.A., 2008, ENCY BIOMETRIC RECOG
[8]   ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].
REYNOLDS, DA ;
ROSE, RC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83
[9]   Speaker verification using adapted Gaussian mixture models [J].
Reynolds, DA ;
Quatieri, TF ;
Dunn, RB .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41
[10]  
Sadjadi S. O., 2013, SPEECH LANGUAGE PROC