Support vector machines using GMM supervectors for speaker verification

被引:703
作者
Campbell, WM [1 ]
Sturim, DE [1 ]
Reynolds, DA [1 ]
机构
[1] MIT, Lincoln Lab, Lexington, MA 02420 USA
关键词
Gaussian mixture models (GMMs); speaker recognition; support vector machines (SVMs);
D O I
10.1109/LSP.2006.870086
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gaussian mixture models (GMMs) have proven extremely successful for text-independent speaker recognition. The standard training method for GMM models is to use MAP adaptation of the means of the mixture components based on speech from a target speaker. Recent methods in compensation for speaker and channel variability have proposed the idea of stacking the means of the GMM model to form a GMM mean supervector. We examine the idea of using the GMM supervector in a support vector machine (SVM) classifier. We propose two new SVM kernels based on distance metrics between GMM models. We show that these SVM kernels produce excellent classification accuracy in a NIST speaker recognition evaluation task.
引用
收藏
页码:308 / 311
页数:4
相关论文
共 15 条
[11]   Speaker verification using adapted Gaussian mixture models [J].
Reynolds, DA ;
Quatieri, TF ;
Dunn, RB .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41
[12]  
Solomonoff A, 2005, INT CONF ACOUST SPEE, P629
[13]  
SOLOMONOFF A, 2004, P OD SPEAK LANG REC, P57
[14]  
Sturim D., 2005, P INT C AC SPEECH SI
[15]   Speaker verification using sequence discriminant support vector machines [J].
Wan, V ;
Renals, S .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02) :203-210