Support vector machines using GMM supervectors for speaker verification

被引：703

作者：

Campbell, WM ^{[1
]}

Sturim, DE ^{[1
]}

Reynolds, DA ^{[1
]}

机构：

[1] MIT, Lincoln Lab, Lexington, MA 02420 USA

来源：

IEEE SIGNAL PROCESSING LETTERS | 2006年 / 13卷 / 05期

关键词：

Gaussian mixture models (GMMs); speaker recognition; support vector machines (SVMs);

D O I：

10.1109/LSP.2006.870086

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Gaussian mixture models (GMMs) have proven extremely successful for text-independent speaker recognition. The standard training method for GMM models is to use MAP adaptation of the means of the mixture components based on speech from a target speaker. Recent methods in compensation for speaker and channel variability have proposed the idea of stacking the means of the GMM model to form a GMM mean supervector. We examine the idea of using the GMM supervector in a support vector machine (SVM) classifier. We propose two new SVM kernels based on distance metrics between GMM models. We show that these SVM kernels produce excellent classification accuracy in a NIST speaker recognition evaluation task.

引用

页码：308 / 311

页数：4

共 15 条

[11] Speaker verification using adapted Gaussian mixture models [J].

Reynolds, DA ;

Quatieri, TF ;

Dunn, RB .

DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41

[12]

Solomonoff A, 2005, INT CONF ACOUST SPEE, P629

[13]

SOLOMONOFF A, 2004, P OD SPEAK LANG REC, P57

[14]

Sturim D., 2005, P INT C AC SPEECH SI

[15] Speaker verification using sequence discriminant support vector machines [J].

Wan, V ;

Renals, S .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (02) :203-210

← 1 2 →