Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System

Cited by: 0

Authors:

Rakhmanenko, Ivan [1]

Meshcheryakov, Roman [1]

Affiliations:

[1] Tomsk State Univ, Control Syst & Radioelect, Tomsk, Russia

Source:

SPEECH AND COMPUTER | 2016 / Vol. 9811

Keywords:

Speaker recognition; Speaker verification; Gaussian mixture model; GMM-UBM system; Mel frequency cepstral coefficients; Speech features; Small speaker set; Speech processing; RECOGNITION;

DOI:

10.1007/978-3-319-43958-7_78

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper surveys the application areas of speaker verification systems and illustrates the use of the Gaussian mixture model and universal background model (GMM-UBM) in an automatic text-independent speaker verification task. The GMM-UBM system is evaluated experimentally with different speech features on a set of 50 speakers, and the results are presented. With a 256-component Gaussian mixture model and a feature vector containing 14 mel frequency cepstral coefficients (MFCC) plus the voicing probability, the equal error rate (EER) is 0.76%. Compared with the standard 14-MFCC vector, this is an EER improvement of 23.7%.
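The equal error rate quoted in the abstract is the operating point at which the false-acceptance and false-rejection rates coincide. A minimal sketch of how such a figure can be estimated from verification scores follows. It is not the authors' evaluation code: the function name `equal_error_rate`, the threshold sweep over observed scores, and the toy score lists are all assumptions made for illustration. In a GMM-UBM system the scores would typically be log-likelihood ratios between the speaker model and the UBM.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Estimate the EER by sweeping a threshold over all observed scores.

    Hypothetical helper: a trial is accepted when its score >= threshold.
    Returns (eer, threshold) at the point where the false-acceptance rate
    (FAR) and false-rejection rate (FRR) are closest to each other.
    """
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best = None  # (|FAR - FRR|, EER estimate, threshold)
    for t in thresholds:
        # Genuine trials scored below the threshold are falsely rejected.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # Impostor trials scored at or above the threshold are falsely accepted.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy scores, invented for illustration only (not data from the paper).
target = [2.1, 1.8, 0.3, 1.2, 2.5]        # same-speaker trials
impostor = [-1.0, 0.5, -0.2, -1.5, 0.1]   # different-speaker trials

eer, threshold = equal_error_rate(target, impostor)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With so few trials the estimate is coarse. A realistic evaluation would use many more target and impostor trials, and might interpolate between thresholds rather than pick the nearest observed score.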


Pages: 645-650

Page count: 6

10 references in total

[1]

[Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI 10.1145/2502081.2502224

[2]

[Anonymous], 2012, INFORM PROCESSES

[3] AUTOMATIC RECOGNITION OF SPEAKERS FROM THEIR VOICES [J].

ATAL, BS .

PROCEEDINGS OF THE IEEE, 1976, 64 (04) :460-475

[4] Speaker recognition: A tutorial [J].

Campbell, JP .

PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462

[5] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

[6]

Jurafsky D., 2009, SPEECH LANGUAGE PROC

[7]

Reynolds D.A., 2008, ENCY BIOMETRIC RECOG

[8] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].

REYNOLDS, DA ;

ROSE, RC .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83

[9] Speaker verification using adapted Gaussian mixture models [J].

Reynolds, DA ;

Quatieri, TF ;

Dunn, RB .

DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41

[10]

Sadjadi S. O., 2013, SPEECH LANGUAGE PROC
