Speaker Model Clustering to Construct Background Models for Speaker Verification

Cited by: 0

Authors:

Disken, Gokay [1]

Tufekci, Zekeriya [2]

Cevik, Ulus [3]

Affiliations:

[1] Adana Sci & Technol Univ, Dept Elect Elect Engn, Adana, Turkey

[2] Cukurova Univ, Dept Comp Engn, Adana, Turkey

[3] Cukurova Univ, Dept Elect Elect Engn, Adana, Turkey

Source:

ARCHIVES OF ACOUSTICS | 2017 / Vol. 42 / No. 01

Keywords:

Gaussian mixture models; k-means; imposter models; speaker clustering; speaker verification; IDENTIFICATION; RECOGNITION; SELECTION; UBM;

DOI:

10.1515/aoa-2017-0014

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

Conventional speaker recognition systems use the Universal Background Model (UBM) as the imposter model for all speakers. In this paper, speaker models are clustered to obtain better imposter model representations for speaker verification. First, a UBM is trained, and speaker models are adapted from the UBM. Then, the k-means algorithm with the Euclidean distance measure is applied to the speaker models. The speakers are divided into two, three, four, and five clusters. The resulting cluster centers are used as the background models of their respective speakers. Experiments showed that the proposed method consistently produced lower Equal Error Rates (EER) than the conventional UBM approach for 3-, 10-, and 30-second test utterances, and also under channel mismatch conditions. The proposed method is also compared with the i-vector approach. The three-cluster model achieved the best performance, with a 12.4% relative EER reduction on average compared to the i-vector method. The statistical significance of the results is also reported.
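The clustering step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes each adapted speaker model is represented as a fixed-length vector (for example, the stacked Gaussian means of a UBM-adapted model, which the abstract does not specify), and it uses synthetic data in place of real speaker models. The names `kmeans_euclidean`, `supervectors`, and `background_for_speaker` are hypothetical.

```python
import numpy as np


def kmeans_euclidean(X, k, n_iter=100, seed=0):
    """Plain k-means with Euclidean distance.

    X is an (n_speakers, dim) array of speaker-model vectors.
    Returns (centers, labels).
    """
    rng = np.random.default_rng(seed)
    # Initialise the centers with k distinct speaker vectors.
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    labels = np.full(len(X), -1)
    for _ in range(n_iter):
        # Euclidean distance from every speaker vector to every center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments are stable, so the centers are too
        labels = new_labels
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return centers, labels


# Synthetic stand-in for adapted speaker models (assumption: 30 speakers,
# 8-dimensional vectors drawn around three well-separated group means).
rng = np.random.default_rng(42)
group_means = np.array([[-5.0] * 8, [0.0] * 8, [5.0] * 8])
supervectors = np.vstack(
    [m + 0.5 * rng.normal(size=(10, 8)) for m in group_means]
)

centers, labels = kmeans_euclidean(supervectors, k=3)

# Each speaker is paired with the center of its own cluster, which would
# then play the role of that speaker's background (imposter) model.
background_for_speaker = centers[labels]
```

In a verification system built this way, the score for a claimed speaker would be computed against `background_for_speaker[i]` rather than against a single shared UBM; how the cluster center is turned back into a full model is outside what the abstract states.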


Pages: 127-135

Page count: 9
