Speaker Model Clustering to Construct Background Models for Speaker Verification

Cited by: 0
Authors
Disken, Gokay [1]
Tufekci, Zekeriya [2]
Cevik, Ulus [3]
Affiliations
[1] Adana Sci & Technol Univ, Dept Elect Elect Engn, Adana, Turkey
[2] Cukurova Univ, Dept Comp Engn, Adana, Turkey
[3] Cukurova Univ, Dept Elect Elect Engn, Adana, Turkey
Keywords
Gaussian mixture models; k-means; imposter models; speaker clustering; speaker verification; IDENTIFICATION; RECOGNITION; SELECTION; UBM;
DOI
10.1515/aoa-2017-0014
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Conventional speaker recognition systems use the Universal Background Model (UBM) as the imposter model for all speakers. In this paper, speaker models are clustered to obtain better imposter model representations for speaker verification. First, a UBM is trained, and speaker models are adapted from the UBM. Then, the k-means algorithm with the Euclidean distance measure is applied to the speaker models. The speakers are divided into two, three, four, and five clusters. The resulting cluster centers are used as the background models of their respective speakers. Experiments showed that the proposed method consistently produced lower Equal Error Rates (EER) than the conventional UBM approach for 3-, 10-, and 30-second test utterances, and also under channel mismatch conditions. The proposed method is also compared with the i-vector approach. The three-cluster model achieved the best performance, with a 12.4% relative EER reduction on average compared to the i-vector method. The statistical significance of the results is also reported.
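The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each adapted speaker model has already been flattened into a fixed-length vector (for example, stacked GMM mean vectors), which the abstract does not specify. The function names, the toy two-dimensional vectors, and the speaker labels are all hypothetical.

```python
import random


def squared_euclidean(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=100, seed=0):
    """Plain Lloyd's k-means; returns (centers, assignments)."""
    rng = random.Random(seed)
    # Initialise the centers with k distinct input vectors.
    centers = [list(v) for v in rng.sample(vectors, k)]
    assignments = None
    for _ in range(max_iter):
        # Assignment step: nearest center under Euclidean distance.
        new_assignments = [
            min(range(k), key=lambda c: squared_euclidean(v, centers[c]))
            for v in vectors
        ]
        if new_assignments == assignments:
            break  # converged: no speaker changed cluster
        assignments = new_assignments
        # Update step: each center becomes the mean of its members.
        for c in range(k):
            members = [v for v, a in zip(vectors, assignments) if a == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return centers, assignments


def cluster_background_models(speaker_models, k):
    """Map each speaker to the center of the cluster it falls in."""
    names = list(speaker_models)
    centers, assignments = kmeans([speaker_models[n] for n in names], k)
    return {name: centers[a] for name, a in zip(names, assignments)}


# Hypothetical toy "speaker models": two well-separated groups in 2-D.
speaker_models = {
    "spk_a": [0.0, 0.1], "spk_b": [0.2, 0.0], "spk_c": [0.1, 0.2],
    "spk_d": [5.0, 5.1], "spk_e": [5.2, 4.9], "spk_f": [4.9, 5.0],
}
backgrounds = cluster_background_models(speaker_models, k=2)
```

In this sketch, `backgrounds[name]` plays the role of the cluster-specific background model that would stand in for the single UBM when scoring a verification trial for that speaker. Turning a cluster center back into a usable GMM and computing verification scores are outside what the abstract describes and are not shown.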
Pages: 127-135
Page count: 9
Related papers
30 in total
  • [1] [Anonymous], EUROSPEECH 1999
  • [2] Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications
    Apsingekar, Vijendra Raj
    De Leon, Phillip L.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) : 848 - 853
  • [3] Auckenthaler R., 2001, PROC SPEAKER ODYSSEY, P83
  • [4] A tutorial on text-independent speaker verification
    Bimbot, F
    Bonastre, JF
    Fredouille, C
    Gravier, G
    Magrin-Chagnolleau, I
    Meignier, S
    Merlin, T
    Ortega-García, J
    Petrovska-Delacrétaz, D
    Reynolds, DA
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) : 430 - 451
  • [5] Combining cohort and UBM models in open set speaker detection
    Brew, Anthony
    Cunningham, Padraig
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 48 (01) : 141 - 159
  • [6] Combining Cohort and UBM Models in Open Set Speaker Identification
    Brew, Anthony
    Cunningham, Padraig
    [J]. CBMI: 2009 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2009 : 62 - 67
  • [7] Speaker recognition: A tutorial
    Campbell, JP
    [J]. PROCEEDINGS OF THE IEEE, 1997, 85 (09) : 1437 - 1462
  • [8] Campbell WM, 2006, INT CONF ACOUST SPEE, P97
  • [9] De Leon P.L., 2007, P BIOMETRICS S, P1, DOI 10.1109/BCC.2007.4430544
  • [10] Front-End Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick J.
    Dehak, Reda
    Dumouchel, Pierre
    Ouellet, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) : 788 - 798