Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications

被引：36

作者：

Apsingekar, Vijendra Raj ^{[1
]}

De Leon, Phillip L. ^{[1
]}

机构：

[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 04期

关键词：

Clustering methods; speaker recognition; VERIFICATION;

D O I：

10.1109/TASL.2008.2010882

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In large population speaker identification (SI) systems, likelihood computations between an unknown speaker's feature vectors and the registered speaker models can be very time-consuming and impose a bottleneck. For applications requiring fast SI, this is a recognized problem and improvements in efficiency would be beneficial. In this paper, we propose a method whereby GMM-based speaker models are clustered using a simple k-means algorithm. Then, during the test stage, only a small proportion of speaker models in selected clusters are used in the likelihood computations resulting in a significant speed-up with little to no loss in accuracy. In general, as the number of selected clusters is reduced, the identification accuracy decreases; however, this loss can be controlled through proper tradeoff. The proposed method may also be combined with other test stage speed-up techniques resulting in even greater speed-up gains without additional sacrifices in accuracy.

引用

页码：848 / 853

页数：6

共 23 条

[1] Discriminative in-set/out-of-set speaker recognition [J].

Angkititrakul, Pongtep ;

Hansen, John H. L. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :498-508

[2]

Aronowitz H., 2004, P ICSLP, P609

[3] Efficient speaker recognition using approximated cross entropy (ACE) [J].

Aronowitz, Hagai ;

Burshtein, David .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2033-2043

[4]

BEIGI HSM, 1999, P 6 EUR C SPEECH COM, P2203

[5]

BEN M, 2002, P IEEE INT C AC SPEE

[6]

Campbell WM, 2006, INT CONF ACOUST SPEE, P97

[7]

CAMPBELL WM, 2007, P ICASSP 2007, V4, P217

[8]

DELEON PL, 2007, P IEEE BIOM S

[9]

Faloutsos C., 1994, Journal of Intelligent Information Systems: Integrating Artificial Intelligence and Database Technologies, V3, P231, DOI 10.1007/BF00962238

[10]

GOLDBERGER J, 2005, P INT, P1982

← 1 2 3 →