Reducing speaker model search space in speaker identification

被引：0

作者：

De Leon, Phillip L. ^{[1
]}

Apsingekar, Vijendra ^{[1
]}

机构：

[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA

来源：

2007 BIOMETRICS SYMPOSIUM | 2007年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

For large population speaker identification (SID) systems, likelihood computations between an unknown speaker's test feature set and speaker models can be very time-consuming and detrimental to applications where fast SID is required. In this paper, we propose a method whereby speaker models are clustered during the training stage. Then during the testing stage, only those clusters which are likely to contain high-likelihood speaker models are searched. The proposed method reduces the speaker model space which directly results in faster SID. Although there maybe a slight loss in identification accuracy depending on the number of clusters searched, this loss can be controlled by trading off speed and accuracy.

引用

页码：90 / 95

页数：6

共 15 条

[1] Discriminative in-set/out-of-set speaker recognition [J].

Angkititrakul, Pongtep ;

Hansen, John H. L. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :498-508

[2] Support vector machines using GMM supervectors for speaker verification [J].

Campbell, WM ;

Sturim, DE ;

Reynolds, DA .

IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311

[3]

CAMPBELL WM, 2007, P INT C AC SPEECH SI

[4]

Faloutsos C., 1994, Journal of Intelligent Information Systems: Integrating Artificial Intelligence and Database Technologies, V3, P231, DOI 10.1007/BF00962238

[5]

Gersho A., 1992, VECTOR QUANTIZATION

[6] Real-time speaker identification and verification [J].

Kinnunen, T ;

Karpov, E ;

Fränti, P .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01) :277-288

[7]

MASHAO DJ, 2004, IEEE AFRICON S SEP

[8]

MCLAUGHLIN J, 1999, P 6 EUR C SPEECH COM

[9] An efficient scoring algorithm for Gaussian mixture model based speaker identification [J].

Pellom, BL ;

Hansen, JHL .

IEEE SIGNAL PROCESSING LETTERS, 1998, 5 (11) :281-284

[10]

Quatieri T., 2002, DISCRETE TIME SPEECH

← 1 2 →