Environment Based Threshold for Speaker Identification

被引：0

作者：

Kanrar, Soumen ^{[1
]}

机构：

[1] Vehere Interact Pvt Ltd, Kolkata 53, W Bengal, India

来源：

2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO) | 2015年

关键词：

Speaker Identification; Gaussian mixture mode; acoustic feature vectors; decision threshold; false accept; false reject;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Speaker Identification process is to identify a particular vocal cord from a set of existing speakers. In the speaker identification processes, the unknown speaker voice sample targets each of the existing speakers in the system and gives a predication. The predication is more than one existing known speaker voice and is very close to the unknown speaker voice. It is a one to many mapping. The mapping function gives a set of predicated values associated with the order pair of speakers. In the order pair, the first coordinate is the unknown speaker and the second coordinates is the existing known speaker from the speaker recognition system. The set of predicated values helps to identify the unknown speaker. The identification process makes a comparison of the unknown speaker model with the models of the existing voice in the system. In this paper, the model is a Gaussian mixture model built by the extraction of the acoustic feature vectors. This paper presents the impact of the decision threshold based on false accepts and false reject for an unknown number of speaker conversion in the speaker identification result. In the simulation, the considered known speaker voices are collected through different channels. In the testing, the GMM voice models of the known speakers are distributed among the numbers of clusters in the test data set.

引用

页数：5

共 10 条

[1] Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications
Apsingekar, Vijendra Raj
De Leon, Phillip L.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 848 - 853
[2] Score normalization for text-independent speaker verification systems
Auckenthaler, R
Carey, M
Lloyd-Thomas, H
[J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 42 - 54
[3] A tutorial on text-independent speaker verification
Bimbot, F
Bonastre, JF
Fredouille, C
Gravier, G
Magrin-Chagnolleau, I
Meignier, S
Merlin, T
Ortega-García, J
Petrovska-Delacrétaz, D
Reynolds, DA
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) : 430 - 451
[4] Mirghafori N., 2002, P ICSLP
[5] Measuring the validity and reliability of forensic likelihood-ratio systems
Morrison, Geoffrey Stewart
[J]. SCIENCE & JUSTICE, 2011, 51 (03) : 91 - 98
[6] Reynolds D. A., 1995, Lincoln Laboratory Journal, V8, P173
[7] SPEAKER IDENTIFICATION AND VERIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
REYNOLDS, DA
[J]. SPEECH COMMUNICATION, 1995, 17 (1-2) : 91 - 108
[8] Reynolds DA, 2003, INT CONF ACOUST SPEE, P53
[9] Speaker verification using adapted Gaussian mixture models
Reynolds, DA
Quatieri, TF
Dunn, RB
[J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 19 - 41
[10] Xiang B, 2002, INT CONF ACOUST SPEE, P681

← 1 →