Combining cohort and UBM models in open set speaker detection

被引:5
作者
Brew, Anthony [1 ]
Cunningham, Pedraig [1 ]
机构
[1] Univ Coll Dublin, Machine Learning Grp, Sch Informat & Comp Sci, Dublin 2, Ireland
关键词
Speaker detection; Speaker verification; Gaussian Mixture Models; Support Vector Machines; UBM; Cohort;
D O I
10.1007/s11042-009-0381-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In speaker detection it is important to build an alternative model against which to compare scores from the 'target' speaker model. Two alternative strategies for building an alternative model are to build a single global model by sampling from a pool of training data, the Universal Background (UBM), or to build a cohort of models from selected individuals in the training data for the target speaker. The main contribution in this paper is to show that these approaches can be unified by using a Support Vector Machine (SVM) to learn a decision rule in the score space made up of the output scores of the client, cohort and UBM model.
引用
收藏
页码:141 / 159
页数:19
相关论文
共 28 条
[1]  
[Anonymous], 2 INT C SPOK LANG PR
[2]  
Ariyaeeinia A.M., 1997, EUROSPEECH
[3]   Score normalization for text-independent speaker verification systems [J].
Auckenthaler, R ;
Carey, M ;
Lloyd-Thomas, H .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :42-54
[4]  
Bengio S, 2001, INT CONF ACOUST SPEE, P425, DOI 10.1109/ICASSP.2001.940858
[5]   A tutorial on text-independent speaker verification [J].
Bimbot, F ;
Bonastre, JF ;
Fredouille, C ;
Gravier, G ;
Magrin-Chagnolleau, I ;
Meignier, S ;
Merlin, T ;
Ortega-García, J ;
Petrovska-Delacrétaz, D ;
Reynolds, DA .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451
[6]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[7]   Corpora for the evaluation of speaker recognition systems [J].
Campbell, JP ;
Reynolds, DA .
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, :829-832
[8]  
CAMPBELL W, 2004, OD SPEAK LANG REC WO
[9]  
CHARLET D, 2008, INTERSPEECH
[10]  
Higgins A., 1991, Digital Signal Processing, V1, P89, DOI 10.1016/1051-2004(91)90098-6