A modified speaker clustering method for efficient speaker identification

Cited by: 0
Authors
Yan, JiaChang [1]
Wang, Lei [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
Source
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2 | 2014
Keywords
speaker identification; GMM-UBM; speaker clustering; k-means initialization; VERIFICATION; MODELS;
DOI
10.1109/ISCID.2014.125
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In a speaker identification system, the scoring process can become extremely time-consuming as the speaker population grows. Speaker clustering is commonly used to reduce this cost. K-means is a widely used clustering algorithm, but its performance suffers from the so-called local optimum problem. To address this problem, the paper introduces a novel initialization approach that initializes the clusters according to the intrinsic spreading patterns of the speaker models. In essence, the proposal is in the same spirit as the well-known Canopy mechanism, but it differs from Canopy in candidate selection and threshold setting. The results show that, for this application, the proposed approach works effectively and generates a more rational and stable clustering outcome.
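For orientation, the following is a minimal sketch of generic Canopy-style seeding followed by standard k-means. It is not the paper's method: the abstract states that the proposal differs from Canopy in candidate selection and threshold setting, and those details are not given here. The function names, the thresholds `t1` and `t2`, the Euclidean distance, and the toy 2-D points standing in for speaker models are all assumptions made for illustration.

```python
import numpy as np

def canopy_seeds(points, t1, t2):
    """Generic Canopy seeding (assumed variant, not the paper's).

    t1 is the loose threshold and t2 the tight one, with t1 > t2 > 0.
    """
    assert t1 > t2 > 0
    remaining = list(range(len(points)))
    seeds = []
    while remaining:
        # Deterministic candidate pick: the first point still in the pool.
        center = points[remaining[0]]
        dists = np.linalg.norm(points[remaining] - center, axis=1)
        # Points within the loose threshold form the canopy.
        members = [r for r, d in zip(remaining, dists) if d < t1]
        seeds.append(points[members].mean(axis=0))
        # Points within the tight threshold leave the candidate pool.
        remaining = [r for r, d in zip(remaining, dists) if d >= t2]
    return np.array(seeds)

def kmeans(points, init_centers, n_iter=20):
    """Plain Lloyd iterations starting from the given centers."""
    centers = init_centers.copy()
    for _ in range(n_iter):
        d = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        for k in range(len(centers)):
            if np.any(labels == k):
                centers[k] = points[labels == k].mean(axis=0)
    return centers, labels

# Toy data: three well-separated groups on a small fixed grid
# (hypothetical stand-ins for speaker-model vectors).
offsets = np.array([[dx, dy] for dx in (-0.5, 0.0, 0.5)
                    for dy in (-0.5, 0.0, 0.5)])
group_means = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
points = np.vstack([m + offsets for m in group_means])

seeds = canopy_seeds(points, t1=3.0, t2=2.0)
centers, labels = kmeans(points, seeds)
```

In this sketch the number of clusters is not fixed in advance; it falls out of the thresholds, which is one reason Canopy-like seeding tends to give more repeatable starting points than random initialization.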
Pages: 4
Related papers
8 items in total
[1]  
Apsingekar V R, 2008, P EUR SIGN PROC C EU
[2]   Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications [J].
Apsingekar, Vijendra Raj ;
De Leon, Phillip L. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) :848-853
[3]   Efficient speaker recognition using approximated cross entropy (ACE) [J].
Aronowitz, Hagai ;
Burshtein, David .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :2033-2043
[4]  
Goldberger J., 2005, INTERSPEECH 2005-Eurospeech, 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005, P1985
[5]   Real-time speaker identification and verification [J].
Kinnunen, T ;
Karpov, E ;
Fränti, P .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01) :277-288
[6]  
McCallum A., P 6 ACM SIGKDD INT C, P169
[7]   ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].
REYNOLDS, DA ;
ROSE, RC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83
[8]   Speaker verification using adapted Gaussian mixture models [J].
Reynolds, DA ;
Quatieri, TF ;
Dunn, RB .
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41