Research on the Parameter Optimal Algorithm of Gaussian Mixture Model in Speaker Identification

被引:0
作者
Ding, Hui [1 ,2 ]
Tang, Zhenmin [1 ]
Li, Yanping [3 ]
机构
[1] Nanjing Univ Sci & Technol, Lab Pattern Recognit & Intelligent Syst, Nanjing 210094, Jiangsu, Peoples R China
[2] Jiaxing Univ, Sch Math & Informat Engn, Jiaxing 314001, Zhejiang, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2 | 2009年
关键词
Speaker Identification; Diagonal Covariance; Transformation Embedded; Sharing Matrix; Clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of speaker recognition, the Gaussian Mixture Model with diagonal covariance matrices is a popular technique, in this way, it simplified model and reduced the amount of computation, but lost the correlation information between feature vectors, and then influenced the classification performance. In this paper, in order to compensate the correlation between feature elements, we proposed a novel method based on clustering transformation algorithm, we calculate the similarity between Gaussian components, and the cluster of same components will share one transformation matrix, thus multi-transformation matrices, together with weights and means vectors are obtained simultaneously by Maximum Likelihood estimation. Theory analysis and experimental results demonstrated that this proposed method can get a better balance between training speed and recognition rate, improve the performance of classifier and reduce the complexity and memory burden relatively.
引用
收藏
页码:639 / +
页数:2
相关论文
共 7 条
[1]  
CHIHCHIEN TC, 2004, PATTERN RECOGN, V37, P1073
[2]  
Flury B., 1988, Common Principal Components & Related Multivariate Models
[3]   Semi-tied covariance matrices for hidden Markov models [J].
Gales, MJF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :272-281
[4]  
KUOHWEI Y, 1999, SPEECH COMMUN, V28, P227
[5]  
LIU L, 1999, P IEEE INT C AC SPEE, V2, P845
[6]   THE IMPORTANCE OF CEPSTRAL PARAMETER CORRELATIONS IN SPEECH RECOGNITION [J].
LJOLJE, A .
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03) :223-232
[7]   ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].
REYNOLDS, DA ;
ROSE, RC .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83