A hybrid system based on GMM-SVM for Speaker Identification

Cited by: 0
Authors
Chakroun, Rania [1, 4]
Zouari, Leila Beltaifa [1, 3]
Frikha, Mondher [1, 2]
Ben Hamida, Ahmed [1, 4]
Affiliations
[1] Adv Technol Med & Signals ATMS Res Unit, Sfax, Tunisia
[2] Natl Sch Elect & Telecommun Sfax, Sfax, Tunisia
[3] Natl Sch Engn Sousse, Sousse, Tunisia
[4] Natl Sch Engn Sfax, Sfax, Tunisia
Source
2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA) | 2015
Keywords
Support vector machines; Gaussian mixture models; speaker recognition; speaker identification
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Gaussian mixture models (GMM) have become the standard method for speaker recognition systems. A recent finding is that combining the GMM approach with another classifier is an effective method for speaker classification. We consider the GMM supervector in the context of support vector machines (SVM): we construct an SVM that operates on GMM supervectors and test it with two kernel functions. The main idea of the study is to combine the discriminative SVM classifier and traditional GMM pattern classification with a new low-dimensional cepstral feature vector extracted from the speech signal to achieve a better classification rate. This idea has been analytically formulated and tested on speakers from the TIMIT database. We first describe the GMM-SVM system, then briefly discuss how the new low-dimensional feature vector can contribute to the identification rate. We show comparative results obtained with GMM, SVM, the GMM-SVM based system, and existing works. Finally, we show that the new hybrid system can outperform the standard GMM-SVM based system and gives remarkable increases in speaker identification rates.
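The GMM-supervector idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the two kernels, the feature dimension, or the mixture size, so the linear and RBF kernels, the 12-dimensional frames, the 8-component background model, and the relevance factor below are all assumptions. Synthetic Gaussian frames stand in for cepstral features (no TIMIT data is used), and the supervector is built with mean-only MAP adaptation of a background GMM, which is one common construction.

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.svm import SVC


def map_adapted_supervector(ubm, frames, relevance=16.0):
    """Stack mean-only MAP-adapted background-model means into one vector."""
    post = ubm.predict_proba(frames)             # (n_frames, n_components)
    n_k = post.sum(axis=0)                       # soft frame count per component
    # Posterior-weighted mean of the frames for each component.
    first_order = post.T @ frames / np.maximum(n_k, 1e-10)[:, None]
    alpha = (n_k / (n_k + relevance))[:, None]   # data-dependent adaptation weight
    adapted = alpha * first_order + (1.0 - alpha) * ubm.means_
    return adapted.ravel()                       # length n_components * dim


def make_utterance(rng, speaker_mean, n_frames=200):
    """Hypothetical stand-in for the cepstral frames of one utterance."""
    return rng.normal(loc=speaker_mean, scale=1.0,
                      size=(n_frames, speaker_mean.shape[0]))


rng = np.random.default_rng(0)
n_speakers, dim, n_components = 4, 12, 8         # assumed sizes
speaker_means = rng.normal(scale=2.0, size=(n_speakers, dim))

train = [(s, make_utterance(rng, speaker_means[s]))
         for s in range(n_speakers) for _ in range(5)]
test = [(s, make_utterance(rng, speaker_means[s]))
        for s in range(n_speakers) for _ in range(3)]

# Background GMM trained on the pooled training frames of all speakers.
ubm = GaussianMixture(n_components=n_components, covariance_type="diag",
                      random_state=0)
ubm.fit(np.vstack([frames for _, frames in train]))

X_train = np.array([map_adapted_supervector(ubm, f) for _, f in train])
y_train = np.array([s for s, _ in train])
X_test = np.array([map_adapted_supervector(ubm, f) for _, f in test])
y_test = np.array([s for s, _ in test])

# One SVM per kernel; the kernel choice here is an assumption.
accuracy = {}
for kernel in ("linear", "rbf"):
    clf = SVC(kernel=kernel, gamma="scale").fit(X_train, y_train)
    accuracy[kernel] = clf.score(X_test, y_test)
    print(f"{kernel} kernel identification accuracy: {accuracy[kernel]:.2f}")
```

Each utterance becomes a single fixed-length vector regardless of its duration, which is what lets a discriminative classifier such as an SVM be trained on top of a generative GMM front end.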
Pages: 654-658
Number of pages: 5