A Speaker Recognition Algorithm Based on Factor Analysis

被引:0
作者
Shen, Xuanjing [1 ,2 ]
Zhai, Yujie [1 ,2 ]
Wang, Yu [1 ,2 ,3 ]
Chen, Haipeng [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130023, Jilin, Peoples R China
[3] Jilin Univ, Coll Appl Technol, Changchun 130012, Peoples R China
来源
2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014) | 2014年
关键词
Speaker recognition; SVM; GMM; Latent factor analysis; LFA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Channel interference factor for the identification result is prevalent among the existing speaker recognition algorithms. In order to improve the accuracy of the algorithm, the paper utilizes the technique of latent factor analysis(LFA) to deal with the channel factors in the speaker's Gaussian Mixture Model(GMM). In the endpoint detection phase of speaker recognition, the algorithm introduces the GMM for speech modeling to accurately determine the beginning and ending points of the speech segment, and then establish speaker GMM. The algorithm use factor analysis technique to fit the differences between the speaker characteristics space and the channel space, and removes channel factor in speaker's GMM. And then the algorithm extracts GMM super-vectors as the input of Support Vector Machine(SVM) to obtain recognition results. Experimental results show that the combination of factor analysis and SVM can obtain better recognition rate and ensure the robustness of the recognition algorithm.
引用
收藏
页码:897 / 901
页数:5
相关论文
共 11 条
[1]  
Brutti A, 2013, GEOMETRIC CONTAMINAT
[2]   Analysis of feature extraction and channel compensation in a GMM speaker recognition system [J].
Burget, Lukas ;
Matejka, Pavel ;
Schwarz, Petr ;
Glembek, Ondfei ;
Cernocky, Jan 'Honza' .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07) :1979-1986
[3]   Support vector machines using GMM supervectors for speaker verification [J].
Campbell, WM ;
Sturim, DE ;
Reynolds, DA .
IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]  
Ding J, 2013, MULTIMED TOOLS APPL, P1
[6]  
Kasuriya S, 2001, INT J UNCERTAIN FUZZ, V9, P673
[7]   An overview of text-independent speaker recognition: From features to supervectors [J].
Kinnunen, Tomi ;
Li, Haizhou .
SPEECH COMMUNICATION, 2010, 52 (01) :12-40
[8]   A study of voice activity detection techniques for NIST speaker recognition evaluations [J].
Mak, Man-Wai ;
Yu, Hon-Bill .
COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) :295-313
[9]  
Munteanu DP, 2010, PROCEEDINGS OF THE 2010 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), P107, DOI 10.1109/ICCOMM.2010.5509021
[10]  
Sen Nirmalya, 2013, Mining Intelligence and Knowledge Exploration. First International Conference, MIKE 2013. Proceedings: LNCS 8284, P780, DOI 10.1007/978-3-319-03844-5_76