On the use of Classifiers for Text-independent Speaker Identification

Cited by: 0
Authors
Jawarkar, Naresh P. [1]
Holambe, Raghunath S. [2]
Basu, Tapan Kumar [3]
Affiliations
[1] BN Coll Engn, Elect & Telecommun Engn Dept, Pusad, MS, India
[2] SGGS Inst Engg & Tech, Instrumentat Engn Dept, Nanded, MS, India
[3] Acad Technol, Hooghly, WB, India
Source
2014 FIRST INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL, ENERGY & SYSTEMS (ACES-14) | 2014
Keywords
GMM; fuzzy neural networks; SOM; VQ based probabilistic neural network; speaker identification; HIDDEN MARKOV-MODELS; NEURAL-NETWORKS
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a comparative study of modelling techniques (classifiers) for text-independent speaker identification. Four classifiers are studied: Gaussian mixture models (GMM), the fuzzy min-max neural network, the self-organizing map (SOM), and the vector quantization based probabilistic neural network (VQ-PNN). Experiments use a database of Hindi speech utterances recorded from forty-two speakers. Mel-frequency cepstral coefficients, which represent the short-time spectrum, serve as the features for identification. The performance of the four classifiers is analysed under clean and noisy speech conditions at different signal-to-noise ratios. All four classifiers perform almost equally well on 10-second test utterances under clean conditions. However, GMM outperforms the other three classifiers under noisy test conditions.
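The abstract does not say how the noisy test conditions were produced. As a minimal sketch, assuming additive noise scaled to a target signal-to-noise ratio via SNR_dB = 10 log10(P_signal / P_noise), one noisy test utterance could be built as follows. The function names, the sine-wave stand-in for speech, and the white Gaussian noise are all illustrative assumptions, not details from the paper.

```python
import math
import random


def power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(s * s for s in x) / len(x)


def add_noise_at_snr(signal, noise, snr_db):
    """Scale `noise` so the mixture has the requested SNR, then add it to `signal`."""
    if len(signal) != len(noise):
        raise ValueError("signal and noise must have the same length")
    # SNR_dB = 10 * log10(P_signal / P_noise), solved for the noise power.
    target_noise_power = power(signal) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / power(noise))
    return [s + gain * n for s, n in zip(signal, noise)]


def measured_snr_db(clean, noisy):
    """SNR of `noisy` relative to `clean`, treating the difference as noise."""
    residual = [y - x for x, y in zip(clean, noisy)]
    return 10 * math.log10(power(clean) / power(residual))


# Hypothetical stand-ins: a 440 Hz tone for speech, white Gaussian noise.
rng = random.Random(0)
clean = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]
noisy = add_noise_at_snr(clean, noise, snr_db=10)
print(round(measured_snr_db(clean, noisy), 2))
```

Repeating the call with different `snr_db` values would give one test set per noise level. Real recordings and noise samples would replace the synthetic signals used here.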
Pages: 238-242
Number of pages: 5