The speaker identification by using genetic wavelet adaptive network based fuzzy inference system

被引:3
作者
Avci, E. [1 ]
Avci, D. [1 ]
机构
[1] Firat Univ, Dept Elect & Comp Educ, TR-23119 Elazig, Turkey
关键词
Turkish speech signal; Adaptive feature extraction; Wavelet decomposition; Entropy; Genetic algorithm; ANFIS; Intelligent system; RECOGNITION; EXTRACTION;
D O I
10.1016/j.eswa.2009.01.081
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an intelligent speaker identification system is presented for speaker identification by using speech/voice signal. This study includes both combination of the adaptive feature extraction and classification by using optimum wavelet entropy parameter Values. These Optimum wavelet entropy values are obtained from measured Turkish speech/voice signal waveforms using speech experimental set. It is developed a genetic wavelet adaptive network based on fuzzy inference system (GWANFIS) model in this study. This model consists of three layers which are genetic algorithm, wavelet and adaptive network based on fuzzy inference system (ANFIS). The genetic algorithm layer is used for selecting of the feature extraction method and obtaining the Optimum wavelet entropy parameter values. In this study, one of the eight different feature extraction methods is selected by using genetic algorithm. Alternative feature extraction methods are wavelet decomposition, wavelet decomposition - short time Fourier transform, wavelet decomposition - Born-Jordan time-frequency representation, wavelet decomposition - Choi-Williams time-frequency representation, wavelet decomposition - Margenau-Hill time-frequency representation, wavelet decomposition - Wigner-Ville time-frequency representation. wavelet decomposition - Page time-frequency representation, wavelet decomposition - Zhao-Atlas-Marks time-frequency representation, The wavelet layer is used for Optimum feature extraction in the time-frequency domain and is composed of wavelet decomposition and wavelet entropies. The ANFIS approach is used for evaluating to fitness function of the genetic algorithm and for classification speakers. It has been evaluated the performance of the developed system by using noisy Turkish speech/voice signals. The test results showed that this system is effective in detecting real speech signals. The correct classification rate is about 91% for speaker classification. (C) 2009 Published by Elsevier Ltd.
引用
收藏
页码:9928 / 9940
页数:13
相关论文
共 31 条
[1]  
Avci E, 2005, LECT NOTES COMPUT SC, V3522, P594
[2]  
AVCI E, 2003, INT J COMPUTATIONAL, V1, P231
[3]   A new optimum feature extraction and classification method for speaker recognition: GWPNN [J].
Avci, Engin .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) :485-498
[4]  
Bakhtazad A., 1999, Intelligent Data Analysis, V3, P267, DOI 10.1016/S1088-467X(99)00023-2
[5]  
Boashash B., 1992, TIME FREQUENCY SIGNA
[6]  
Buckheit J.B., 1995, WAVELAB REPRODUCIBLE
[7]  
Burrus C.S., 1998, INTRO WAVELET WAVELE
[8]   ENTROPY-BASED ALGORITHMS FOR BEST BASIS SELECTION [J].
COIFMAN, RR ;
WICKERHAUSER, MV .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :713-718
[9]  
DEVASAHAYAM S, 2000, TOP BIOMED ENGN, pR7
[10]   PITCH-SYNCHRONOUS WAVELET REPRESENTATIONS OF SPEECH AND MUSIC SIGNALS [J].
EVANGELISTA, G .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3313-3330