Speaker identification using vowels features through a combined method of formants, wavelets, and neural network classifiers

被引:44
作者
Daqrouq, Khaled [1 ]
Tutunji, Tarek A. [2 ]
机构
[1] King Abdulaziz Univ, Dept Elect & Comp Engn, Jeddah 21413, Saudi Arabia
[2] Philadelphia Univ, Mechatron Engn Dept, Philadelphia, PA USA
关键词
Speaker verification and identification; Wavelet packet; Neural networks; Formants; VERIFICATION; RECOGNITION; SYSTEM; ALGORITHM;
D O I
10.1016/j.asoc.2014.11.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new method for speaker feature extraction based on Formants, Wavelet Entropy and Neural Networks denoted as FWENN. In the first stage, five formants and seven Shannon entropy wavelet packet are extracted from the speakers' signals as the speaker feature vector. In the second stage, these 12 feature extraction coefficients are used as inputs to feed-forward neural networks. Probabilistic neural network is also proposed for comparison. In contrast to conventional speaker recognition methods that extract features from sentences (or words), the proposed method extracts the features from vowels. Advantages of using vowels include the ability to recognize speakers when only partially-recorded words are available. This may be useful for deaf-mute persons or when the recordings are damaged. Experimental results show that the proposed method succeeds in the speaker verification and identification tasks with high classification rate. This is accomplished with minimum amount of information, using only 12 coefficient features (i.e. vector length) and only one vowel signal, which is the major contribution of this work. The results are further compared to well-known classical algorithms for speaker recognition and are found to be superior. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:231 / 239
页数:9
相关论文
共 58 条