Speaker identification using vowels features through a combined method of formants, wavelets, and neural network classifiers

被引：44

作者：

Daqrouq, Khaled ^{[1
]}

Tutunji, Tarek A. ^{[2
]}

机构：

[1] King Abdulaziz Univ, Dept Elect & Comp Engn, Jeddah 21413, Saudi Arabia

[2] Philadelphia Univ, Mechatron Engn Dept, Philadelphia, PA USA

来源：

APPLIED SOFT COMPUTING | 2015年 / 27卷

关键词：

Speaker verification and identification; Wavelet packet; Neural networks; Formants; VERIFICATION; RECOGNITION; SYSTEM; ALGORITHM;

D O I：

10.1016/j.asoc.2014.11.016

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a new method for speaker feature extraction based on Formants, Wavelet Entropy and Neural Networks denoted as FWENN. In the first stage, five formants and seven Shannon entropy wavelet packet are extracted from the speakers' signals as the speaker feature vector. In the second stage, these 12 feature extraction coefficients are used as inputs to feed-forward neural networks. Probabilistic neural network is also proposed for comparison. In contrast to conventional speaker recognition methods that extract features from sentences (or words), the proposed method extracts the features from vowels. Advantages of using vowels include the ability to recognize speakers when only partially-recorded words are available. This may be useful for deaf-mute persons or when the recordings are damaged. Experimental results show that the proposed method succeeds in the speaker verification and identification tasks with high classification rate. This is accomplished with minimum amount of information, using only 12 coefficient features (i.e. vector length) and only one vowel signal, which is the major contribution of this work. The results are further compared to well-known classical algorithms for speaker recognition and are found to be superior. (C) 2014 Elsevier B.V. All rights reserved.

引用

页码：231 / 239

页数：9

共 58 条

[1]

Alotaibi Y, 2009, P 1 INT C FGIT JEJ I

[2]

Alotaibi Y, 2009, P BIOID MULTICOMM MA

[3]

[Anonymous], P WORLD ACAD SCI ENG

[4]

[Anonymous], EUR J SCI RES

[5]

[Anonymous], 1999, P EUR C SPEECH COMM

[6] Audio-visual speaker identification using dynamic facial movements and utterance phonetic content [J].

Asadpour, Vahid ;

Homayounpour, Mohammad Mehdi ;

Towhidkhah, Farzad .

APPLIED SOFT COMPUTING, 2011, 11 (02) :2083-2093

[7] An expert system for speaker identification using adaptive wavelet sure entropy [J].

Avci, Derya .

EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :6295-6300

[8] An expert Discrete Wavelet Adaptive Network Based Fuzzy Inference System for Digital Modulation Recognition [J].

Avci, Engin ;

Hanbay, Davut ;

Varol, Asaf .

EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (03) :582-589

[9] A new optimum feature extraction and classification method for speaker recognition: GWPNN [J].

Avci, Engin .

EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) :485-498

[10] Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech [J].

Bachorowski, JA ;

Owren, MJ .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (02) :1054-1063

← 1 2 3 4 5 6 →