A novel hybrid of genetic algorithm and ANN for developing a high efficient method for vocal fold pathology diagnosis

被引:0
作者
Vahid Majidnezhad
机构
[1] Islamic Azad University,Department of Computer Engineering, Shabestar Branch
来源
EURASIP Journal on Audio, Speech, and Music Processing | / 2015卷
关键词
Vocal fold pathology diagnosis; Wavelet packet decomposition; Mel frequency cepstral coefficient (MFCC); Principal component analysis (PCA); Artificial neural network (ANN); Genetic algorithm (GA);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an initial feature vector based on the combination of the wavelet packet decomposition (WPD) and the Mel frequency cepstral coefficients (MFCCs) is proposed. For optimizing the initial feature vector, a genetic algorithm (GA)-based approach is proposed and compared with the well-known principal component analysis (PCA) approach. The artificial neural network (ANN) with the different learning algorithms is used as the classifier. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different learning algorithms and the different feature vectors (the initial and the optimized ones). Finally, a hybrid of the ANN with the ‘trainscg’ training algorithm and the genetic algorithm is proposed for the vocal fold pathology diagnosis. Also, the performance of the proposed method is compared with the recent works. The experiments' results show a better performance (the higher classification accuracy) of the proposed method in comparison with the others.
引用
收藏
相关论文
共 51 条
  • [1] Chen G(2014)The glottaltopogram: a method of analyzing high-speed images of the vocal folds Comput Speech Lang 28 1156-1169
  • [2] Kreiman J(2011)Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy J Acoust Soc Am 130 3999-4009
  • [3] Alwan A(1986)Glottography, the electrophysiological investigation of phonatory biomechanics Acta Otorhinolaryngol Belg 40 863-878
  • [4] Mehta DD(1987)Videostroboscopic evaluation of the larynx Ear Nose Throat J 66 289-296
  • [5] Zañartu M(2000)Adaptive noise energy estimation in pathological speech signals IEEE Trans Biomed Eng 47 1538-1543
  • [6] Quatieri TF(2001)Automatic detection of pathologies in the voice by HOS based parameters EURASIP J Appl Signal Process 2001 275-284
  • [7] Deliyski DD(2000)Adaptive estimation of residue signal for voice pathology diagnosis IEEE Trans Biomed Eng 47 96-104
  • [8] Hillman RE(1998)A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment IEEE Trans Biomed Eng 45 300-313
  • [9] Kitzing P(2013)A hybrid of genetic algorithm and support vector machine for feature reduction and detection of vocal fold pathology Int J Image Graph Signal Process 5 1-7
  • [10] Bless DM(2011)Voice pathology detection and discrimination based on modulation spectral features IEEE Trans Audio Speech Lang Process 19 1938-1948