An improved method for voice pathology detection by means of a HMM-based feature space transformation

被引:55
作者
Arias-Londono, Julian D. [1 ,2 ]
Godino-Llorente, Juan I. [1 ]
Saenz-Lechon, Nicolas [1 ]
Osma-Ruiz, Victor [1 ]
Castellanos-Dominguez, German [2 ]
机构
[1] Univ Politecn Madrid, Dep ICS, EUIT Telecomunicac, Madrid 28031, Spain
[2] Univ Nacl Colombia, GC&PDS, Manizales, Caldas, Colombia
关键词
Pathological voice; Hidden Markov models; Minimum classification error; Dynamic feature space transformation; TO-NOISE RATIO; DIMENSIONALITY REDUCTION; AUTOMATIC DETECTION; ACOUSTIC ANALYSIS; SYSTEM; ENERGY;
D O I
10.1016/j.patcog.2010.03.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents new a feature transformation technique applied to improve the screening accuracy for the automatic detection of pathological voices. The statistical transformation is based on Hidden Markov Models, obtaining a transformation and classification stage simultaneously and adjusting the parameters of the model with a criterion that minimizes the classification error. The original feature vectors are built up using classic short-term noise parameters and mel-frequency cepstral coefficients. With respect to conventional approaches found in the literature of automatic detection of pathological voices, the proposed feature space transformation technique demonstrates a significant improvement of the performance with no addition of new features to the original input space. In view of the results, it is expected that this technique could provide good results in other areas such as speaker verification and/or identification. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3100 / 3112
页数:13
相关论文
共 57 条
  • [1] Alonso J. B., 2001, EURASIP Journal on Applied Signal Processing, V2001, P275, DOI 10.1155/S1110865701000336
  • [2] [Anonymous], 1994, VOIC DIS DAT VERS 1
  • [3] [Anonymous], 2007, Applied multivariate statistical analysis, sixth edition M
  • [4] Baken R. J., 2000, Clinical Measurement of Speech and Voice
  • [5] Bishop CM., 1995, NEURAL NETWORKS PATT
  • [6] Acoustic analysis of pathological voices
    Boyanov, B
    Hadjitodorov, S
    [J]. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 1997, 16 (04): : 74 - 82
  • [7] BRIDLE JS, 1991, INT CONF ACOUST SPEE, P277, DOI 10.1109/ICASSP.1991.150331
  • [8] CAPPE O, 2005, SPR S STAT, P1
  • [9] CHEN W, 2007, P 29 ANN INT C IEEE
  • [10] HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features
    Chengalvarayan, R
    Deng, L
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 243 - 256