Zero Frequency Filter Based Analysis of Voice Disorders

被引:6
作者
Adiga, Nagaraj [1 ]
Vikram, C. M. [1 ]
Pullela, Keerthi [2 ]
Prasanna, S. R. M. [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati, India
[2] VIT Univ, Chennai, India
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
Zero-frequency filter; epoch-based features; jitter; and shimmer; AUTOMATIC DETECTION; SPEECH;
D O I
10.21437/Interspeech.2017-589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pitch period and amplitude perturbations are widely used parameters to discriminate normal and voice disorder speech. Instantaneous pitch period and amplitude of glottal vibrations directly from the speech waveform may not give an accurate estimation of jitter and shimmer. In this paper, the significance of epochs (glottal closure instants) and strength of excitation (SoE) derived from the zero-frequency filter (ZFF) are exploited to discriminate the voice disorder and normal speech. Pitch epoch derived from ZFF is used to compute the jitter, and SoE derived around each epoch is used compute the shimmer. The derived epoch-based features are analyzed on the some of the voice disorders like Parkinson's disease. vocal fold paralysis. cyst, and gastroesophageal reflux disease. The significance of proposed epoch-based features for discriminating normal and pathological voices is analyzed and compared with the state-of-the-art methods using a support vector machine classifier. The results show that epoch-based features performed significantly better than other methods both in clean and noisy conditions.
引用
收藏
页码:1824 / 1828
页数:5
相关论文
共 20 条
[1]   Detection of Glottal Activity Using Different Attributes of Source Information [J].
Adiga, Nagaraj ;
Prasanna, S. R. M. .
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (11) :2107-2111
[2]  
Al-nasheri A., 2016, J VOICE
[3]   Identification of Voice Disorders Using Long-Time Features and Support Vector Machine With Different Feature Reduction Methods [J].
Arjmandi, Meisam Khalil ;
Pooyan, Mohammad ;
Mikaili, Mohammad ;
Vali, Mansour ;
Moqarehzadeh, Alireza .
JOURNAL OF VOICE, 2011, 25 (06) :E275-E289
[4]  
Boersma P., 2018, Praat: doing phonetics by computer (Version 5.3) Computer software
[5]  
Deepak K., 2015, P TENCON, P1
[6]   Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review [J].
Drugman, Thomas ;
Thomas, Mark ;
Gudnason, Jon ;
Naylor, Patrick ;
Dutoit, Thierry .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (03) :994-1006
[7]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[8]   Automatic detection of voice impairments from text-dependent running speech [J].
Godino-Llorente, J. I. ;
Fraile, Ruben ;
Saenz-Lechon, N. ;
Osma-Ruiz, V. ;
Gomez-Vilda, P. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) :176-182
[9]   The Effectiveness of the Glottal to Noise Excitation Ratio for the Screening of Voice Disorders [J].
Ignacio Godino-Llorente, Juan ;
Osma-Ruiz, Victor ;
Saenz-Lechon, Nicolas ;
Gomez-Vilda, Pedro ;
Blanco-Velasco, Manuel ;
Cruz-Roldan, Fernando .
JOURNAL OF VOICE, 2010, 24 (01) :47-56