Kullback-Leibler divergence and sample skewness for pathological voice quality assessment

被引:13
作者
Barreira, Ramiro R. A. [1 ]
Ling, Lee Luan [1 ]
机构
[1] Univ Estadual Campinas, Fac Engn Eletr & Comp, Dept Comunicacoes, Av Albert Einstein 400,Cidade Univ Zeferino Vaz, BR-13083852 Campinas, SP, Brazil
关键词
Voice pathology detection; Kullback-Leibler divergence; Mel-frequency cepstral coefficients (MFCC); Generalized extreme value (GEV); distribution; Gaussian mixture models (GMM); Na ve Bayes classifier; TO-NOISE RATIO; GLOTTAL CHARACTERISTICS; AUTOMATIC DETECTION; PARAMETERS; SPEAKERS;
D O I
10.1016/j.bspc.2019.101697
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
This paper proposes new features aiming to improve the performance of an automatic voice pathology detection system. The features are designed precisely in terms of voice pathologies effects upon the speech signal. The system is intended to deliver high accuracy with a low number of parameters. Kullback-Leibler divergence (KLD) applied to consecutive frames of the speech signal provides a measure of voice instability. In this work, the KLD is applied to frame's histogram and a modified form of its spectrum named higher amplitude suppression spectrum (HASS). The H-KLD (histogram KLD) and the HASS-KLD are two of the three features presently approached. An additional feature that provides the level of damping of the voice pitch period waveform is proposed, the short-term sample skewness of the signal. The H-KLD, the HASS-KLD, and the sample skewness are features employed along with mel-frequency cepstral coefficients (MFCC) in a voice pathology detection system. The system is composed of a Gaussian mixture models (GMM) classifier and two generalized extreme value (GEV) distribution classifiers. They are fused by means of a Gaussian naive Bayes classifier. A standard subset of the Massachusetts Eye and Ear Infirmary (MEEI) voice disorders database is adopted for evaluating the system. The obtained global success rate of 99.55% shows that the proposed features are suitable for pathological voice quality assessment. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 30 条