The UCD System for the 2018 FEMH Voice Data Challenge

被引:0
作者
Degila, Kevin [1 ]
Errattahi, Rahhal [1 ]
El Hannani, Asmaa [1 ]
机构
[1] Univ Chouaib Doukkali, Natl Sch Appl Sci El Jadida, Informat Technol Lab, El Jadida, Morocco
来源
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2018年
关键词
FEMH Voice Data; Pathological Voice Detection; MFCC; Spectral Features; SVM; AUTOMATIC DETECTION; PARAMETERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We are all exposed to a risk of voice disorders at some point in our life, as we use our voice daily for speaking, singing, laughing and more. Voice disorders are often characterized by difficulty when breathing, a vocal fatigue, a sore throat, changes in voice such as hoarseness, repetitive cough, and difficulty when swallowing. When these disorders occur, a change in the acoustic characteristics can be noticed. It is then possible to diagnose these diseases based on the automatic analysis of the acoustic quality of the voice signal. In this paper, we present the National School of Applied Sciences of El Jadida - University of Chouaib Doukkali (UCD) system for automatic voice disorder classification. The system was developed for participation in the 2018 FEMH Voice Data Challenge. For this task, we used audio recorded data from the patients of the Far Eastern Memorial Hospital (FEMH). Different types of features are investigated; the Mel Frequency Cepstral Coefficients, the zero-crossing rate, the roll-off frequency, the spectral centroid, the spectral bandwidth and the spectral contrast. All of these features are normalized using summary statistics such as mean, standard deviation, skewness, maximum, minimum and the median. Experimental results show that the Support Vector Machine outperforms the other classifiers and that the best performance of our classifier is reached when using MFCC with spectral features.
引用
收藏
页码:5242 / 5246
页数:5
相关论文
共 19 条
[1]  
[Anonymous], 2015, 16 ANN C INT SPEECH
[2]  
[Anonymous], SYST SIGN DEV SSD 20
[3]  
[Anonymous], 2000, USE ZERO CROSSING RA
[4]   Automatic Detection of Pathological Voices Using Complexity Measures, Noise Parameters, and Mel-Cepstral Coefficients [J].
Arias-Londono, Julian D. ;
Godino-Llorente, Juan I. ;
Saenz-Lechon, Nicolas ;
Osma-Ruiz, Victor ;
Castellanos-Dominguez, German .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2011, 58 (02) :370-379
[5]   On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices [J].
Arias-Londono, Julian David ;
Godino-Llorente, Juan I. ;
Markaki, Maria ;
Stylianou, Yannis .
LOGOPEDICS PHONIATRICS VOCOLOGY, 2011, 36 (02) :60-69
[6]  
Belhaj Asma, 2014, International Journal of Computer and Information Technology, V3, P1083
[7]  
CHEN C.-H., 1988, SIGNAL PROCESSING HD
[8]  
Dieleman Sander, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6964, DOI 10.1109/ICASSP.2014.6854950
[9]  
Fang Shih-Hau., 2018, Journal of Voice
[10]  
Firdos S., 2016, COGNITIVE COMPUTING, P1