Class-Imbalanced Voice Pathology Detection and Classification Using Fuzzy Cluster Oversampling Method

Cited by: 18
Authors
Fan, Ziqi [1]
Wu, Yuanbo [1]
Zhou, Changwei [1]
Zhang, Xiaojun [1]
Tao, Zhi [1]
Affiliations
[1] Soochow Univ, Sch Optoelect Sci & Engn, Suzhou 215000, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2021 / Vol. 11 / Issue 08
Funding
National Natural Science Foundation of China
Keywords
imbalanced learning; voice pathology detection and classification; SMOTE; intelligence medical diagnosis system
DOI
10.3390/app11083450
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
The Massachusetts Eye and Ear Infirmary (MEEI) database is an international-standard training database for voice pathology detection (VPD) systems. However, there is a class-imbalanced distribution in normal and pathological voice samples and different types of pathological voice samples in the MEEI database. This study aimed to develop a VPD system that uses the fuzzy clustering synthetic minority oversampling technique algorithm (FC-SMOTE) to automatically detect and classify four types of pathological voices in a multi-class imbalanced database. The proposed FC-SMOTE algorithm processes the initial class-imbalanced dataset. A set of machine learning models was evaluated and validated using the resulting class-balanced dataset as an input. The effectiveness of the VPD system with FC-SMOTE was further verified by an external validation set and another pathological voice database (Saarbruecken Voice Database (SVD)). The experimental results show that, in the multi-classification of pathological voice for the class-imbalanced dataset, the method we propose can significantly improve the diagnostic accuracy. Meanwhile, FC-SMOTE outperforms the traditional imbalanced data oversampling algorithms, and it is preferred for imbalanced voice diagnosis in practical applications.
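The oversampling idea that the abstract builds on can be illustrated with a minimal sketch of plain SMOTE-style interpolation. This is not the authors' FC-SMOTE: the fuzzy clustering step is omitted, and the function name `smote_oversample`, its parameters, and the toy data are assumptions made for illustration only.

```python
import numpy as np


def smote_oversample(X_min, n_new, k=5, seed=0):
    """Plain SMOTE-style sketch (hypothetical helper, not FC-SMOTE).

    Each synthetic sample lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    if n < 2:
        raise ValueError("need at least two minority samples")
    k = min(k, n - 1)

    # Pairwise squared Euclidean distances within the minority class.
    dist = ((X_min[:, None, :] - X_min[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(dist, np.inf)  # a sample is not its own neighbour
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Pick a base sample and one of its neighbours for each new point.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Interpolate with a random gap in [0, 1).
    gap = rng.random((n_new, 1))
    return X_min[base] + gap * (X_min[picked] - X_min[base])


# Toy minority class: 10 two-dimensional feature vectors (made-up data).
minority = np.random.default_rng(1).normal(size=(10, 2))
synthetic = smote_oversample(minority, n_new=30)
print(synthetic.shape)
```

Because every synthetic point is a convex combination of two existing minority samples, the new points stay inside the region already covered by that class. The abstract indicates that FC-SMOTE adds fuzzy clustering to this kind of oversampling; how the two are combined is not described in this record.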
Pages: 21