An Efficient SMOTE-Based Deep Learning Model for Voice Pathology Detection

被引:8
作者
Lee, Ji-Na [1 ]
Lee, Ji-Yeoun [2 ]
机构
[1] Seokyeong Univ, Div Global Business Languages, Seoul 02173, South Korea
[2] Eulji Univ, Dept Bigdata Med Convergence, 553 Sanseong daero, Seongnam Si 13135, South Korea
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 06期
基金
新加坡国家研究基金会;
关键词
pathological voice; disordered voice; imbalanced learning; voice pathology classification; SMOTE; ADASYN; Borderline-SMOTE; deep learning; intelligent medical diagnosis system; DISEASE DETECTION; IMBALANCED DATA;
D O I
10.3390/app13063571
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The Saarbruecken Voice Database (SVD) is a public database used by voice pathology detection systems. However, the distributions of the pathological and normal voice samples show a clear class imbalance. This study aims to develop a system for the classification of pathological and normal voices that uses efficient deep learning models based on various oversampling methods, such as the adaptive synthetic sampling (ADASYN), synthetic minority oversampling technique (SMOTE), and Borderline-SMOTE directly applied to feature parameters. The suggested combinations of oversampled linear predictive coefficients (LPCs), mel-frequency cepstral coefficients (MFCCs), and deep learning methods can efficiently classify pathological and normal voices. The balanced datasets from ADASYN, SMOTE, and Borderline-SMOTE are used to validate and evaluate the various deep learning models. The experiments are conducted using model evaluation metrics such as the recall, specificity, G, and F1 value. The experimental results suggest that the proposed voice pathology detection (VPD) system integrating the LPCs oversampled by the SMOTE and a convolutional neural network (CNN) can effectively yield the highest accuracy at 98.89% when classifying pathological and normal voices. Finally, the performances of oversampling algorithms such as the ADASYN, SMOTE, and Borderline-SMOTE are discussed. Furthermore, the performance of SMOTE is superior to conventional imbalanced data oversampling algorithms, and it can be used to diagnose pathological signals in real-world applications.
引用
收藏
页数:16
相关论文
共 50 条
[41]   An efficient deep learning model for brain tumour detection with privacy preservation [J].
Rehman, Mujeeb Ur ;
Shafique, Arslan ;
Khan, Imdad Ullah ;
Ghadi, Yazeed Yasin ;
Ahmad, Jawad ;
Alshehri, Mohammed S. ;
Al Qathrady, Mimonah ;
Alhaisoni, Majed ;
Zayyan, Muhammad H. .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023,
[42]   Metric-Based Few-Shot Transfer Learning Approach for Voice Pathology Detection [J].
Won, Jong-Ho ;
Kim, Deok-Hwan .
IEEE ACCESS, 2024, 12 :159226-159238
[43]   An efficient hybrid weather prediction model based on deep learning [J].
Utku, A. ;
Can, U. .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2023, 20 (10) :11107-11120
[44]   An efficient hybrid weather prediction model based on deep learning [J].
A. Utku ;
U. Can .
International Journal of Environmental Science and Technology, 2023, 20 :11107-11120
[45]   An Efficient Indoor Localization Based on Deep Attention Learning Model [J].
Abozeid A. ;
Taloba A.I. ;
Abd El-Aziz R.M. ;
Alwaghid A.F. ;
Salem M. ;
Elhadad A. .
Computer Systems Science and Engineering, 2023, 46 (02) :2637-2650
[46]   An Efficient and Effective Deep Learning-Based Model for Real-Time Face Mask Detection [J].
Habib, Shabana ;
Alsanea, Majed ;
Aloraini, Mohammed ;
Al-Rawashdeh, Hazim Saleh ;
Islam, Muhammad ;
Khan, Sheroz .
SENSORS, 2022, 22 (07)
[47]   Light-YOLO: A Lightweight and Efficient YOLO-Based Deep Learning Model for Mango Detection [J].
Zhong, Zhengyang ;
Yun, Lijun ;
Cheng, Feiyan ;
Chen, Zaiqing ;
Zhang, Chunjie .
AGRICULTURE-BASEL, 2024, 14 (01)
[48]   Detection of Advertising Users Based on K-SMOTE and Ensemble Learning [J].
Qiu, Zihan ;
Zhou, Zekai ;
Long, Yongxu ;
Ji, Chang ;
Li, Jianguo ;
Tang, Yong .
HUMAN CENTERED COMPUTING, HCC 2021, 2022, 13795 :133-145
[49]   Efficient Vulnerability Detection based on abstract syntax tree and Deep Learning [J].
Feng, Hantao ;
Fu, Xiaotong ;
Sun, Hongyu ;
Wang, He ;
Zhang, Yuqing .
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, :722-727
[50]   Efficient face detection and tracking in video sequences based on deep learning [J].
Zheng, Guangyong ;
Xu, Yuming .
INFORMATION SCIENCES, 2021, 568 :265-285