Voice pathology detection using machine learning algorithms based on different voice databases

被引：0

作者：

Latiff, Nurul Mu'azzah Abdul ^{[1
]}

Al-Dhief, Fahad Taha ^{[1
,2
]}

Sazihan, Nurul Fariesya Suhaila Md ^{[1
]}

Baki, Marina Mat ^{[3
]}

Abd Malik, Nik Noordini Nik ^{[1
]}

Albadr, Musatafa Abbas Abbood ^{[4
]}

Abbas, Ali Hashim ^{[5
]}

机构：

[1] Univ Teknol Malaysia, Fac Elect Engn, Fac Engn, Utm Johor Bahru, Johor, Malaysia

[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia

[3] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur, Malaysia

[4] Basrah Univ Oil & Gas, Coll Ind Management Oil & Gas, Dept Petr Project Management, Al Basrah, Iraq

[5] Imam Jaafar Al Sadiq Univ, Coll Informat Technol, Dept Comp Tech Engn, Al Muthanna, Iraq

来源：

RESULTS IN ENGINEERING | 2025年 / 25卷

关键词：

Machine learning; Voice pathology detection; OSELM; SVM; DT; NB; MFCC; SVD; MVPD; CLASSIFICATION; TRANSFORM; FEATURES;

D O I：

10.1016/j.rineng.2025.103937

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The application of machine learning in analyzing voice disorders has become crucial for non-invasive voice pathology detection using voice signals. However, current systems face challenges such as low detection accuracy, limited databases, and evaluation metrics. More importantly, most existing studies rely on training and testing algorithms based on the same database, limiting their applicability in real-world scenarios with diverse data sources. Unlike traditional approaches that focus solely on single-database training and testing, this study presents a cross-database evaluation strategy to assess the robustness and generalizability of machine learning algorithms for voice pathology detection. Several algorithms, including Online Sequential Extreme Learning Machine (OSELM), Support Vector Machine (SVM), Decision Tree (DT), and Na & iuml;ve Bayes (NB), were evaluated using two databases: the Saarbrucken Voice Database (SVD) and the Malaysian Voice Pathology Database (MVPD). Two scenarios were considered: (1) training and testing on the same database and (2) training on one database and testing on another. The proposed study uses the Mel-Frequency Cepstral Coefficient (MFCC) technique for extracting features from voices. The algorithms are assessed using many evaluation metrics such as accuracy, precision, sensitivity, specificity, F-measure, and G-mean. Experimental results demonstrate that the OSELM algorithm achieves superior performance across both scenarios, with accuracies of up to 85.71 % in Scenario 1 and 80.77 % in Scenario 2, outperforming other algorithms. This novel approach highlights the reliability of OSELM and the importance of cross-database testing for developing robust and generalizable voice pathology detection systems.

引用

页数：14

共 74 条

[21] Machine Learning for Medical Imaging1
Erickson, Bradley J.
Korfiatis, Panagiotis
Akkus, Zeynettin
Kline, Timothy L.
[J]. RADIOGRAPHICS, 2017, 37 (02) : 505 - 515
[22] Class-Imbalanced Voice Pathology Detection and Classification Using Fuzzy Cluster Oversampling Method
Fan, Ziqi
Wu, Yuanbo
Zhou, Changwei
Zhang, Xiaojun
Tao, Zhi
[J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
[23] Fatima M., 2017, Journal of Intelligent Learning Systems and Applications, V09, P1, DOI [DOI 10.4236/JILSA.2017.91001, 10.4236/jilsa.2017]
[24] Voice Quality Assessment by Simulating GRBAS Scoring
Gadepalli, Chaitanya
Jalalinajafabadi, Farideh
Xie, Zheng
Cheetham, Barry M. G.
Homer, Jarrod J.
[J]. UKSIM-AMSS 11TH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2017), 2017, : 107 - 111
[25] Voice Pathologies Classification and Detection Using EMD-DWT Analysis Based on Higher Order Statistic Features
Hammami, I
Salhi, L.
Labidi, S.
[J]. IRBM, 2020, 41 (03) : 161 - 171
[26] Harar P., 2017, 2017 INT C WORKSHOP, P1, DOI 10.1109/iwobi.2017.7985525
[27] TIME-DOMAIN FEATURES AND PROBABILISTIC NEURAL NETWORK FOR THE DETECTION OF VOCAL FOLD PATHOLOGY
Hariharan, M.
Paulraj, M. P.
Yaacob, Sazali
[J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2010, 23 (01) : 60 - 67
[28] The Effect of Noise on Deep Learning for Classification of Pathological Voice
Hasebe, Koki
Kojima, Tsuyoshi
Fujimura, Shintaro
Tamura, Keiichi
Kawai, Yoshitaka
Kishimoto, Yo
Omori, Koichi
[J]. LARYNGOSCOPE, 2024, 134 (08) : 3537 - 3541
[29] A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders
Hegde, Sarika
Shetty, Surendra
Rai, Smitha
Dodderi, Thejaswi
[J]. JOURNAL OF VOICE, 2019, 33 (06) : 947.e11 - 947.e33
[30] Healthcare Big Data Voice Pathology Assessment Framework
Hossain, M. Shamim
Muhammad, Ghulam
[J]. IEEE ACCESS, 2016, 4 : 7806 - 7815

← 1 2 3 4 5 6 7 8 →