Wavelet-based methodology for non-invasive detection and multiclass classification of voice disorders: a comprehensive study across multilingual datasets

Cited by: 0
Authors
Shrivas, Avinash [1]
Deshpande, Shrinivas [2]
Gidaye, Girish [3]
Affiliations
[1] HVPM, PG Dept Comp Sci & Technol, DCPE, Amravati, India
[2] HVPM, Dept Comp Sci & Technol, DCPE, Amravati, India
[3] Vidyalankar Inst Technol, Mumbai 400037, India
Keywords
voice disorder; wavelet transform; statistical features; multiclass classification; pathology detection; complexity measures; automatic detection; speech; dysphonia; features; identification; impairments; population; prevalence
DOI
10.1504/IJBET.2024.143289
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Impaired voice function affects 1.2% of the global population and is often diagnosed through invasive procedures. Past efforts in automated voice disorder detection mainly addressed the binary 'healthy vs. unhealthy' classification. In this study, we propose a non-invasive alternative based on speech analysis, in contrast to conventional invasive methods. Both binary and multiclass classification are carried out by decomposing speech signals from German, Spanish, English, and Arabic datasets using the discrete wavelet transform (DWT). The decomposition level clearly affects detection and classification accuracy: the fifth level yields the highest recognition rates, 90% to 99%, for voice disorder identification and multiclass classification. The results indicate that energy and statistical features derived from the DWT offer richer information on pathological voices. Consequently, the proposed system could serve as a valuable adjunct for clinical diagnosis of laryngeal pathologies.
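The feature extraction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract names neither the mother wavelet nor the exact statistical features, so the Haar wavelet, the choice of energy, mean, and standard deviation, and the helper names `haar_dwt_step`, `subband_stats`, and `dwt_features` are all assumptions made for illustration. The input is a synthetic signal, not data from any of the datasets.

```python
import math
import statistics


def haar_dwt_step(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    if len(signal) % 2:
        # Pad odd-length input by repeating the last sample (an assumption).
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def subband_stats(coeffs):
    """Energy plus two simple statistics of one subband (illustrative choice)."""
    return {
        "energy": sum(c * c for c in coeffs),
        "mean": statistics.fmean(coeffs),
        "std": statistics.pstdev(coeffs),
    }


def dwt_features(signal, levels=5):
    """Decompose `signal` to `levels` levels and describe every subband.

    Returns a dict keyed D1..D<levels> (detail bands) and A<levels>
    (final approximation band).
    """
    features = {}
    approx = list(signal)
    for level in range(1, levels + 1):
        approx, detail = haar_dwt_step(approx)
        features[f"D{level}"] = subband_stats(detail)
    features[f"A{levels}"] = subband_stats(approx)
    return features


# Synthetic stand-in for a voiced frame: two sinusoids, 1024 samples.
signal = [
    math.sin(2 * math.pi * 5 * n / 1024) + 0.1 * math.sin(2 * math.pi * 200 * n / 1024)
    for n in range(1024)
]
feats = dwt_features(signal, levels=5)
for band, stats in feats.items():
    print(band, {k: round(v, 4) for k, v in stats.items()})
```

A five-level decomposition gives six subbands (D1 to D5 and A5), so this sketch produces 18 values per signal, which could then be passed to a classifier. Because the Haar transform used here is orthonormal, the subband energies sum to the energy of the input signal, which is a convenient sanity check.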
Pages: 323-347
Page count: 26