Voice Pathology Detection and Classification by Adopting Online Sequential Extreme Learning Machine

被引：50

作者：

Al-Dhief, Fahad Taha ^{[1
]}

Baki, Marina Mat ^{[2
]}

Latiff, Nurul Mu'azzah Abdul ^{[1
]}

Abd Malik, Nik Noordini Nik ^{[1
]}

Salim, Naseer Sabri ^{[3
]}

Albader, Musatafa Abbas Abbood ^{[4
]}

Mahyuddin, Nor Muzlifah ^{[5
]}

Mohammed, Mazin Abed ^{[6
]}

机构：

[1] Univ Teknol Malaysia, Sch Elect Engn, Fac Engn, Skudai 81310, Malaysia

[2] Univ Kebangsaan, Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur 56000, Malaysia

[3] Sohar Univ, Comp & Informat Technol, Sohar 311, Oman

[4] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, CAIT, Bangi 43600, Malaysia

[5] Univ Sains Malaysia, Sch Elect & Elect Engn, Nibong Tebal 14300, Malaysia

[6] Univ Anbar, Coll Comp Sci & Informat Technol, Ramadi 45654, Iraq

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Pathology; Feature extraction; Databases; Machine learning algorithms; Medical services; Support vector machines; Classification algorithms; Machine learning; healthcare; voice pathology detection; pathologies classification; OSELM; MFCC; SVD; IDENTIFICATION; ALGORITHM; FEATURES;

D O I：

10.1109/ACCESS.2021.3082565

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the last decade, the implementation of machine learning algorithms in the analysis of voice disorder is paramount in order to provide a non-invasive voice pathology detection by only using audio signal. In spite of that, most recent systems of voice pathology work on a limited acoustic database. In other words, the systems use one vowel, such as /a/, and ignore sentences and other vowels when analyzing the audio signal. Other key issues that should be considered in the systems are accuracy and time consumption of an algorithm. Online Sequential Extreme Learning Machine (OSELM) is one of the machine learning algorithms that can be regarded as a rapid and accurate algorithm in the classification process. Therefore, this paper presents a voice pathology detection and classification system by using OSELM algorithm as a classifier, and Mel-frequency cepstral coefficient (MFCC) as a featured extraction. In this work, the voice samples were taken from the Saarbrucken voice database (SVD). This system involves two parts of the database; the first part includes all voices in SVD with sentences and vowels /a/, /i/, and /u/, which are uttered in high, low, and normal pitches; and the second part utilizes voice samples of the common three types of pathologies (cyst, polyp, and paralysis) based on the vowel /a/ that is produced in normal pitch. The experimental results have shown that OSELM was able to achieve the highest accuracy up to 91.17%, 94% of precision, and 91% of recall. Furthermore, OSELM obtained 87%, 87.55%, and 97.67% for f-measure, G-mean, and specificity, respectively. The proposed system also presents a high ability to achieve detection and classification results in real-time clinical applications.

引用

页码：77293 / 77306

页数：14

共 57 条

[1] Zero Frequency Filter Based Analysis of Voice Disorders [J].

Adiga, Nagaraj ;

Vikram, C. M. ;

Pullela, Keerthi ;

Prasanna, S. R. M. .

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :1824-1828

[2]

Ahmed E.H., 2020, Solid State Technology, V63, P8730

[3] A Survey of Voice Pathology Surveillance Systems Based on Internet of Things and Machine Learning Algorithms [J].

Al-Dhief, Fahad Taha ;

Latiff, Nurul Mu'azzah Abdul ;

Abd Malik, Nik Noordini Nik ;

Salim, Naseer Sabri ;

Baki, Marina Mat ;

Albadr, Musatafa Abbas Abbood ;

Mohammed, Mazin Abed .

IEEE ACCESS, 2020, 8 :64514-64533

[4] Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions [J].

Al-Nasheri, Ahmed ;

Muhammad, Ghulam ;

Alsulaiman, Mansour ;

Ali, Zulfiqar ;

Malki, Khalid H. ;

Mesallam, Tamer A. ;

Ibrahim, Mohamed Farahat .

IEEE ACCESS, 2018, 6 :6961-6974

[5] An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification [J].

Al-nasheri, Ahmed ;

Muhammad, Ghulam ;

Alsulaiman, Mansour ;

Ali, Zulfiqar ;

Mesallam, Tamer A. ;

Farahat, Mohamed ;

Malki, Khalid H. ;

Bencherif, Mohamed A. .

JOURNAL OF VOICE, 2017, 31 (01) :113.e9-113.e18

[6] Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions [J].

Al-nasheri, Ahmed ;

Muhammad, Ghulam ;

Alsulaiman, Mansour ;

Ali, Zulfiqar .

JOURNAL OF VOICE, 2017, 31 (01) :3-15

[7]

Al-Nasheri A, 2014, I C COMP SYST APPLIC, P50, DOI 10.1109/AICCSA.2014.7073178

[8] Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection [J].

Albadr, Musatafa Abbas Abbood ;

Tiun, Sabrina ;

Ayob, Masri ;

AL-Dhief, Fahad Taha ;

Omar, Khairuddin ;

Hamzah, Faizal Amri .

PLOS ONE, 2020, 15 (12)

[9] Spoken Language Identification Based on Particle Swarm Optimisation-Extreme Learning Machine Approach [J].

Albadr, Musatafa Abbas Abbood ;

Tiun, Sabrina .

CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (09) :4596-4622

[10] Spoken language identification based on optimised genetic algorithm-extreme learning machine approach [J].

Albadr, Musatafa Abbas Abbood ;

Tiun, Sabrina ;

Ayob, Masri ;

AL-Dhief, Fahad Taha .

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) :711-727

← 1 2 3 4 5 6 →