Voice Pathology Detection and Classification by Adopting Online Sequential Extreme Learning Machine

Cited by: 47
Authors
Al-Dhief, Fahad Taha [1]
Baki, Marina Mat [2]
Latiff, Nurul Mu'azzah Abdul [1]
Abd Malik, Nik Noordini Nik [1]
Salim, Naseer Sabri [3]
Albadr, Musatafa Abbas Abbood [4]
Mahyuddin, Nor Muzlifah [5]
Mohammed, Mazin Abed [6]
Affiliations
[1] Univ Teknol Malaysia, Sch Elect Engn, Fac Engn, Skudai 81310, Malaysia
[2] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur 56000, Malaysia
[3] Sohar Univ, Comp & Informat Technol, Sohar 311, Oman
[4] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, CAIT, Bangi 43600, Malaysia
[5] Univ Sains Malaysia, Sch Elect & Elect Engn, Nibong Tebal 14300, Malaysia
[6] Univ Anbar, Coll Comp Sci & Informat Technol, Ramadi 45654, Iraq
Keywords
Pathology; Feature extraction; Databases; Machine learning algorithms; Medical services; Support vector machines; Classification algorithms; Machine learning; healthcare; voice pathology detection; pathologies classification; OSELM; MFCC; SVD; IDENTIFICATION; ALGORITHM; FEATURES
DOI
10.1109/ACCESS.2021.3082565
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Over the last decade, machine learning algorithms have become central to the analysis of voice disorders because they enable non-invasive voice pathology detection from the audio signal alone. Even so, most recent voice pathology systems work on a limited portion of the acoustic database: they use a single vowel, such as /a/, and ignore sentences and other vowels when analyzing the audio signal. Accuracy and the time consumption of the algorithm are other key issues that such systems should address. The Online Sequential Extreme Learning Machine (OSELM) is a machine learning algorithm that can be regarded as rapid and accurate in classification. This paper therefore presents a voice pathology detection and classification system that uses the OSELM algorithm as the classifier and Mel-frequency cepstral coefficients (MFCC) for feature extraction. The voice samples were taken from the Saarbrucken voice database (SVD). The system uses two parts of the database: the first part includes all voices in SVD with sentences and the vowels /a/, /i/, and /u/, uttered at high, low, and normal pitches; the second part uses voice samples of three common types of pathology (cyst, polyp, and paralysis) based on the vowel /a/ produced at normal pitch. The experimental results show that OSELM achieved a maximum accuracy of 91.17%, with 94% precision and 91% recall. Furthermore, OSELM obtained 87%, 87.55%, and 97.67% for F-measure, G-mean, and specificity, respectively. The proposed system also shows a strong ability to deliver detection and classification results in real-time clinical applications.
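The abstract names OSELM as the classifier but does not spell out its update rule. The following is a minimal NumPy sketch of the generic online sequential ELM recursion (random fixed hidden layer, recursive least-squares update of the output weights); it is not the authors' implementation. The class name `OSELM`, the hidden-layer size, the sigmoid activation, the small `ridge` term, and the synthetic 13-dimensional vectors standing in for MFCC features are all assumptions made for illustration.

```python
import numpy as np


class OSELM:
    """Illustrative online sequential ELM with a sigmoid hidden layer (hypothetical helper)."""

    def __init__(self, n_inputs, n_hidden, n_classes, ridge=1e-3, seed=0):
        rng = np.random.default_rng(seed)
        # Input weights and biases are drawn once and never trained.
        self.W = rng.uniform(-1.0, 1.0, size=(n_inputs, n_hidden))
        self.b = rng.uniform(-1.0, 1.0, size=n_hidden)
        self.n_classes = n_classes
        self.ridge = ridge  # assumed regulariser that keeps the initial inverse stable
        self.P = None       # running inverse of (H^T H + ridge * I)
        self.beta = None    # output weights, shape (n_hidden, n_classes)

    def _hidden(self, X):
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def _one_hot(self, y):
        return np.eye(self.n_classes)[y]

    def fit_initial(self, X, y):
        """Initialization phase: least-squares solution on the first batch."""
        H = self._hidden(X)
        self.P = np.linalg.inv(H.T @ H + self.ridge * np.eye(H.shape[1]))
        self.beta = self.P @ H.T @ self._one_hot(y)

    def partial_fit(self, X, y):
        """Sequential phase: update P and beta from one new chunk only."""
        H = self._hidden(X)
        T = self._one_hot(y)
        K = np.linalg.inv(np.eye(H.shape[0]) + H @ self.P @ H.T)
        self.P = self.P - self.P @ H.T @ K @ H @ self.P
        self.beta = self.beta + self.P @ H.T @ (T - H @ self.beta)

    def predict(self, X):
        return np.argmax(self._hidden(X) @ self.beta, axis=1)


# Synthetic two-class data: 13-dimensional vectors standing in for MFCC features.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(-2.0, 1.0, (150, 13)), rng.normal(2.0, 1.0, (150, 13))])
y = np.array([0] * 150 + [1] * 150)
order = rng.permutation(len(y))
X, y = X[order], y[order]

model = OSELM(n_inputs=13, n_hidden=20, n_classes=2)
model.fit_initial(X[:60], y[:60])       # initial batch, larger than the hidden layer
for start in range(60, 240, 30):        # six sequential chunks of 30 samples each
    model.partial_fit(X[start:start + 30], y[start:start + 30])

# Accuracy on the 60 samples that were never used for training.
accuracy = float(np.mean(model.predict(X[240:]) == y[240:]))
```

Because each `partial_fit` call touches only the current chunk and the fixed-size matrices `P` and `beta`, earlier samples never need to be stored or revisited, which is the property that makes this family of models attractive when training time matters.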
Pages: 77293-77306
Number of pages: 14