Speech as a Biomarker for COVID-19 Detection Using Machine Learning

Cited by: 9
Authors
Usman, Mohammed [1]
Gunjan, Vinit Kumar [2]
Wajid, Mohd [3]
Zubair, Mohammed [1]
Siddiquee, Kazy Noor-e-alam [4]
Affiliations
[1] King Khalid Univ, Dept Elect Engn, Abha 61411, Saudi Arabia
[2] CMR Inst Technol, Dept Comp Sci & Engn, Hyderabad, India
[3] Aligarh Muslim Univ, Dept Elect Engn, ZHCET, Aligarh 202002, Uttar Pradesh, India
[4] Univ Sci & Technol, Dept Comp Sci & Engn, Chittagong, Bangladesh
Keywords
RESPIRATORY SINUS ARRHYTHMIA; ARTIFICIAL-INTELLIGENCE; MORTALITY RISK; CLASSIFICATION; PREDICTION; REGRESSION; DIAGNOSIS; RECOGNITION; EXTRACTION; HEARTBEAT
DOI
10.1155/2022/6093613
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
This study investigates the use of speech as a biomedical signal for diagnosing COVID-19, using statistical analysis of speech spectral features and machine learning-based classification algorithms. It is established that the spectral features of speech, obtained by computing the short-time Fourier transform (STFT), are altered in a statistical sense as a result of physiological changes. These spectral features are then used as inputs to machine learning classifiers, which label each sample as coming from a COVID-19 positive individual or not. Speech samples from healthy individuals as well as "asymptomatic" COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher for speech samples from COVID-19 positive individuals than for speech samples from healthy individuals. Five state-of-the-art machine learning classification algorithms are analyzed, and their performance evaluation metrics are presented. The model parameters are tuned to minimize the misclassification of COVID-19 positive individuals as COVID-19 negative, since the cost of this misclassification is higher than that of the opposite error. The best performance in terms of the "recall" metric is observed for the Decision Forest algorithm, which gives a recall value of 0.7892.
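The asymmetric-cost idea in the abstract (a missed COVID-19 positive case costs more than a false alarm) can be illustrated with a small sketch. This is not the authors' procedure: the abstract says model parameters were tuned, whereas the sketch below substitutes a simpler technique, choosing a decision threshold on classifier scores. The function names, the cost values (`fn_cost`, `fp_cost`), and the toy labels and scores are all hypothetical.

```python
from typing import List, Tuple


def confusion_counts(labels: List[int], scores: List[float],
                     threshold: float) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn); a score >= threshold is predicted positive."""
    tp = fp = fn = tn = 0
    for y, s in zip(labels, scores):
        pred = 1 if s >= threshold else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def recall(tp: int, fn: int) -> float:
    """Recall = TP / (TP + FN): the fraction of true positives that are caught."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def pick_threshold(labels: List[int], scores: List[float],
                   fn_cost: float = 5.0, fp_cost: float = 1.0) -> float:
    """Pick the candidate threshold with the lowest total misclassification cost.

    fn_cost > fp_cost encodes the assumption that missing a positive case
    is worse than raising a false alarm (the cost values are illustrative).
    """
    best_t, best_cost = None, None
    for t in sorted(set(scores)):
        _, fp, fn, _ = confusion_counts(labels, scores, t)
        cost = fn_cost * fn + fp_cost * fp
        if best_cost is None or cost < best_cost:
            best_t, best_cost = t, cost
    return best_t


# Hypothetical toy data: 1 = COVID-19 positive, 0 = healthy.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.6, 0.35, 0.4, 0.2, 0.1, 0.8, 0.55]

t_weighted = pick_threshold(labels, scores)             # false negatives cost 5x
t_equal = pick_threshold(labels, scores, 1.0, 1.0)      # equal costs, for contrast

tp_w, _, fn_w, _ = confusion_counts(labels, scores, t_weighted)
tp_e, _, fn_e, _ = confusion_counts(labels, scores, t_equal)
recall_weighted = recall(tp_w, fn_w)
recall_equal = recall(tp_e, fn_e)
```

On this toy data, weighting false negatives more heavily moves the threshold lower and raises recall at the price of more false positives, which is the trade-off the abstract describes.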
Pages: 12