Automatic COVID-19 prediction using explainable machine learning techniques

被引:0
作者
Solayman S. [1 ]
Aumi S.A. [1 ]
Mery C.S. [1 ]
Mubassir M. [1 ]
Khan R. [1 ]
机构
[1] Electrical and Computer Engineering, North South University, Dhaka
来源
International Journal of Cognitive Computing in Engineering | 2023年 / 4卷
关键词
CNN-LSTM; COVID-19; Deep learning; Explainable AI; Hyperparameter optimization; Machine learning; Support vector machine; Webpage;
D O I
10.1016/j.ijcce.2023.01.003
中图分类号
学科分类号
摘要
The coronavirus is considered this century's most disruptive catastrophe and global concern. This disease has prompted extreme social, psychological and economic impacts affecting millions of people around the globe. COVID-19 is transmitted from one infected person's body to another through respiratory droplets. This virus proliferates when people breathe in air-contaminated space with droplets and microscopic airborne particles. This research aims to analyze automatic COVID-19 detection using machine learning techniques to build an intelligent web application. The dataset has been preprocessed by dropping null values, feature engineering, and synthetic oversampling (SMOTE) techniques. Next, we trained and evaluated different classifiers, i.e., logistic regression, random forest, decision tree, k-nearest neighbor, support vector machine (SVM), ensemble models (adaptive boosting and extreme gradient boosting) and deep learning (artificial neural network, convolutional neural network and long short-term memory) techniques. Explainable AI with the LIME framework has been applied to interpret the prediction results. The hybrid CNN-LSTM algorithm with the SMOTE approach performed better than the other models on the employed open-source dataset obtained from the Israeli Ministry of Health website, with 96.34% accuracy and a 0.98 F1 score. Finally, this model was chosen to deploy the proposed prediction system to a website, where users may acquire an instantaneous COVID-19 prognosis based on their symptoms. © 2023 The Authors
引用
收藏
页码:36 / 46
页数:10
相关论文
共 39 条
[1]  
Jin Y., Yang H., Ji W., Wu W., Chen S., Zhang W., Et al., Virology, epidemiology, pathogenesis, and control of COVID-19, Viruses, 12, pp. 1-17, (2020)
[2]  
COVID L.U.
[3]  
Darapaneni N., Singh A., Paduri A., Ranjith A., Kumar A., Dixit D., Et al., pp. 375-380, (2020)
[4]  
Kaiser M.S., Mahmud M., Noor M.B.T., Zenia N.Z., Mamun S.A., Et al., iWorksafe: Towards healthy workplaces during COVID-19 with an intelligent phealth app for industrial settings, IEEE Access, 9, pp. 13814-13828, (2021)
[5]  
Koshti D., Kamoji S., Cheruthuruthy K., Shahi S.P., Mishra M., A detection, tracking and alerting system for Covid-19 using geo-fencing and machine learning, International Conference on Intelligent Computing and Control Systems, pp. 1499-1506, (2021)
[6]  
Udawat B., Santani A., Agrawal S., Occlusion detection for COVID-19 identification: A review, International Conference on Advanced Computing and Communication Systems, pp. 298-301, (2021)
[7]  
Zoabi Y., Deri-Rozov S., Shomron N., Machine learning-based prediction of COVID-19 diagnosis based on symptoms, npj Digital Medicine, 4, pp. 1-5, (2021)
[8]  
Rustam Z., Zhafarina F., Saragih G.S., Hartini S., Pancreatic cancer classification using logistic regression and random forest, International Journal of Artificial Intelligence, 10, pp. 476-481, (2021)
[9]  
Ahmed N., Ahammed R., Islam M.M., Uddin M.A., Akhter A., Talukder M.A., Et al., Machine learning based diabetes prediction and development of smart web application, International Journal of Cognitive Computing in Engineering, 2, pp. 229-241, (2021)
[10]  
Nurhachita N., Negara E.S., A comparison between deep learning, naïve bayes and random forest for the application of data mining on the admission of new students, International Journal of Artificial Intelligence, 10, pp. 324-331, (2021)