Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection

被引:1
作者
K. M. Aslam Uddin
Farida Siddiqi Prity
Maisha Tasnim
Sumiya Nur Jannat
Mohammad Omar Faruk
Jahirul Islam
Saydul Akbar Murad
Apurba Adhikary
Anupam Kumar Bairagi
机构
[1] Noakhali Science and Technology University,Department of Information and Communication Engineering
[2] Noakhali Science and Technology University,Department of Statistics
[3] New Mexico Institute of Mining and Technology,Department of Computer Science and Engineering
[4] University of Southern Mississippi,School of Computing Sciences and Engineering
[5] Khulna University,Computing Sciences and Engineering Discipline
来源
Human-Centric Intelligent Systems | 2023年 / 3卷 / 4期
关键词
COVID-19; Socio-demographic; Behavior; Pearson Chi-square; Machine Learning;
D O I
10.1007/s44230-023-00049-9
中图分类号
学科分类号
摘要
The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.
引用
收藏
页码:441 / 460
页数:19
相关论文
共 319 条
[1]  
Rajpal S(2022)Cov-elm classifier: an extreme learning machine based identification of covid-19 using chest x-ray images Intell Decis Technol 16 193-203
[2]  
Agarwal M(2021)A new approach for classifying coronavirus COVID-19 based on its manifestation on chest X-rays using texture features and neural networks Inf Sci 545 403-414
[3]  
Rajpal A(2021)Behavioral factors associated with COVID-19 risk: a cross-sectional survey in Japan Int J Environ Res Public Health 18 12184-187
[4]  
Lakhyani N(2020)Lifestyle risk factors, inflammatory mechanisms, and COVID-19 hospitalization: a community-based cohort study of 387,109 adults in UK Brain Behav Immun 87 184-11
[5]  
Saggar A(2021)Application of the protection motivation theory for predicting COVID-19 preventive behaviors in Hormozgan, Iran: a cross-sectional study BMC Public Health 21 1-536
[6]  
Kumar N(2022)Factors associated with COVID-19 vaccination behaviour in Latvian population: cross-sectional study Health Psychol Behav Med 10 514-8
[7]  
Varela-Santos S(2020)The age-related risk of severe outcomes due to COVID-19 infection: a rapid review, meta-analysis, and meta-regression Int J Environ Res Public Health 17 5974-22
[8]  
Melin P(2021)A machine learning regression model for the screening and design of potential SARS-CoV-2 protease inhibitors Netw Model Anal Health Inform Bioinform 10 1-256
[9]  
Ochi S(2021)A novel enhanced decision tree model for detecting chronic kidney disease Netw Model Anal Health Inform Bioinform 10 1-1208
[10]  
So M(2021)Artificial intelligence and machine learning for medical imaging: a technology review Physica Med 83 242-26769