Vaccine hesitancy in the post-vaccination COVID-19 era: a machine learning and statistical analysis driven study

被引:3
作者
Gupta, Himanshu [1 ]
Verma, Om Prakash [1 ]
机构
[1] Dr BR Ambedkar Natl Inst Technol, Dept Instrumentat & Control Engn, Jalandhar, Punjab, India
关键词
COVID-19; Machine Learning; Predictive Analysis; SARS-CoV-2; Statistical Analysis; VAERS;
D O I
10.1007/s12065-022-00704-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background The COVID-19 pandemic has badly affected people of all ages globally. Therefore, its vaccine has been developed and made available for public use in unprecedented times. However, because of various levels of hesitancy, it did not have general acceptance. The main objective of this work is to identify the risk associated with the COVID-19 vaccines by developing a prognosis tool that will help in enhancing its acceptability and therefore, reducing the lethality of SARS-CoV-2. Methods: The obtained raw VAERS dataset has three files indicating medical history, vaccination status, and post vaccination symptoms respectively with more than 354 thousand samples. After pre-processing, this raw dataset has been merged into one with 85 different attributes however, the whole analysis has been subdivided into three scenarios ((i) medical history (ii) reaction of vaccination (iii) combination of both). Further, Machine Learning (ML) models which includes Linear Regression (LR), Random Forest (RF), Naive Bayes (NB), Light Gradient Boosting Algorithm (LGBM), and Multilayer feed-forward perceptron (MLP) have been employed to predict the most probable outcome and their performance has been evaluated based on various performance parameters. Also, the chi-square (statistical), LR, RF, and LGBM have been utilized to estimate the most probable attribute in the dataset that resulted in death, hospitalization, and COVID-19. Results: For the above mentioned scenarios, all the models estimates different attributes (such as cardiac arrest, Cancer, Hyperlipidemia, Kidney Disease, Diabetes, Atrial Fibrillation, Dementia, Thyroid, etc.) for death, hospitalization, and COVID-19 even after vaccination. Further, for prediction, LGBM outperforms all the other developed models in most of the scenarios whereas, LR, RF, NB, and MLP perform satisfactorily in patches. Conclusion: The male population in the age group of 50-70 has been found most susceptible to this virus. Also, people with existing serious illnesses have been found most vulnerable. Therefore, they must be vaccinated in close observations. Generally, no serious adverse effect of the vaccine has been observed therefore, people must vaccinate themselves without any hesitation at the earliest. Also, the model developed using LGBM establishes its supremacy over all the other prediction models. Therefore, it can be very helpful for the policymakers in administrating and prioritizing the population for the different vaccination programs.
引用
收藏
页码:739 / 757
页数:19
相关论文
共 22 条
[11]  
Khan AH, 2020, SMART INNOV SYST TEC, V169, P1, DOI [10.1007/978-981-15-1616-0_1, 10.1007/s41870-020-00495-9]
[12]   A Novel YOLOv3 Algorithm-Based Deep Learning Approach for Waste Segregation: Towards Smart Waste Management [J].
Kumar, Saurav ;
Yadav, Drishti ;
Gupta, Himanshu ;
Verma, Om Prakash ;
Ansari, Irshad Ahmad ;
Ahn, Chang Wook .
ELECTRONICS, 2021, 10 (01) :1-20
[13]   Strategy for COVID-19 vaccination in India: the country with the second highest population and number of cases [J].
Kumar, Velayudhan Mohan ;
Pandi-Perumal, Seithikurippu R. ;
Trakht, Ilya ;
Thyagarajan, Sadras Panchatcharam .
NPJ VACCINES, 2021, 6 (01)
[14]   REVIEW OF NEURAL NETWORK APPLICATIONS IN MEDICAL IMAGING AND SIGNAL-PROCESSING [J].
MILLER, AS ;
BLOTT, BH ;
HAMES, TK .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 1992, 30 (05) :449-464
[15]   SDN-IoT empowered intelligent framework for industry 4.0 applications during COVID-19 pandemic [J].
Rahman, Anichur ;
Chakraborty, Chinmay ;
Anwar, Adnan ;
Karim, Md. Razaul ;
Islam, Md. Jahidul ;
Kundu, Dipanjali ;
Rahman, Ziaur ;
Band, Shahab S. .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04) :2351-2368
[16]   Strategy to identify priority groups for COVID-19 vaccination: A population based cohort study [J].
Russo, Antonio Giampiero ;
Decarli, Adriano ;
Valsecchi, Maria Grazia .
VACCINE, 2021, 39 (18) :2517-2525
[17]  
Salmi N, CLASSIFIER MODELS PR, DOI [10.1088/1757-899X/546/5/052068, DOI 10.1088/1757-899X/546/5/052068]
[18]   Random Forest Algorithm for the Classification of Neuroimaging Data in Alzheimer's Disease: A Systematic Review [J].
Sarica, Alessia ;
Cerasa, Antonio ;
Quattrone, Aldo .
FRONTIERS IN AGING NEUROSCIENCE, 2017, 9
[19]   COVID-19 infection: Origin, transmission, and characteristics of human coronaviruses [J].
Shereen, Muhammad Adnan ;
Khan, Suliman ;
Kazmi, Abeer ;
Bashir, Nadia ;
Siddique, Rabeea .
JOURNAL OF ADVANCED RESEARCH, 2020, 24 :91-98
[20]   Reports of Anaphylaxis After Receipt of mRNA COVID-19 Vaccines in the US-December 14, 2020-January 18, 2021 [J].
Shimabukuro, Tom T. ;
Cole, Matthew ;
Su, John R. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2021, 325 (11) :1101-1102