Forecasting and classification of new cases of COVID 19 before vaccination using decision trees and Gaussian mixture model

被引:12
作者
Hamdi, Monia [1 ]
Hilali-Jaghdam, Ines [2 ]
Elnaim, Bushra Elamin [3 ]
Elhag, Azhari A. [4 ]
机构
[1] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia
[2] Princess Nourah bint Abdulrahman Univ, Appl Coll, Dept Comp Sci & Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia
[3] Prince Sattam bin Abdulaziz Univ, Coll Sci & Humanities Al Sulail, Dept Comp Sci, Al Kharj, Saudi Arabia
[4] Taif Univ, Coll Sci, Dept Math, POB 11099, Taif 21944, Saudi Arabia
关键词
Gaussian Mixture Model (GMM); Decision Tree (DT); Machine Learning (ML); Chi-Squared Automatic Interaction Detection (CHAID);
D O I
10.1016/j.aej.2022.07.011
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Regarding the pandemic taking place in the world from the spread of the Coronavirus pandemic and viral mutations, the need has arisen to analyze the epidemic data in terms of numbers of infected and deaths, different geographical regions, and the dynamics of the spread of the virus. In China, the total number of reported infections is 224,659 on June 11, 2022. In this paper, the Gaussian Mixture Model and the decision tree method were used to classify and predict new cases of the virus. Although we focus mainly on the Chinese case, the model is general and adapted to any context without loss of validity of the qualitative results. The Chi-Squared (v2) Automatic Interac-tion Detection (CHAID) was applied in creating the decision tree structure, the data has been clas-sified into five classes, according to the BIC criterion. The best mixture model is the E (Equal variance) with five components. The considered data sets of the world health organization (WHO) were used from January 5, 2020, to 12, November 2021. We provide numerical results based on the Chinese case.
引用
收藏
页码:327 / 333
页数:7
相关论文
共 23 条
[1]   Free convection effect on oscillatory flow using artificial neural networks and statistical techniques [J].
Abo-Dahab, S. M. ;
Ragab, Mahmoud ;
Elhag, Azhari A. ;
Abdel-Khalek, S. .
ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (05) :3599-3608
[2]   Empirical Evaluation of Alternative Time-Series Models for COVID-19 Forecasting in Saudi Arabia [J].
Al-Turaiki, Isra ;
Almutlaq, Fahad ;
Alrasheed, Hend ;
Alballa, Norah .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (16)
[3]   Mathematical COVID-19 model with vaccination: a case study in Saudi Arabia [J].
Algarni, Abeer D. ;
Ben Hamed, Aws ;
Hamdi, Monia ;
Elmannai, Hela ;
Meshoul, Souham .
PEERJ COMPUTER SCIENCE, 2022, 8 :1-20
[4]  
Aljohani H.M., 2021, COMPLEXITY, P1
[5]  
Alsabti K., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P2
[6]   Machine learned hybrid Gaussian analysis of COVID-19 pandemic in India [J].
Bhardwaj, Shivam ;
Alowaidi, Majed ;
Bhardwaj, Rashmi ;
Sharma, Sunil Kumar .
RESULTS IN PHYSICS, 2021, 30
[7]   On the effect of COVID-19 pandemic in the excess of human mortality. The case of Brazil and Spain [J].
Borrego-Morell, Jorge A. ;
Huertas, Edmundo J. ;
Torrado, Nuria .
PLOS ONE, 2021, 16 (09)
[8]   Covid-19 Patients' Hospital Occupancy Prediction During the Recent Omicron Wave via some Recurrent Deep Learning Architectures [J].
Bouhamed, Heni ;
Hamdi, Monia ;
Gargouri, Rahma .
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2022, 17 (03)
[9]  
Breiman L., 1999, Classification and Regression Trees
[10]   Establishing a Multiple-Criteria Decision-Making Model for Stock Investment Decisions Using Data Mining Techniques [J].
Cheng, Kuo-Chih ;
Huang, Mu-Jung ;
Fu, Cheng-Kai ;
Wang, Kuo-Hua ;
Wang, Huo-Ming ;
Lin, Lan-Hui .
SUSTAINABILITY, 2021, 13 (06)