A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models

Cited by: 53
Authors
Alali, Yasminah [1]
Harrou, Fouzi [1]
Sun, Ying [1]
Affiliations
[1] King Abdullah Univ Sci & Technol KAUST, Comp Elect & Math Sci & Engn CEMSE Div, Thuwal 23955-6900, Saudi Arabia
Keywords
GAUSSIAN-PROCESSES; REGRESSION; ALGORITHMS
DOI
10.1038/s41598-022-06218-3
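Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract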
This study aims to develop an assumption-free, data-driven model to accurately forecast the spread of COVID-19. To this end, we first employed Bayesian optimization to tune the hyperparameters of Gaussian process regression (GPR), yielding an efficient GPR-based model for forecasting recovered and confirmed COVID-19 cases in two highly impacted countries, India and Brazil. However, such machine learning models do not, by themselves, consider the time dependency in the COVID-19 data series. To alleviate this limitation, we incorporated dynamic information by introducing lagged measurements when constructing the investigated machine learning models. Additionally, we assessed the contribution of the incorporated features to the COVID-19 prediction using the Random Forest algorithm. The results reveal that the proposed dynamic machine learning models yield a significant improvement. In particular, the dynamic GPR outperformed the other models (i.e., support vector regression, boosted trees, bagged trees, decision tree, Random Forest, and XGBoost), achieving an averaged mean absolute percentage error of around 0.1%. Finally, we provided the confidence level of the predicted results based on the dynamic GPR model and showed that the predictions are within the 95% confidence interval. This study presents a promising shallow and simple approach for predicting COVID-19 spread.
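The idea of adding lagged measurements, and the mean absolute percentage error (MAPE) used to score the models, can be illustrated with a minimal sketch. This is not the authors' code: the function names `make_lagged` and `mape`, the choice of two lags, and the case counts are all hypothetical, and a simple persistence baseline stands in for the regressors named in the abstract.

```python
from typing import List, Sequence, Tuple


def make_lagged(series: Sequence[float], n_lags: int) -> Tuple[List[List[float]], List[float]]:
    """Turn a univariate series into (X, y) pairs.

    Each row of X holds the n_lags values that immediately precede
    the corresponding target in y (oldest lag first).
    """
    if n_lags < 1 or n_lags >= len(series):
        raise ValueError("n_lags must be between 1 and len(series) - 1")
    X: List[List[float]] = []
    y: List[float] = []
    for t in range(n_lags, len(series)):
        X.append(list(series[t - n_lags:t]))
        y.append(series[t])
    return X, y


def mape(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent (y_true must be nonzero)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs((a - p) / a) for a, p in zip(y_true, y_pred))
    return 100.0 * total / len(y_true)


# Illustrative cumulative counts (made-up numbers, not data from the study).
cases = [100.0, 110.0, 125.0, 140.0, 160.0]

# Hypothetical choice of two lags; the abstract does not state the lag order.
X, y = make_lagged(cases, n_lags=2)

# Persistence baseline: predict each value as its most recent lag.
baseline = [row[-1] for row in X]
error = mape(y, baseline)
```

In a fuller pipeline, the rows of `X` would be passed as input features to a regressor such as GPR in place of the persistence baseline, and `mape` would be evaluated on held-out data.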
Pages: 20