An interpretable hybrid predictive model of COVID-19 cases using autoregressive model and LSTM

被引:10
|
作者
Zhang, Yangyi [1 ]
Tang, Sui [1 ]
Yu, Guo [2 ]
机构
[1] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA
关键词
ARIMA; XGBOOST;
D O I
10.1038/s41598-023-33685-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Coronavirus Disease 2019 (COVID-19) has had a profound impact on global health and economy, making it crucial to build accurate and interpretable data-driven predictive models for COVID-19 cases to improve public policy making. The extremely large scale of the pandemic and the intrinsically changing transmission characteristics pose a great challenge for effectively predicting COVID-19 cases. To address this challenge, we propose a novel hybrid model in which the interpretability of the Autoregressive model (AR) and the predictive power of the long short-term memory neural networks (LSTM) join forces. The proposed hybrid model is formalized as a neural network with an architecture that connects two composing model blocks, of which the relative contribution is decided data-adaptively in the training procedure. We demonstrate the favorable performance of the hybrid model over its two single composing models as well as other popular predictive models through comprehensive numerical studies on two data sources under multiple evaluation metrics. Specifically, in county-level data of 8 California counties, our hybrid model achieves 4.173% MAPE, outperforming the composing AR (5.629%) and LSTM (4.934%) alone on average. In country-level datasets, our hybrid model outperforms the widely-used predictive models such as AR, LSTM, Support Vector Machines, Gradient Boosting, and Random Forest, in predicting the COVID-19 cases in Japan, Canada, Brazil, Argentina, Singapore, Italy, and the United Kingdom. In addition to the predictive performance, we illustrate the interpretability of our proposed hybrid model using the estimated AR component, which is a key feature that is not shared by most black-box predictive models for COVID-19 cases. Our study provides a new and promising direction for building effective and interpretable data-driven models for COVID-19 cases, which could have significant implications for public health policy making and control of the current COVID-19 and potential future pandemics.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms
    Luo, Junling
    Zhang, Zhongliang
    Fu, Yao
    Rao, Feng
    RESULTS IN PHYSICS, 2021, 27
  • [32] A Dynamic Nonlinear Autoregressive Exogenous Model to Analyze the Impact of Mobility during COVID-19 Pandemic on the Electricity Consumption Prediction in Jordan
    Shbool, Mohammad A.
    Altarazi, Farah
    Alalaween, Wafa' H.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2024, 83 (02): : 164 - 173
  • [33] Predictive Analytics for Detecting Sensor Failure Using Autoregressive Integrated Moving Average Model
    Thiyagarajan, Karthick
    Kodagoda, Sarath
    Linh Van Nguyen
    PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 1926 - 1931
  • [34] Forecasting COVID-19 daily cases using phone call data
    Rostami-Tabar, Bahman
    Rendon-Sanchez, Juan F.
    APPLIED SOFT COMPUTING, 2021, 100 (100)
  • [35] Short-Term Prediction of COVID-19 Using Novel Hybrid Ensemble Empirical Mode Decomposition and Error Trend Seasonal Model
    Khan, Dost Muhammad
    Ali, Muhammad
    Iqbal, Nadeem
    Khalil, Umair
    Aljohani, Hassan M. M.
    Alharthi, Amirah Saeed
    Afify, Ahmed Z. Z.
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [36] Estimating the COVID-19 prevalence and mortality using a novel data-driven hybrid model based on ensemble empirical mode decomposition
    Wang, Yongbin
    Xu, Chunjie
    Yao, Sanqiao
    Wang, Lei
    Zhao, Yingzheng
    Ren, Jingchao
    Li, Yuchun
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [37] Forecasting COVID-19: Vector Autoregression-Based Model
    Khairan Rajab
    Firuz Kamalov
    Aswani Kumar Cherukuri
    Arabian Journal for Science and Engineering, 2022, 47 : 6851 - 6860
  • [38] Forecasting COVID-19: Vector Autoregression-Based Model
    Rajab, Khairan
    Kamalov, Firuz
    Cherukuri, Aswani Kumar
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (06) : 6851 - 6860
  • [39] A Machine Learning Univariate Time series Model for Forecasting COVID-19 Confirmed Cases: A Pilot Study in Botswana
    Mphale, Ofaletse
    Okike, Ezekiel U.
    Rafifing, Neo
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 225 - 233
  • [40] Transfer Function Model for COVID-19 Deaths in USA Using Case Counts as Input Series
    Shahela, Fahmida Akter
    Uddin, Nizam
    BULLETIN OF THE MALAYSIAN MATHEMATICAL SCIENCES SOCIETY, 2022, 45 (SUPPL 1) : 461 - 475