An interpretable hybrid predictive model of COVID-19 cases using autoregressive model and LSTM

被引:10
|
作者
Zhang, Yangyi [1 ]
Tang, Sui [1 ]
Yu, Guo [2 ]
机构
[1] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA
关键词
ARIMA; XGBOOST;
D O I
10.1038/s41598-023-33685-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Coronavirus Disease 2019 (COVID-19) has had a profound impact on global health and economy, making it crucial to build accurate and interpretable data-driven predictive models for COVID-19 cases to improve public policy making. The extremely large scale of the pandemic and the intrinsically changing transmission characteristics pose a great challenge for effectively predicting COVID-19 cases. To address this challenge, we propose a novel hybrid model in which the interpretability of the Autoregressive model (AR) and the predictive power of the long short-term memory neural networks (LSTM) join forces. The proposed hybrid model is formalized as a neural network with an architecture that connects two composing model blocks, of which the relative contribution is decided data-adaptively in the training procedure. We demonstrate the favorable performance of the hybrid model over its two single composing models as well as other popular predictive models through comprehensive numerical studies on two data sources under multiple evaluation metrics. Specifically, in county-level data of 8 California counties, our hybrid model achieves 4.173% MAPE, outperforming the composing AR (5.629%) and LSTM (4.934%) alone on average. In country-level datasets, our hybrid model outperforms the widely-used predictive models such as AR, LSTM, Support Vector Machines, Gradient Boosting, and Random Forest, in predicting the COVID-19 cases in Japan, Canada, Brazil, Argentina, Singapore, Italy, and the United Kingdom. In addition to the predictive performance, we illustrate the interpretability of our proposed hybrid model using the estimated AR component, which is a key feature that is not shared by most black-box predictive models for COVID-19 cases. Our study provides a new and promising direction for building effective and interpretable data-driven models for COVID-19 cases, which could have significant implications for public health policy making and control of the current COVID-19 and potential future pandemics.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Prediction of COVID-19 Data Using an ARIMA-LSTM Hybrid Forecast Model
    Jin, Yongchao
    Wang, Renfang
    Zhuang, Xiaodie
    Wang, Kenan
    Wang, Honglian
    Wang, Chenxi
    Wang, Xiyin
    MATHEMATICS, 2022, 10 (21)
  • [2] Global Forecasting Confirmed and Fatal Cases of COVID-19 Outbreak Using Autoregressive Integrated Moving Average Model
    Dansana, Debabrata
    Kumar, Raghvendra
    Das Adhikari, Janmejoy
    Mohapatra, Mans
    Sharma, Rohit
    Priyadarshini, Ishaani
    Le, Dac-Nhuong
    FRONTIERS IN PUBLIC HEALTH, 2020, 8
  • [3] LSTM algorithm optimization for COVID-19 prediction model
    Sembiring, Irwan
    Wahyuni, Sri Ngudi
    Sediyono, Eko
    HELIYON, 2024, 10 (04)
  • [4] Hybrid Time Series Model for Advanced Predictive Analysis in COVID-19 Vaccination
    Khalil, Amna
    Awan, Mazhar Javed
    Yasin, Awais
    Kousar, Tanzeela
    Rahman, Abdur
    Youssef, Mohamed Sebaie
    ELECTRONICS, 2024, 13 (13)
  • [5] Forecasting daily Covid-19 cases in the world with a hybrid ARIMA and neural network model
    Morais, Lucas Rabelo de Araujo
    Gomes, Gecynalda Soares da Silva
    APPLIED SOFT COMPUTING, 2022, 126
  • [6] COVID-19 Pandemic Forecasting Using CNN-LSTM: A Hybrid Approach
    Zain, Zuhaira M.
    Alturki, Nazik M.
    JOURNAL OF CONTROL SCIENCE AND ENGINEERING, 2021, 2021
  • [7] Improved autoregressive integrated moving average model for COVID-19 prediction by using statistical significance and clustering techniques
    Ilu, Saratu Yusuf
    Prasad, Rajesh
    HELIYON, 2023, 9 (02)
  • [8] Forecasting of COVID-19 in India Using ARIMA Model
    Darapaneni, Narayana
    Reddy, Deepak
    Paduri, Anwesh Reddy
    Acharya, Pooja
    Nithin, H. S.
    2020 11TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2020, : 894 - 899
  • [9] Prediction of COVID-19 Data Using Improved ARIMA-LSTM Hybrid Forecast Models
    Jin, Yong-Chao
    Cao, Qian
    Wang, Ke-Nan
    Zhou, Yuan
    Cao, Yan-Peng
    Wang, Xi-Yin
    IEEE ACCESS, 2023, 11 : 67956 - 67967
  • [10] A Methodological Approach for Predicting COVID-19 Epidemic Using EEMD-ANN Hybrid Model
    Hasan, Najmul
    INTERNET OF THINGS, 2020, 11