A random forest model for forecasting regional COVID-19 cases utilizing reproduction number estimates and demographic data

被引:18
作者
Galasso, Joseph [1 ]
Cao, Duy M. [2 ]
Hochberg, Robert [3 ]
机构
[1] Univ Dallas, Dept Biol 11, Irving, TX 75062 USA
[2] Univ Dallas, Dept Comp Sci 134, Irving, TX 75062 USA
[3] Univ Dallas, Dept Comp Sci 50, Irving, TX 75062 USA
关键词
COVID-19; Random forest; Compartmental model; Mobility; US county;
D O I
10.1016/j.chaos.2021.111779
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
During the COVID-19 pandemic, predicting case spikes at the local level is important for a precise, targeted public health response and is generally done with compartmental models. The performance of compartmental models is highly dependent on the accuracy of their assumptions about disease dynamics within a population; thus, such models are susceptible to human error, unexpected events, or unknown characteristics of a novel infectious agent like COVID-19. We present a relatively non-parametric random forest model that forecasts the number of COVID-19 cases at the U.S. county level. Its most prioritized training features are derived from easily accessible, standard epidemiological data (i.e., regional test positivity rate) and the effective reproduction number ( R t ) from compartmental models. A novel input training feature is case projections generated by aligning estimated effective reproduction number (pre-computed by COVIDActNow.org) with real time testing data until maximally correlated, helping our model fit better to the epidemic's trajectory as ascertained by traditional models. Poor reliability of R t is partially mitigated with dynamic population mobility and prevalence and mortality of non-COVID-19 diseases to gauge population disease susceptibility. The model was used to generate forecasts for 1, 2, 3, and 4 weeks into the future for each reference week within 11/01/2020 - 01/10/2021 for 3068 counties. Over this time period, it maintained a mean absolute error (MAE) of less than 300 weekly cases/10 0,0 0 0 and consistently outperformed or performed comparably with gold-standard compartmental models. Furthermore, it holds great potential in ensemble modeling due to its potential for a more expansive training feature set while maintaining good performance and limited resource utilization. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:8
相关论文
共 25 条
[1]  
[Anonymous], 2021, IEEE Trans. Broadcast.
[2]   The challenges of modeling and forecasting the spread of COVID-19 [J].
Bertozzi, Andrea L. ;
Franco, Elisa ;
Mohler, George ;
Short, Martin B. ;
Sledge, Daniel .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (29) :16732-16738
[3]   Replicating and projecting the path of COVID-19 with a model-implied reproduction number [J].
Buckman, Shelby R. ;
Glick, Reuven ;
Lansing, Kevin J. ;
Petrosky-Nadeau, Nicolas ;
Seitelman, Lily M. .
INFECTIOUS DISEASE MODELLING, 2020, 5 :635-651
[4]   Random forests for genomic data analysis [J].
Chen, Xi ;
Ishwaran, Hemant .
GENOMICS, 2012, 99 (06) :323-329
[5]   Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil [J].
Dal Molin Ribeiro, Matheus Henrique ;
da Silva, Ramon Gomes ;
Mariani, Viviana Cocco ;
Coelho, Leandro dos Santos .
CHAOS SOLITONS & FRACTALS, 2020, 135
[6]   Ensemble forecast modeling for the design of COVID-19 vaccine efficacy trials [J].
Dean, Natalie E. ;
Piontti, Ana Pastore Y. ;
Madewell, Zachary J. ;
Cummings, Derek A. T. ;
Hitchings, Matthew D. T. ;
Joshi, Keya ;
Kahn, Rebecca ;
Vespignani, Alessandro ;
Halloran, M. Elizabeth ;
Longini, Ira M., Jr. .
VACCINE, 2020, 38 (46) :7213-7216
[7]   Forecasting incidence of infectious diarrhea using random forest in Jiangsu Province, China [J].
Fang, Xinyu ;
Liu, Wendong ;
Ai, Jing ;
He, Mike ;
Wu, Ying ;
Shi, Yingying ;
Shen, Wenqi ;
Bao, Changjun .
BMC INFECTIOUS DISEASES, 2020, 20 (01)
[8]  
Jo A., 2021, ONEQUIETNIGHT UNPUB
[9]   Comparison of ARIMA and Random Forest time series models for prediction of avian influenza H5N1 outbreaks [J].
Kane, Michael J. ;
Price, Natalie ;
Scotch, Matthew ;
Rabinowitz, Peter .
BMC BIOINFORMATICS, 2014, 15
[10]   Seasonal temperatures and hydrological conditions improve the prediction of West Nile virus infection rates in Culex mosquitoes and human case counts in New York and Connecticut [J].
Keyel, Alexander C. ;
Timm, Oliver Elison ;
Backenson, P. Bryon ;
Prussing, Catharine ;
Quinones, Sarah ;
McDonough, Kathleen A. ;
Vuille, Mathias ;
Conn, Jan E. ;
Armstrong, Philip M. ;
Andreadis, Theodore G. ;
Kramer, Laura D. .
PLOS ONE, 2019, 14 (06)