An Adaptable LSTM Network Predicting COVID-19 Occurrence Using Time Series Data

被引:3
作者
Li, Anthony [1 ]
Yadav, Nikhil [2 ]
机构
[1] Bergen Cty Acad, Acad Technol & Comp Sci, Hackensack, NJ 07601 USA
[2] St Johns Univ, Div Comp Sci Math & Sci, Queens, NY USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (ICDH 2021) | 2021年
关键词
Deep learning; supervised learning; LSTM; COVID-19; digital health; time-series prediction; classification;
D O I
10.1109/ICDH52753.2021.00031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the COVID-19 pandemic progresses, it has become critical for policymakers and medical officials to understand how cases are trending. Machine learning models, particularly deep learning LSTM (Long Short-Term Memory) models, may hold immense value to forecast changes in COVID-19 cases. In this paper, a novel LSTM-based architecture is proposed, developed and trained on human logistics data that includes travel patterns, visits to commercial properties, as well as historical cases, demographic, and climate data. This data includes both time series and static data allowing the LSTM to be used in both classification and regression tasks to predict COVID-19 occurrence trends. For classification, the problem is modeled as a multiclass supervised learning classification problem with varying granularity. The proposed LSTM network achieves an 81.0% F1-score outperforming conventional machine learning model benchmarks (such as the random forest model with an F1 score of 58.9%) and is comparable in performance to a time series forest model. Additionally, the LSTM model is adaptable to perform regression and predict a 14-day sliding window based on currently observed data with a mean absolute error of 0.0026. This research serves as a foundation for future work in the forecasting of COVID-19 and other similar disease outbreaks using similar temporal and static data.
引用
收藏
页码:172 / 177
页数:6
相关论文
共 21 条
[1]  
Achterberg M. A., INTERNAL J FORECASTI
[2]  
Agency for Toxic Substances and Disease Registsry, 2018, CDC SOC VULN IND
[3]  
[Anonymous], 2020, Safegraph Social Distancing Metrics
[4]  
[Anonymous], 2020, NEW YORK TIMES
[5]  
[Anonymous], 2018, State Area Measurements and Internal Point Coordinates. The United States Census Bureau
[6]   Utilization of machine-learning models to accurately predict the risk for critical COVID-19 [J].
Assaf, Dan ;
Gutman, Ya'ara ;
Neuman, Yair ;
Segal, Gad ;
Amit, Sharon ;
Gefen-Halevi, Shiraz ;
Shilo, Noya ;
Epstein, Avi ;
Mor-Cohen, Ronit ;
Biber, Asaf ;
Rahav, Galia ;
Levy, Itzchak ;
Tirosh, Amit .
INTERNAL AND EMERGENCY MEDICINE, 2020, 15 (08) :1435-1443
[7]   Time series forecasting of COVID-19 transmission in Canada using LSTM networks [J].
Chimmula, Vinay Kumar Reddy ;
Zhang, Lei .
CHAOS SOLITONS & FRACTALS, 2020, 135
[8]  
Deng H., 2013, A Time Series Forest for Classification and Feature Extraction
[9]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]  
Institute for Health Metrics and Evaluation, 2020, COVID 19 PROJ