Application of machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance

被引:25
作者
Ai, Yuehan [1 ]
He, Fan [1 ]
Lancaster, Emma [2 ,3 ]
Lee, Jiyoung [1 ,2 ,4 ]
机构
[1] Ohio State Univ, Dept Food Sci & Technol, Columbus, OH 43210 USA
[2] Ohio State Univ, Coll Publ Hlth, Div Environm Hlth Sci, Columbus, OH 43210 USA
[3] Ohio State Univ, Environm Sci Grad Program, Columbus, OH 43210 USA
[4] Ohio State Univ, Infect Dis Inst, Columbus, OH 43210 USA
关键词
D O I
10.1371/journal.pone.0277154
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The potential of wastewater-based epidemiology (WBE) as a surveillance and early warning tool for the COVID-19 outbreak has been demonstrated. For areas with limited testing capacity, wastewater surveillance can provide information on the disease dynamic at a community level. A predictive model is a key to generating quantitative estimates of the infected population. Modeling longitudinal wastewater data can be challenging as biomarkers in wastewater are susceptible to variations caused by multiple factors associated with the wastewater matrix and the sewersheds characteristics. As WBE is an emerging trend, the model should be able to address the uncertainties of wastewater from different sewersheds. We proposed exploiting machine learning and deep learning techniques, which are supported by the growing WBE data. In this article, we reviewed the existing predictive models, among which the emerging machine learning/deep learning models showed great potential. However, most models are built for individual sewersheds with few features extracted from the wastewater. To fulfill the research gap, we compared different time-series and non-time-series models for their short-term predictive performance of COVID-19 cases in 9 diverse sewersheds. The time-series models, long short-term memory (LSTM) and Prophet, outcompeted the non-time-series models. Besides viral (SARS-CoV-2) loads and location identity, domain-specific features like biochemical parameters of wastewater, geographical parameters of the sewersheds, and some socioeconomic parameters of the communities can contribute to the models. With proper feature engineering and hyperparameter tuning, we believe machine learning models like LSTM can be a feasible solution for the COVID-19 trend prediction via WBE. Overall, this is a proof-of-concept study on the application of machine learning in COVID-19 WBE. Future studies are needed to deploy and maintain the model in more real-world applications.
引用
收藏
页数:12
相关论文
共 33 条
[1]   Viral outbreaks detection and surveillance using wastewater-based epidemiology, viral air sampling, and machine learning techniques: A comprehensive review and outlook [J].
Abdeldayem, M. Omar ;
Dabbish, M. Areeg ;
Habashy, M. Mahmoud ;
Mostafa, K. Mohamed ;
Elhefnawy, Mohamed ;
Amin, Lobna ;
Al-Sakkari, G. Eslam ;
Ragab, Ahmed ;
Rene, R. Eldon .
SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 803
[2]   First con firmed detection of SARS-CoV-2 in untreated wastewater in Australia: A proof of concept for the wastewater surveillance of COVID-19 in the community [J].
Ahmed, Warish ;
Angel, Nicola ;
Edson, Janette ;
Bibby, Kyle ;
Bivins, Aaron ;
O'Brien, Jake W. ;
Choi, Phil M. ;
Kitajima, Masaaki ;
Simpson, Stuart L. ;
Li, Jiaying ;
Tscharke, Ben ;
Verhagen, Rory ;
Smith, Wendy J. M. ;
Zaugg, Julian ;
Dierens, Leanne ;
Hugenholtz, Philip ;
Thomas, Kevin, V ;
Mueller, Jochen F. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 728
[3]   Wastewater SARS-CoV-2 monitoring as a community-level COVID-19 trend tracker and variants in Ohio, United States [J].
Ai, Yuehan ;
Davis, Angela ;
Jones, Dan ;
Lemeshow, Stanley ;
Tu, Huolin ;
He, Fan ;
Ru, Peng ;
Pan, Xiaokang ;
Bohrerova, Zuzana ;
Lee, Jiyoung .
SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 801
[4]   Making waves: Plausible lead time for wastewater based epidemiology as an early warning system for COVID-19 [J].
Bibby, Kyle ;
Bivins, Aaron ;
Wu, Zhenyu ;
North, Devin .
WATER RESEARCH, 2021, 202
[5]   Wastewater-Based Epidemiology: Global Collaborative to Maximize Contributions in the Fight Against COVID-19 [J].
Bivins, Aaron ;
North, Devin ;
Ahmad, Arslan ;
Ahmed, Warish ;
Alm, Eric ;
Been, Frederic ;
Bhattacharya, Prosun ;
Bijlsma, Lubertus ;
Boehm, Alexandria B. ;
Brown, Joe ;
Buttiglieri, Gianluigi ;
Calabro, Vincenza ;
Carducci, Annalaura ;
Castiglioni, Sara ;
Gurol, Zeynep Cetecioglu ;
Chakraborty, Sudip ;
Costa, Federico ;
Curcio, Stefano ;
de los Reyes, Francis L., III ;
Vela, Jeseth Delgado ;
Farkas, Kata ;
Fernandez-Casi, Xavier ;
Gerba, Charles ;
Gerrity, Daniel ;
Girones, Rosina ;
Gonzalez, Raul ;
Haramoto, Eiji ;
Harris, Angela ;
Holden, Patricia A. ;
Islam, Md. Tahmidul ;
Jones, Davey L. ;
Kasprzyk-Hordern, Barbara ;
Kitajima, Masaaki ;
Kotlarz, Nadine ;
Kumar, Manish ;
Kuroda, Keisuke ;
La Rosa, Giuseppina ;
Malpei, Francesca ;
Mautus, Mariana ;
McLellan, Sandra L. ;
Medema, Gertjan ;
Meschke, John Scott ;
Mueller, Jochen ;
Newton, Ryan J. ;
Nilsson, David ;
Noble, Rachel T. ;
van Nuijs, Alexander ;
Peccia, Jordan ;
Perkins, T. Alex ;
Pickering, Amy J. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (13) :7754-7757
[6]   On forecasting the community-level COVID-19 cases from the concentration of SARS-CoV-2 in wastewater [J].
Cao, Yongtao ;
Francis, Roland .
SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 786
[7]  
Centers for Disease Control and Prevention/ Agency for Toxic Substances and Disease Registry/ Geospatial Research Analysis and Services Program, 2021, CDC ATSDR SVI DAT DO
[8]  
Chandra R., 2021, arXiv
[9]   Wastewater surveillance for population -wide Covid-19: The present and future [J].
Daughton, Christian G. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 736
[10]   Stochastic gradient boosting [J].
Friedman, JH .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) :367-378