Application of machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance

被引:21
作者
Ai, Yuehan [1 ]
He, Fan [1 ]
Lancaster, Emma [2 ,3 ]
Lee, Jiyoung [1 ,2 ,4 ]
机构
[1] Ohio State Univ, Dept Food Sci & Technol, Columbus, OH 43210 USA
[2] Ohio State Univ, Coll Publ Hlth, Div Environm Hlth Sci, Columbus, OH 43210 USA
[3] Ohio State Univ, Environm Sci Grad Program, Columbus, OH 43210 USA
[4] Ohio State Univ, Infect Dis Inst, Columbus, OH 43210 USA
来源
PLOS ONE | 2022年 / 17卷 / 11期
关键词
D O I
10.1371/journal.pone.0277154
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The potential of wastewater-based epidemiology (WBE) as a surveillance and early warning tool for the COVID-19 outbreak has been demonstrated. For areas with limited testing capacity, wastewater surveillance can provide information on the disease dynamic at a community level. A predictive model is a key to generating quantitative estimates of the infected population. Modeling longitudinal wastewater data can be challenging as biomarkers in wastewater are susceptible to variations caused by multiple factors associated with the wastewater matrix and the sewersheds characteristics. As WBE is an emerging trend, the model should be able to address the uncertainties of wastewater from different sewersheds. We proposed exploiting machine learning and deep learning techniques, which are supported by the growing WBE data. In this article, we reviewed the existing predictive models, among which the emerging machine learning/deep learning models showed great potential. However, most models are built for individual sewersheds with few features extracted from the wastewater. To fulfill the research gap, we compared different time-series and non-time-series models for their short-term predictive performance of COVID-19 cases in 9 diverse sewersheds. The time-series models, long short-term memory (LSTM) and Prophet, outcompeted the non-time-series models. Besides viral (SARS-CoV-2) loads and location identity, domain-specific features like biochemical parameters of wastewater, geographical parameters of the sewersheds, and some socioeconomic parameters of the communities can contribute to the models. With proper feature engineering and hyperparameter tuning, we believe machine learning models like LSTM can be a feasible solution for the COVID-19 trend prediction via WBE. Overall, this is a proof-of-concept study on the application of machine learning in COVID-19 WBE. Future studies are needed to deploy and maintain the model in more real-world applications.
引用
收藏
页数:12
相关论文
共 33 条
  • [1] Viral outbreaks detection and surveillance using wastewater-based epidemiology, viral air sampling, and machine learning techniques: A comprehensive review and outlook
    Abdeldayem, M. Omar
    Dabbish, M. Areeg
    Habashy, M. Mahmoud
    Mostafa, K. Mohamed
    Elhefnawy, Mohamed
    Amin, Lobna
    Al-Sakkari, G. Eslam
    Ragab, Ahmed
    Rene, R. Eldon
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 803
  • [2] First con firmed detection of SARS-CoV-2 in untreated wastewater in Australia: A proof of concept for the wastewater surveillance of COVID-19 in the community
    Ahmed, Warish
    Angel, Nicola
    Edson, Janette
    Bibby, Kyle
    Bivins, Aaron
    O'Brien, Jake W.
    Choi, Phil M.
    Kitajima, Masaaki
    Simpson, Stuart L.
    Li, Jiaying
    Tscharke, Ben
    Verhagen, Rory
    Smith, Wendy J. M.
    Zaugg, Julian
    Dierens, Leanne
    Hugenholtz, Philip
    Thomas, Kevin, V
    Mueller, Jochen F.
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 728
  • [3] Wastewater SARS-CoV-2 monitoring as a community-level COVID-19 trend tracker and variants in Ohio, United States
    Ai, Yuehan
    Davis, Angela
    Jones, Dan
    Lemeshow, Stanley
    Tu, Huolin
    He, Fan
    Ru, Peng
    Pan, Xiaokang
    Bohrerova, Zuzana
    Lee, Jiyoung
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 801
  • [4] Making waves: Plausible lead time for wastewater based epidemiology as an early warning system for COVID-19
    Bibby, Kyle
    Bivins, Aaron
    Wu, Zhenyu
    North, Devin
    [J]. WATER RESEARCH, 2021, 202
  • [5] Wastewater-Based Epidemiology: Global Collaborative to Maximize Contributions in the Fight Against COVID-19
    Bivins, Aaron
    North, Devin
    Ahmad, Arslan
    Ahmed, Warish
    Alm, Eric
    Been, Frederic
    Bhattacharya, Prosun
    Bijlsma, Lubertus
    Boehm, Alexandria B.
    Brown, Joe
    Buttiglieri, Gianluigi
    Calabro, Vincenza
    Carducci, Annalaura
    Castiglioni, Sara
    Gurol, Zeynep Cetecioglu
    Chakraborty, Sudip
    Costa, Federico
    Curcio, Stefano
    de los Reyes, Francis L., III
    Vela, Jeseth Delgado
    Farkas, Kata
    Fernandez-Casi, Xavier
    Gerba, Charles
    Gerrity, Daniel
    Girones, Rosina
    Gonzalez, Raul
    Haramoto, Eiji
    Harris, Angela
    Holden, Patricia A.
    Islam, Md. Tahmidul
    Jones, Davey L.
    Kasprzyk-Hordern, Barbara
    Kitajima, Masaaki
    Kotlarz, Nadine
    Kumar, Manish
    Kuroda, Keisuke
    La Rosa, Giuseppina
    Malpei, Francesca
    Mautus, Mariana
    McLellan, Sandra L.
    Medema, Gertjan
    Meschke, John Scott
    Mueller, Jochen
    Newton, Ryan J.
    Nilsson, David
    Noble, Rachel T.
    van Nuijs, Alexander
    Peccia, Jordan
    Perkins, T. Alex
    Pickering, Amy J.
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (13) : 7754 - 7757
  • [6] On forecasting the community-level COVID-19 cases from the concentration of SARS-CoV-2 in wastewater
    Cao, Yongtao
    Francis, Roland
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 786
  • [7] Centers for Disease Control and Prevention/ Agency for Toxic Substances and Disease Registry/ Geospatial Research Analysis and Services Program, 2021, CDC ATSDR SVI DAT DO
  • [8] Chandra R., 2021, arXiv
  • [9] Wastewater surveillance for population -wide Covid-19: The present and future
    Daughton, Christian G.
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 736
  • [10] Stochastic gradient boosting
    Friedman, JH
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) : 367 - 378