A Bayesian Approach to Recreational Water Quality Model Validation and Comparison in the Presence of Measurement Error

被引:7
作者
Potash, E. [1 ]
Steinschneider, S. [2 ]
机构
[1] Univ Illinois, Inst Sustainabil Energy & Environm, Urbana, IL 61801 USA
[2] Cornell Univ, Dept Biol & Environm Engn, Ithaca, NY USA
关键词
recreational water quality; measurement error; cross-validation; intermodel comparison; Bayesian; fecal indicator bacteria (FIB); ESCHERICHIA-COLI; UNCERTAINTY; ENTEROCOCCI; IMPACTS;
D O I
10.1029/2021WR031115
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Methods for measuring recreational water quality vary in analysis time, precision, availability, and cost. Decision-makers often use predictions from statistical models to compensate for the shortcomings of available measurements. However, model validation and comparison has largely omitted measurement error (defined here as variation in both sampling and the measurement technique) as an important source of uncertainty during validation. It is unknown how this omission affects estimates of model performance and comparisons between models. This study aims to fill this gap. First, we derive the bias incurred when omitting measurement error in calculating a model's mean squared error (MSE). We then develop a non-parametric validation method to correct estimates of MSE. To study other metrics of prediction performance (mean absolute error, sensitivity, precision, etc.) we develop a second validation method that uses simulations from a Bayesian validation model. These methods are applied to a comparison of two prediction models (random forest and nearest neighbor) used to predict the level of fecal indicator bacteria at nine recreational beaches in the city of Chicago. We find that accounting for measurement error significantly changes estimates of model performance. Moreover, it reveals substantial uncertainty underlying some of these estimates.
引用
收藏
页数:14
相关论文
共 44 条
[1]   Hydrological validation of a radar-based nowcasting technique [J].
Berenguer, M ;
Corral, C ;
Sánchez-Diezma, R ;
Sempere-Torres, D .
JOURNAL OF HYDROMETEOROLOGY, 2005, 6 (04) :532-549
[2]   Validation of hydrological models: Conceptual basis, methodological approaches and a proposal for a code of practice [J].
Biondi, Daniela ;
Freni, Gabriele ;
Iacobellis, Vito ;
Mascaro, Giuseppe ;
Montanari, Alberto .
PHYSICS AND CHEMISTRY OF THE EARTH, 2012, 42-44 :70-76
[3]   Enterococci concentrations in diverse coastal environments exhibit extreme variability [J].
Boehm, Alexandria B. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2007, 41 (24) :8227-8232
[4]  
Breiman L., 2001, Machine Learning, V45, P5
[5]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[6]   Phosphorus load estimation in the Saginaw River, MI using a Bayesian hierarchical/multilevel model [J].
Cha, YoonKyung ;
Stow, Craig A. ;
Reckhow, Kenneth H. ;
DeMarchi, Carlo ;
Johengen, Thomas H. .
WATER RESEARCH, 2010, 44 (10) :3270-3282
[7]   BART: BAYESIAN ADDITIVE REGRESSION TREES [J].
Chipman, Hugh A. ;
George, Edward I. ;
McCulloch, Robert E. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :266-298
[8]  
Dawson CW, 2001, PROG PHYS GEOG, V25, P80, DOI 10.1191/030913301674775671
[9]   Monitoring urban beaches with qPCR vs. culture measures of fecal indicator bacteria: Implications for public notification [J].
Dorevitch, Samuel ;
Shrestha, Abhilasha ;
DeFlorio-Barker, Stephanie ;
Breitenbach, Cathy ;
Heimler, Ira .
ENVIRONMENTAL HEALTH, 2017, 16 :45
[10]   Impacts of measured data uncertainty on urban stormwater models [J].
Dotto, C. B. S. ;
Kleidorfer, M. ;
Deletic, A. ;
Rauch, W. ;
McCarthy, D. T. .
JOURNAL OF HYDROLOGY, 2014, 508 :28-42