Modeling groundwater nitrate concentrations in private wells in Iowa

Cited by: 116
Authors
Wheeler, David C. [1]
Nolan, Bernard T. [2]
Flory, Abigail R. [3]
DellaValle, Curt T. [4]
Ward, Mary H. [4]
Affiliations
[1] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA 23298 USA
[2] US Geol Survey, Reston, VA 22092 USA
[3] Westat Corp, Rockville, MD USA
[4] NCI, Occupat & Environm Epidemiol Branch, Div Canc Epidemiol & Genet, Rockville, MD USA
Keywords
Nitrate; Groundwater contamination; Random forest; UNITED-STATES; VULNERABILITY; WATER; POLLUTION; DRAINAGE
DOI
10.1016/j.scitotenv.2015.07.080
Chinese Library Classification
X [Environmental science, safety science]
Subject classification codes
08; 0830
Abstract
Contamination of drinking water by nitrate is a growing problem in many agricultural areas of the country. Ingested nitrate can lead to the endogenous formation of N-nitroso compounds, which are potent carcinogens. We developed a predictive model for nitrate concentrations in private wells in Iowa. Using 34,084 measurements of nitrate in private wells, we trained and tested random forest models to predict log nitrate levels by systematically assessing the predictive performance of 179 variables in 36 thematic groups (well depth, distance to sinkholes, location, land use, soil characteristics, nitrogen inputs, meteorology, and other factors). The final model contained 66 variables in 17 groups. Some of the most important variables were well depth, slope length within 1 km of the well, year of sample, and distance to nearest animal feeding operation. The correlation between observed and estimated nitrate concentrations was excellent in the training set (r-square = 0.77) and was acceptable in the testing set (r-square = 0.38). The random forest model had substantially better predictive performance than a traditional linear regression model or a regression tree. Our model will be used to investigate the association between nitrate levels in drinking water and cancer risk in the Iowa participants of the Agricultural Health Study cohort. © 2015 Elsevier B.V. All rights reserved.
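The r-square figures quoted in the abstract describe the agreement between observed and estimated values. As a minimal sketch of how such a figure could be computed, assuming it is the squared Pearson correlation on log-transformed concentrations, the Python below uses only the standard library. The function names `log_transform` and `r_squared`, the offset of 1.0 added before taking logs, and the sample values are all illustrative assumptions and do not come from the paper.

```python
import math


def log_transform(concentrations, offset=1.0):
    """Natural log of each concentration.

    The offset is an assumption made here so that zero values are defined;
    the abstract does not say how zeros were handled.
    """
    return [math.log(c + offset) for c in concentrations]


def r_squared(observed, predicted):
    """Squared Pearson correlation between observed and predicted values."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov * cov / (var_o * var_p)


# Made-up nitrate values for illustration only (not data from the study).
observed = log_transform([0.5, 2.0, 8.0, 15.0, 1.0])
predicted = log_transform([0.8, 1.5, 6.0, 10.0, 2.0])
print(round(r_squared(observed, predicted), 2))
```

Computing the same statistic separately on the training and testing sets is what exposes a gap like the one reported (0.77 versus 0.38), since the testing set contains wells the model did not see during fitting.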
Pages: 481-488
Page count: 8