A hybrid machine learning model to predict and visualize nitrate concentration throughout the Central Valley aquifer, California, USA

Cited by: 134
Authors
Ransom, Katherine M. [1]
Nolan, Bernard T. [2]
Traum, Jonathan A. [3]
Faunt, Claudia C. [4]
Bell, Andrew M. [5]
Gronberg, Jo Ann M. [6]
Wheeler, David C. [7]
Rosecrans, Celia Z. [3]
Jurgens, Bryant [3]
Schwarz, Gregory E. [2]
Belitz, Kenneth [8]
Eberts, Sandra M. [9]
Kourakos, George [1]
Harter, Thomas [1]
Affiliations
[1] Univ Calif Davis, Dept Land Air & Water Resources, Davis, CA 95616 USA
[2] US Geol Survey, Natl Water Qual Program, 959 Natl Ctr, Reston, VA 22092 USA
[3] US Geol Survey, Calif Water Sci Ctr, Sacramento, CA USA
[4] US Geol Survey, Calif Water Sci Ctr, San Diego, CA USA
[5] Univ Calif Davis, Ctr Watershed Sci, Davis, CA 95616 USA
[6] US Geol Survey, Calif Water Sci Ctr, 345 Middlefield Rd, Menlo Pk, CA 94025 USA
[7] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA USA
[8] US Geol Survey, New England Water Sci Ctr, Northborough, MA USA
[9] US Geol Survey, Ohio Water Sci Ctr, Columbus, OH USA
Keywords
Groundwater; Nitrate; Boosted regression trees; Machine learning; Modeling; Drinking-water wells; Groundwater age; Distributions; Population; Quality
DOI
10.1016/j.scitotenv.2017.05.192
Chinese Library Classification number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Intense demand for water in the Central Valley of California and related increases in groundwater nitrate concentration threaten the sustainability of the groundwater resource. To assess contamination risk in the region, we developed a hybrid, non-linear, machine learning model within a statistical learning framework to predict nitrate contamination of groundwater to depths of approximately 500 m below ground surface. A database of 145 predictor variables representing well characteristics, historical and current field and landscape-scale nitrogen mass balances, historical and current land use, oxidation/reduction conditions, groundwater flow, climate, soil characteristics, depth to groundwater, and groundwater age was assigned to over 6000 private supply and public supply wells measured previously for nitrate and located throughout the study area. The boosted regression tree (BRT) method was used to screen and rank variables to predict nitrate concentration at the depths of domestic and public well supplies. The novel approach included, as predictor variables, outputs from existing physically based models of the Central Valley. The top five most important predictor variables included two oxidation/reduction variables (probability of manganese concentration exceeding 50 ppb and probability of dissolved oxygen concentration being below 0.5 ppm), field-scale adjusted unsaturated zone nitrogen input for the 1975 time period, average difference between precipitation and evapotranspiration during the years 1971-2000, and 1992 total landscape nitrogen input. Twenty-five variables were selected for the final model for log-transformed nitrate. In general, increasing probability of anoxic conditions and increasing precipitation relative to potential evapotranspiration corresponded to a decrease in nitrate concentration predictions. Conversely, increasing 1975 unsaturated zone nitrogen leaching flux and 1992 total landscape nitrogen input had an increasing relative impact on nitrate predictions. Three-dimensional visualization indicates that nitrate predictions depend on the probability of anoxic conditions and other factors, and that nitrate predictions generally decreased with increasing groundwater age. © 2017 The Authors. Published by Elsevier B.V.
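The abstract describes using boosted regression trees to rank predictor variables for log-transformed nitrate. The following is a minimal sketch of that general idea, not the authors' implementation: it boosts one-split regression trees (stumps) on synthetic data and ranks predictors by their share of the total squared-error reduction. The predictor names (p_anoxic, n_input, noise), the coefficients, the sample size, and the tuning values are all hypothetical and chosen only for illustration.

```python
import math
import random

def fit_stump(X, residuals):
    """Return the one-feature threshold split with the largest squared-error reduction."""
    n, p = len(X), len(X[0])
    total = sum(residuals)
    best = None  # (gain, feature, threshold, left_mean, right_mean)
    for j in range(p):
        order = sorted(range(n), key=lambda i: X[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += residuals[order[k - 1]]
            lo, hi = X[order[k - 1]][j], X[order[k]][j]
            if lo == hi:
                continue  # cannot split between identical values
            right_sum = total - left_sum
            # Reduction in sum of squared errors achieved by this split.
            gain = left_sum ** 2 / k + right_sum ** 2 / (n - k) - total ** 2 / n
            if best is None or gain > best[0]:
                best = (gain, j, (lo + hi) / 2, left_sum / k, right_sum / (n - k))
    return best

def boost(X, y, n_trees=150, learning_rate=0.1):
    """Gradient boosting with squared loss; returns the model and relative importances (%)."""
    n, p = len(X), len(X[0])
    intercept = sum(y) / n
    pred = [intercept] * n
    gains = [0.0] * p
    stumps = []
    for _ in range(n_trees):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(X, residuals)
        if stump is None:
            break
        gain, j, thr, left, right = stump
        gains[j] += gain
        # Store leaf values already shrunk by the learning rate.
        stumps.append((j, thr, learning_rate * left, learning_rate * right))
        for i in range(n):
            pred[i] += stumps[-1][2] if X[i][j] <= thr else stumps[-1][3]
    total_gain = sum(gains)
    return (intercept, stumps), [100.0 * g / total_gain for g in gains]

def predict(model, x):
    """Predict the (log-scale) response for one row of predictor values."""
    intercept, stumps = model
    return intercept + sum(l if x[j] <= thr else r for j, thr, l, r in stumps)

# Synthetic, hypothetical data: nitrate falls as the probability of anoxic
# conditions rises, and rises with nitrogen input; "noise" is irrelevant.
rng = random.Random(0)
names = ["p_anoxic", "n_input", "noise"]
X = [[rng.random(), 100.0 * rng.random(), rng.random()] for _ in range(300)]
nitrate = [math.exp(1.0 - 2.0 * a + 0.01 * n + rng.gauss(0.0, 0.2)) for a, n, _ in X]
y = [math.log(c) for c in nitrate]  # model the log-transformed concentration

model, importance = boost(X, y)
for name, score in sorted(zip(names, importance), key=lambda t: -t[1]):
    print(f"{name}: {score:.1f}% of total error reduction")
```

In this sketch, variable screening would amount to dropping predictors with a small importance share and refitting; a real application would use deeper trees, cross-validated tuning, and held-out evaluation.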
Pages: 1160-1172
Number of pages: 13