A data-driven approach to identifying PFAS water sampling priorities in Colorado, United States

Cited by: 2
Authors
Barton, Kelsey E. [1, 2]
Anthamatten, Peter J. [3]
Adgate, John L. [2]
McKenzie, Lisa M. [2]
Starling, Anne P. [4]
Berg, Kevin [1]
Murphy, Robert C. [5]
Richardson, Kristy [1]
Affiliations
[1] Toxicol & Environm Epidemiol Off, Colorado Dept Publ Hlth & Environm, Denver, CO 80246 USA
[2] Univ Colorado, Colorado Sch Publ Hlth, Dept Environm & Occupat Hlth, Anschutz Med Campus, Aurora, CO 80045 USA
[3] Univ Colorado Denver, Dept Geog & Environm Sci, Denver, CO USA
[4] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC USA
[5] Colorado Dept Publ Hlth & Environm, Source Water Assessment & Protect Program, Denver, CO USA
Keywords
Perfluorinated chemicals; Emerging contaminants; Environmental monitoring; Geospatial analyses; Vulnerable populations; POLYFLUOROALKYL SUBSTANCES; ACCESS; HEALTH; RISK; FATE;
DOI
10.1038/s41370-024-00705-7
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Background: Per- and polyfluoroalkyl substances (PFAS), a class of environmentally and biologically persistent chemicals, have been used across many industries since the middle of the 20th century. Some PFAS have been linked to adverse health effects.
Objective: Our objective was to incorporate known and potential PFAS sources, physical characteristics of the environment, and existing PFAS water sampling results into a PFAS risk prediction map that may be used to develop a PFAS water sampling prioritization plan for the Colorado Department of Public Health and Environment (CDPHE).
Methods: We used random forest classification to develop a predictive surface of potential groundwater contamination from two PFAS, perfluorooctane sulfonate (PFOS) and perfluorooctanoate (PFOA). The model predicted PFAS risk at locations without sampling data into one of three risk categories after being "trained" with existing PFAS water sampling data. We used prediction results, variable importance ranking, and population characteristics to develop recommendations for sampling prioritization.
Results: Sensitivity and precision ranged from 58% to 90% in the final models, depending on the risk category. The model and prioritization approach identified private wells in specific census blocks, as well as schools, mobile home parks, and public water systems that rely on groundwater as priority sampling locations. We also identified data gaps including areas of the state with limited sampling and potential source types that need further investigation.
Impact statement: This work uses random forest classification to predict the risk of groundwater contamination from two per- and polyfluoroalkyl substances (PFAS) across the state of Colorado, United States. We developed the prediction model using data on known and potential PFAS sources and physical characteristics of the environment, and "trained" the model using existing PFAS water sampling results. This data-driven approach identifies opportunities for PFAS water sampling prioritization as well as information gaps that, if filled, could improve model predictions. This work provides decision-makers information to effectively use limited resources towards protection of populations most susceptible to the impacts of PFAS exposure.
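The abstract reports sensitivity and precision separately for each risk category. As a minimal sketch of how such per-category figures can be computed for a three-class classifier, the Python below tallies them from paired actual and predicted labels. The category names ("low", "medium", "high"), the function name, and the ten toy labels are assumptions made for illustration; they are not the study's categories, code, or data.

```python
from collections import Counter

# Hypothetical category names; the abstract only says "three risk categories".
CATEGORIES = ["low", "medium", "high"]


def per_class_metrics(actual, predicted, categories):
    """Return {category: (sensitivity, precision)} for paired label lists.

    sensitivity = true positives / all samples that truly belong to the class
    precision   = true positives / all samples predicted as the class
    """
    pairs = Counter(zip(actual, predicted))
    metrics = {}
    for c in categories:
        true_pos = pairs[(c, c)]
        n_actual = sum(n for (a, _), n in pairs.items() if a == c)
        n_predicted = sum(n for (_, p), n in pairs.items() if p == c)
        sensitivity = true_pos / n_actual if n_actual else float("nan")
        precision = true_pos / n_predicted if n_predicted else float("nan")
        metrics[c] = (sensitivity, precision)
    return metrics


# Toy labels invented for this example (not results from the paper).
actual = ["low", "low", "low", "low", "medium", "medium", "medium",
          "high", "high", "high"]
predicted = ["low", "low", "low", "medium", "medium", "medium", "high",
             "high", "high", "low"]

metrics = per_class_metrics(actual, predicted, CATEGORIES)
for category in CATEGORIES:
    sens, prec = metrics[category]
    print(f"{category}: sensitivity={sens:.2f}, precision={prec:.2f}")
```

Reporting the two measures per category, rather than a single overall accuracy, shows whether a model misses truly high-risk locations (low sensitivity) or over-flags locations as high risk (low precision), which matters when sampling resources are limited.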
Pages: 11