A data-driven approach to identifying PFAS water sampling priorities in Colorado, United States

被引：2

作者：

Barton, Kelsey E. ^{[1
,2
]}

Anthamatten, Peter J. ^{[3
]}

Adgate, John L. ^{[2
]}

McKenzie, Lisa M. ^{[2
]}

Starling, Anne P. ^{[4
]}

Berg, Kevin ^{[1
]}

Murphy, Robert C. ^{[5
]}

Richardson, Kristy ^{[1
]}

机构：

[1] Toxicol & Environm Epidemiol Off, Colorado Dept Publ Hlth & Environm, Denver, CO 80246 USA

[2] Univ Colorado, Colorado Sch Publ Hlth, Dept Environm & Occupat Hlth, Anschutz Med Campus, Aurora, CO 80045 USA

[3] Univ Colorado Denver, Dept Geog & Environm Sci, Denver, CO USA

[4] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC USA

[5] Colorado Dept Publ Hlth & Environm, Source Water Assessment & Protect Program, Denver, CO USA

来源：

JOURNAL OF EXPOSURE SCIENCE AND ENVIRONMENTAL EPIDEMIOLOGY | 2024年

关键词：

Perfluorinated chemicals; Emerging contaminants; Environmental monitoring; Geospatial analyses; Vulnerable populations; POLYFLUOROALKYL SUBSTANCES; ACCESS; HEALTH; RISK; FATE;

D O I：

10.1038/s41370-024-00705-7

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

BackgroundPer and polyfluoroalkyl substances (PFAS), a class of environmentally and biologically persistent chemicals, have been used across many industries since the middle of the 20th century. Some PFAS have been linked to adverse health effects.ObjectiveOur objective was to incorporate known and potential PFAS sources, physical characteristics of the environment, and existing PFAS water sampling results into a PFAS risk prediction map that may be used to develop a PFAS water sampling prioritization plan for the Colorado Department of Public Health and Environment (CDPHE).MethodsWe used random forest classification to develop a predictive surface of potential groundwater contamination from two PFAS, perfluorooctane sulfonate (PFOS) and perfluorooctanoate (PFOA). The model predicted PFAS risk at locations without sampling data into one of three risk categories after being "trained" with existing PFAS water sampling data. We used prediction results, variable importance ranking, and population characteristics to develop recommendations for sampling prioritization.ResultsSensitivity and precision ranged from 58% to 90% in the final models, depending on the risk category. The model and prioritization approach identified private wells in specific census blocks, as well as schools, mobile home parks, and public water systems that rely on groundwater as priority sampling locations. We also identified data gaps including areas of the state with limited sampling and potential source types that need further investigation.Impact statementThis work uses random forest classification to predict the risk of groundwater contamination from two per- and polyfluoroalkyl substances (PFAS) across the state of Colorado, United States. We developed the prediction model using data on known and potential PFAS sources and physical characteristics of the environment, and "trained" the model using existing PFAS water sampling results. This data-driven approach identifies opportunities for PFAS water sampling prioritization as well as information gaps that, if filled, could improve model predictions. This work provides decision-makers information to effectively use limited resources towards protection of populations most susceptible to the impacts of PFAS exposure.

引用

页数：11

共 50 条

[41] A Novel Data-Driven Approach for Predicting the Performance Degradation of a Gas Turbine
Dai, Shun
Zhang, Xiaoyi
Luo, Mingyu
ENERGIES, 2024, 17 (04)
[42] A data-driven approach to estimating dockless electric scooter service areas
Karimpour, Abolfazl
Hosseinzadeh, Aryan
Kluger, Robert
JOURNAL OF TRANSPORT GEOGRAPHY, 2023, 109
[43] Bayesian Data-Driven approach enhances synthetic flood loss models
Sairam, Nivedita
Schroeter, Kai
Carisi, Francesca
Wagenaar, Dennis
Domeneghetti, Alessio
Molinari, Daniela
Brill, Fabio
Priest, Sally
Viavattene, Christophe
Merz, Bruno
Kreibich, Heidi
ENVIRONMENTAL MODELLING & SOFTWARE, 2020, 132
[44] Estimating a large drive time matrix between ZIP codes in the United States: A differential sampling approach
Hu, Yujie
Wang, Changzhen
Li, Ruiyang
Wang, Fahui
JOURNAL OF TRANSPORT GEOGRAPHY, 2020, 86
[45] A single weighting approach to analyze respondent-driven sampling data
Selvaraj, Vadivoo
Boopathi, Kangusamy
Paranjape, Ramesh
Mehendale, Sanjay
INDIAN JOURNAL OF MEDICAL RESEARCH, 2016, 144 : 447 - 459
[46] Identifying Homeless Youth At-Risk of Substance Use Disorder: Data-Driven Insights for Policymakers
Tabar, Maryam
Park, Heesoo
Winkler, Stephanie
Lee, Dongwon
Barman-Adhikari, Anamika
Yadav, Amulya
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3092 - 3100
[47] Data-Driven Cervical Cancer Prediction Model with Outlier Detection and Over-Sampling Methods
Ijaz, Muhammad Fazal
Attique, Muhammad
Son, Youngdoo
SENSORS, 2020, 20 (10)
[48] Multidimensional Population Health Modeling: A Data-Driven Multivariate Statistical Learning Approach
Wei, Zhiyuan
Narin, Adil Baran
Mukherjee, Sayanti
IEEE ACCESS, 2022, 10 : 22737 - 22755
[49] Integrating Macroeconomic and Technical Indicators into Forecasting the Stock Market: A Data-Driven Approach
Latif, Saima
Aslam, Faheem
Ferreira, Paulo
Iqbal, Sohail
ECONOMIES, 2025, 13 (01)
[50] Household financial health: a machine learning approach for data-driven diagnosis and prescription
Kim, Kyeongbin
Hwang, Yoontae
Lim, Dongcheol
Kim, Suhyeon
Lee, Junghye
Lee, Yongjae
QUANTITATIVE FINANCE, 2023, 23 (11) : 1565 - 1595

← 1 2 3 4 5 →