Integration of Per- and Polyfluoroalkyl Substance (PFAS) Fingerprints in Fish with Machine Learning for PFAS Source Tracking in Surface Water

被引:6
作者
Stults, John F. F. [3 ]
Higgins, Christopher P. P. [1 ]
Helbling, Damian E. E. [2 ]
机构
[1] Colorado Sch Mines, Dept Civil & Environm Engn, Golden, CO 80401 USA
[2] Cornell Univ, Sch Civil & Environm Engn, Ithaca, NY 14853 USA
[3] CDM Smith, Edison, NJ 08837 USA
关键词
PFAS; source tracking; bioaccumulation; supervised machine learning; optimization; featureselection; PERFLUOROALKYL SUBSTANCES; SOURCE ALLOCATION; WASTE-WATER; GROUNDWATER; PERFORMANCE; SELECTION; RELEASE; STORY; SOIL; BIG;
D O I
10.1021/acs.estlett.3c00278
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Per- and polyfluoroalkyl substances (PFASs) are a classof environmentalcontaminants that originate from various sources. The unique chemicalfingerprints associated with many commercial products and industrialapplications make PFASs ideal candidates for machine learning (ML)-assistedenvironmental forensics. Here, we propose a novel use of PFAS fingerprintsin fish tissue from surface water systems to classify exposure frommultiple sources of PFASs using a proof-of-concept demonstration.Three supervised ML classification techniques (k-nearest neighbors(KNN), decision trees, support vector machines) implementing two predictivefeatures are used to classify literature-reported PFAS fingerprintsin fish (n = 1057). The importance of additionalpredictive features was explored using brute force optimization ofa multifeature KNN algorithm. The multiclass classification consideredexposure to aqueous film-forming foam-impacted water, paper industrywastewater, diffuse sources, or PFASs undergoing long-range transport.The optimized classifiers demonstrated 85%-94% classificationaccuracy for this first known multiclass classification of PFASs forenvironmental forensics. The optimized classifiers also demonstrated79%-92% classification accuracy with a set of independent externalvalidation data (n = 192). Our results demonstratethat PFAS fingerprints in fish tissue may be an effective means ofPFAS source tracking in surface water systems. The source code isprovided for guidance on best practices for ML-assisted environmentalforensics.
引用
收藏
页码:1052 / 1058
页数:7
相关论文
共 55 条
  • [1] Adjei R., 2022, Conf. Appl. Statistics Agric. Nat. Resour, DOI [10.26077/693e-25f0, DOI 10.26077/693E-25F0, 10.26077/693E-25F0]
  • [2] Ahrens L., 2016, SOURCE TRACKING IMPA
  • [3] Polyfluoroalkyl compounds in the aquatic environment: a review of their occurrence and fate
    Ahrens, Lutz
    [J]. JOURNAL OF ENVIRONMENTAL MONITORING, 2011, 13 (01): : 20 - 31
  • [4] Amudsen C. E., 2016, PFAS RYGGE FLYSTASJO
  • [5] Occurrence of select perfluoroalkyl substances at US Air Force aqueous film-forming foam release sites other than fire-training areas: Field-validation of critical fate and transport properties
    Anderson, R. Hunter
    Long, G. Cornell
    Porter, Ronald C.
    Anderson, Janet K.
    [J]. CHEMOSPHERE, 2016, 150 : 678 - 685
  • [6] Uptake of Per- and Polyfluoroalkyl Substances by Fish, Mussel, and Passive Samplers in Mobile-Laboratory Exposures Using Groundwater from a Contamination Plume at a Historical Fire Training Area, Cape Cod, Massachusetts
    Barber, Larry B.
    Pickard, Heidi M.
    Alvarez, David A.
    Becanova, Jitka
    Keefe, Steffanie H.
    LeBlanc, Denis R.
    Lohmann, Rainer
    Steevens, Jeffery A.
    Vajda, Alan M.
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (07) : 5544 - 5557
  • [7] A forensic approach for distinguishing PFAS materials
    Benotti, Mark J.
    Fernandez, Loretta A.
    Peaslee, Graham F.
    Douglas, Gregory S.
    Uhler, Allen D.
    Emsbo-Mattingly, Stephen
    [J]. ENVIRONMENTAL FORENSICS, 2020, 21 (3-4) : 319 - 333
  • [8] Brody S., 2021, PREPRINT
  • [9] Statistics for big data: A perspective
    Buhlmann, Peter
    van de Geer, Sara
    [J]. STATISTICS & PROBABILITY LETTERS, 2018, 136 : 37 - 41
  • [10] Cawley GC, 2010, J MACH LEARN RES, V11, P2079