Evaluation of Nontarget Long-Term LC-HRMS Time Series Data Using Multivariate Statistical Approaches

被引:25
作者
Purschke, Kirsten [1 ,2 ,3 ]
Vosough, Maryam [5 ]
Leonhardt, Juri [4 ]
Weber, Markus [1 ]
Schmidt, Torsten C. [2 ,3 ,6 ]
机构
[1] Currenta GmbH & Co OHG, Environm Anal, D-51368 Leverkusen, Germany
[2] Univ Duisburg Essen, Instrumental Analyt Chem IAC, D-45141 Essen, Germany
[3] Univ Duisburg Essen, Ctr Water & Environm Res ZWU, D-45141 Essen, Germany
[4] Currenta GmbH & Co OHG, Prod Analyt, D-41538 Dormagen, Germany
[5] Chem & Chem Engn Res Ctr Iran CCERCI, Dept Clean Technol, Tehran 1496813151, Iran
[6] IWW Zentrum Wasser, D-45476 Mulheim, Germany
关键词
RESOLUTION MASS-SPECTROMETRY; LIQUID-CHROMATOGRAPHY; TRANSFORMATION PRODUCTS; STRATEGY; MS;
D O I
10.1021/acs.analchem.0c01897
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The use of liquid chromatography coupled with high-resolution mass spectrometry (LC-HRMS) has steadily increased in many application fields ranging from metabolomics to environmental science. HRMS data are frequently used for nontarget screening (NTS), i.e., the search for compounds that are not previously known and where no reference substances are available. However, the large quantity of data produced by NTS analytical workflows makes data interpretation and time-dependent monitoring of samples very sophisticated and necessitates exploiting chemometric data processing techniques. Consequently, in this study, a prioritization method to handle time series in nontarget data was established. As proof of concept, industrial wastewater was investigated. As routine industrial wastewater analyses monitor the occurrence of a limited number of targeted water contaminants, NTS provides the opportunity to detect also unknown trace organic compounds (TrOCs) that are not in the focus of routine target analysis. The developed prioritization method enables reducing raw data and including identification of prioritized unknown contaminants. To that end, a five-month time series for industrial wastewaters was utilized, analyzed by liquid chromatography-time-of-flight mass spectrometry (LC-qTOF-MS), and evaluated by NTS. Following peak detection, alignment, grouping, and blank subtraction, 3303 features were obtained of wastewater treatment plant (WWTP) influent samples. Subsequently, two complementary ways for exploratory time trend detection and feature prioritization are proposed. Therefore, following a prefiltering step, featurewise principal component analysis (PCA) and groupwise PCA (GPCA) of the matrix (temporal wise) were used to annotate trends of relevant wastewater contaminants. With sparse factorization of data matrices using GPCA, groups of correlated features/mass fragments or adducts were detected, recovered, and prioritized. Similarities and differences in the chemical composition of wastewater samples were observed over time to reveal hidden factors accounting for the structure of the data. The detected features were reduced to 130 relevant time trends related to TrOCs for identification. Exemplarily, as proof of concept, one nontarget pollutant was identified as N-methylpyrrolidone. The developed chemometric strategies of this study are not only suitable for industrial wastewater but also could be efficiently employed for time trend exploration in other scientific fields.
引用
收藏
页码:12273 / 12281
页数:9
相关论文
共 41 条
  • [1] Advances in liquid chromatography-high-resolution mass spectrometry for quantitative and qualitative environmental analysis
    Acena, Jaume
    Stampachiacchiere, Serena
    Perez, Sandra
    Barcelo, Damia
    [J]. ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2015, 407 (21) : 6289 - 6299
  • [2] Nontarget Screening Reveals Time Trends of Polar Micropollutants in a Riverbank Filtration System
    Albergamo, Vittorio
    Schollee, Jennifer E.
    Schymanski, Emma L.
    Helmus, Rick
    Timmer, Harrie
    Hollender, Juliane
    de Voogt, Pim
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2019, 53 (13) : 7584 - 7594
  • [3] Untargeted time-pattern analysis of LC-HRMS data to detect spills and compounds with high fluctuation in influent wastewater
    Alygizakis, Nikiforos A.
    Gago-Ferrero, Pablo
    Hollender, Juliane
    Thomaidis, Nikolaos S.
    [J]. JOURNAL OF HAZARDOUS MATERIALS, 2019, 361 : 19 - 29
  • [4] Assessing Emissions from Pharmaceutical Manufacturing Based on Temporal High-Resolution Mass Spectrometry Data
    Anliker, Sabine
    Loos, Martin
    Comte, Rahel
    Ruff, Matthias
    Fenner, Kathrin
    Singer, Heinz
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (07) : 4110 - 4120
  • [5] Bader T, 2016, ACS SYM SER, V1242, P49
  • [6] A New Concept for Regulatory Water Monitoring Via High-Performance Liquid Chromatography Coupled to High-Resolution Mass Spectrometry
    Brüggen S.
    Schmitz O.J.
    [J]. Journal of Analysis and Testing, 2018, 2 (4) : 342 - 351
  • [7] Group-Wise Principal Component Analysis for Exploratory Intrusion Detection
    Camacho, Jose
    Theron, Roberto
    Garcia-Gimenez, Jose M.
    Macia-Fernandez, Gabriel
    Garcia-Teodoro, Pedro
    [J]. IEEE ACCESS, 2019, 7 : 113081 - 113093
  • [8] Multivariate Big Data Analysis for intrusion detection: 5 steps from the haystack to the needle
    Camacho, Jose
    Manuel Garcia-Gimenez, Jose
    Marta Fuentes-Garcia, Noemi
    Macia-Fernandez, Gabriel
    [J]. COMPUTERS & SECURITY, 2019, 87
  • [9] Group-Wise Principal Component Analysis for Exploratory Data Analysis
    Camacho, Jose
    Rodriguez-Gomez, Rafael A.
    Saccenti, Edoardo
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (03) : 501 - 512
  • [10] Multivariate Exploratory Data Analysis (MEDA) Toolbox for Matlab
    Camacho, Jose
    Perez-Villegas, Alejandro
    Rodriguez-Gomez, Rafael A.
    Jimenez-Manas, Elena
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 143 : 49 - 57