Bypassing the Identification: MS2Quant for Concentration Estimations of Chemicals Detected with Nontarget LC-HRMS from MS2 Data

被引:18
|
作者
Sepman, Helen [1 ,2 ]
Malm, Louise [1 ]
Peets, Pilleriin [1 ]
MacLeod, Matthew [2 ]
Martin, Jonathan [3 ]
Breitholtz, Magnus [2 ]
Kruve, Anneli [1 ,2 ]
机构
[1] Stockholm Univ, Dept Mat & Environm Chem, S-10691 Stockholm, Sweden
[2] Stockholm Univ, Dept Environm Sci, S-10691 Stockholm, Sweden
[3] Stockholm Univ, Dept Environm Sci, Sci Life Lab, S-10691 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
ELECTROSPRAY-IONIZATION EFFICIENCY; RESOLUTION MASS-SPECTROMETRY; MOBILE-PHASE; SEMI-QUANTIFICATION; WATER; CONTAMINANTS; SUBSTANCES; PREDICTION; PARAMETERS; SUSPECT;
D O I
10.1021/acs.analchem.3c01744
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Nontarget analysisby liquid chromatography-high-resolutionmass spectrometry (LC-HRMS) is now widely used to detect pollutantsin the environment. Shifting away from targeted methods has led todetection of previously unseen chemicals, and assessing the risk posedby these newly detected chemicals is an important challenge. Assessingexposure and toxicity of chemicals detected with nontarget HRMS ishighly dependent on the knowledge of the structure of the chemical.However, the majority of features detected in nontarget screeningremain unidentified and therefore the risk assessment with conventionaltools is hampered. Here, we developed MS2Quant, a machine learningmodel that enables prediction of concentration from fragmentation(MS2) spectra of detected, but unidentified chemicals.MS2Quant is an xgbTree algorithm-based regressionmodel developed using ionization efficiency data for 1191 unique chemicalsthat spans 8 orders of magnitude. The ionization efficiency valuesare predicted from structural fingerprints that can be computed fromthe SMILES notation of the identified chemicals or from MS2 spectra of unidentified chemicals using SIRIUS+CSI:FingerID software.The root mean square errors of the training and test sets were 0.55(3.5x) and 0.80 (6.3x) log-units, respectively. In comparison,ionization efficiency prediction approaches that depend on assigningan unequivocal structure typically yield errors from 2x to 6x.The MS2Quant quantification model was validated on a set of 39 environmentalpollutants and resulted in a mean prediction error of 7.4x, ageometric mean of 4.5x, and a median of 4.0x. For comparison,a model based on PaDEL descriptors that depends on unequivocal structuralassignment was developed using the same dataset. The latter approachyielded a comparable mean prediction error of 9.5x, a geometricmean of 5.6x, and a median of 5.2x on the validation setchemicals when the top structural assignment was used as input. Thisconfirms that MS2Quant enables to extract exposure information forunidentified chemicals which, although detected, have thus far beendisregarded due to lack of accurate tools for quantification. TheMS2Quant model is available as an R-package in GitHub for improvingdiscovery and monitoring of potentially hazardous environmental pollutantswith nontarget screening.
引用
收藏
页码:12329 / 12338
页数:10
相关论文
共 49 条
  • [21] Large-scale LC-MS/MS identification of proteins in aquaporin-2 (AQP2) vesicles immunoisolated from renal inner medullary collecting duct
    Barile, M
    Pisitkun, T
    Chou, C
    Verbalis, M
    Knepper, M
    FASEB JOURNAL, 2005, 19 (05): : A1606 - A1606
  • [22] Comparison of three directly coupled HPLC MS/MS strategies for identification of proteins from complex mixtures: single-dimension LC-MS/MS, 2-phase MudPIT, and 3-phase MudPIT
    McDonald, WH
    Ohi, R
    Miyamoto, DT
    Mitchison, TJ
    Yates, JR
    INTERNATIONAL JOURNAL OF MASS SPECTROMETRY, 2002, 219 (01) : 245 - 251
  • [23] Novel LC-MS2 Product Dependent Parallel Data Acquisition Function and Data Analysis Workflow for Sequencing and Identification of Intact Glycopeptides
    Wu, Sz-Wei
    Pu, Tsung-Hsien
    Viner, Rosa
    Khoo, Kay-Hooi
    ANALYTICAL CHEMISTRY, 2014, 86 (11) : 5478 - 5486
  • [24] Identification of membrane proteins from mammalian cell/tissue using methanol-facilitated solubilization and tryptic digestion coupled with 2D-LC-MS/MS
    Blonder, Josip
    Chan, King C.
    Issaq, Haleem J.
    Veenstra, Timothy D.
    NATURE PROTOCOLS, 2006, 1 (06) : 2784 - 2790
  • [25] Identification of membrane proteins from mammalian cell/tissue using methanol-facilitated solubilization and tryptic digestion coupled with 2D-LC-MS/MS
    Josip Blonder
    King C Chan
    Haleem J Issaq
    Timothy D Veenstra
    Nature Protocols, 2006, 1 : 2784 - 2790
  • [26] Coupling immuno-magnetic capture with LC–MS/MS(MRM) as a sensitive, reliable, and specific assay for SARS-CoV-2 identification from clinical samples
    Ofir Schuster
    Yafit Atiya-Nasagi
    Osnat Rosen
    Anat Zvi
    Itai Glinert
    Amir Ben Shmuel
    Shay Weiss
    Orly Laskar
    Liron Feldberg
    Analytical and Bioanalytical Chemistry, 2022, 414 : 1949 - 1962
  • [27] The identification of dilignols from dehydrogenation mixtures of coniferyl alcohol and apocynol [4-(1-hydroxyethyl)-2-methoxyphenol] by LC-ES-MS/MS
    Syrjänen, K
    Sipilä, J
    Björk, H
    Brunow, G
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2000, 48 (11) : 5211 - 5215
  • [28] Coupling immuno-magnetic capture with LC-MS/MS(MRM) as a sensitive, reliable, and specific assay for SARS-CoV-2 identification from clinical samples
    Schuster, Ofir
    Atiya-Nasagi, Yafit
    Rosen, Osnat
    Zvi, Anat
    Glinert, Itai
    Ben Shmuel, Amir
    Weiss, Shay
    Laskar, Orly
    Feldberg, Liron
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2022, 414 (05) : 1949 - 1962
  • [29] Identification of Candidate Protein Biomarkers for CIN2+Lesions from Self-Sampled, Dried Cervico-Vaginal Fluid Using LC-MS/MS
    Gutierrez, Ariadna Lara
    Lindberg, Julia Hedlund
    Shevchenko, Ganna
    Gustavsson, Inger
    Bergquist, Jonas
    Gyllensten, Ulf
    Enroth, Stefan
    CANCERS, 2021, 13 (11)
  • [30] Identification of bacteriophage ms2 coat protein from E. coli Lysates via ion trap collisional activation of intact protein ions
    Chemical Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-6365, United States
    Anal. Chem., 1600, 6 (1277-1285):