Bypassing the Identification: MS2Quant for Concentration Estimations of Chemicals Detected with Nontarget LC-HRMS from MS2 Data

Cited by: 18
Authors
Sepman, Helen [1, 2]
Malm, Louise [1]
Peets, Pilleriin [1]
MacLeod, Matthew [2]
Martin, Jonathan [3]
Breitholtz, Magnus [2]
Kruve, Anneli [1, 2]
Affiliations
[1] Stockholm Univ, Dept Mat & Environm Chem, S-10691 Stockholm, Sweden
[2] Stockholm Univ, Dept Environm Sci, S-10691 Stockholm, Sweden
[3] Stockholm Univ, Dept Environm Sci, Sci Life Lab, S-10691 Stockholm, Sweden
Funding
Swedish Research Council
Keywords
ELECTROSPRAY-IONIZATION EFFICIENCY; RESOLUTION MASS-SPECTROMETRY; MOBILE-PHASE; SEMI-QUANTIFICATION; WATER; CONTAMINANTS; SUBSTANCES; PREDICTION; PARAMETERS; SUSPECT;
DOI
10.1021/acs.analchem.3c01744
Chinese Library Classification
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
Nontarget analysis by liquid chromatography-high-resolution mass spectrometry (LC-HRMS) is now widely used to detect pollutants in the environment. Shifting away from targeted methods has led to detection of previously unseen chemicals, and assessing the risk posed by these newly detected chemicals is an important challenge. Assessing exposure and toxicity of chemicals detected with nontarget HRMS is highly dependent on knowledge of the structure of the chemical. However, the majority of features detected in nontarget screening remain unidentified, and therefore risk assessment with conventional tools is hampered. Here, we developed MS2Quant, a machine learning model that enables prediction of concentration from fragmentation (MS2) spectra of detected but unidentified chemicals. MS2Quant is an xgbTree algorithm-based regression model developed using ionization efficiency data for 1191 unique chemicals that span 8 orders of magnitude. The ionization efficiency values are predicted from structural fingerprints that can be computed from the SMILES notation of the identified chemicals or from MS2 spectra of unidentified chemicals using SIRIUS+CSI:FingerID software. The root mean square errors of the training and test sets were 0.55 (3.5×) and 0.80 (6.3×) log-units, respectively. In comparison, ionization efficiency prediction approaches that depend on assigning an unequivocal structure typically yield errors from 2× to 6×. The MS2Quant quantification model was validated on a set of 39 environmental pollutants and resulted in a mean prediction error of 7.4×, a geometric mean of 4.5×, and a median of 4.0×. For comparison, a model based on PaDEL descriptors that depends on unequivocal structural assignment was developed using the same dataset. The latter approach yielded a comparable mean prediction error of 9.5×, a geometric mean of 5.6×, and a median of 5.2× on the validation set chemicals when the top structural assignment was used as input. This confirms that MS2Quant enables the extraction of exposure information for unidentified chemicals which, although detected, have thus far been disregarded due to a lack of accurate tools for quantification. The MS2Quant model is available as an R package on GitHub for improving discovery and monitoring of potentially hazardous environmental pollutants with nontarget screening.
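The abstract reports errors both in log-units and as fold errors (for example, 0.55 log-units alongside 3.5×), and summarizes validation performance with a mean, a geometric mean, and a median. The following minimal Python sketch illustrates how such figures relate, assuming that the fold error is 10 raised to the log10 error and that the per-chemical fold error is the larger of predicted/measured and measured/predicted. The function names and the three concentration pairs are hypothetical illustrations, not part of the MS2Quant package or its data.

```python
import statistics


def log_error_to_fold(log_error: float) -> float:
    """Convert an error in log10 units to a fold error (assumed relation)."""
    return 10 ** log_error


def fold_error(predicted: float, measured: float) -> float:
    """Symmetric fold error: always >= 1, whichever value is larger."""
    ratio = predicted / measured
    return max(ratio, 1 / ratio)


def summarize(fold_errors: list[float]) -> dict[str, float]:
    """Mean, geometric mean, and median of a list of fold errors."""
    return {
        "mean": statistics.fmean(fold_errors),
        "geometric_mean": statistics.geometric_mean(fold_errors),
        "median": statistics.median(fold_errors),
    }


# RMSE values quoted in the abstract, converted to fold errors.
train_fold = log_error_to_fold(0.55)
test_fold = log_error_to_fold(0.80)
print(f"training: {train_fold:.1f}x, test: {test_fold:.1f}x")

# Hypothetical (predicted, measured) concentration pairs, for illustration only.
pairs = [(2.0, 1.0), (1.0, 4.0), (8.0, 1.0)]
errors = [fold_error(p, m) for p, m in pairs]
summary = summarize(errors)
print(summary)
```

Because fold errors are skewed, a few poorly predicted chemicals pull the arithmetic mean well above the geometric mean and median, which is consistent with the pattern in the reported validation figures.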
Pages: 12329 - 12338
Page count: 10
Related papers
49 in total
  • [31] Identification of bacteriophage MS2 coat protein from E-coli lysates via ion trap collisional activation of intact protein ions
    Cargile, BJ
    McLuckey, SA
    Stephenson, JL
    ANALYTICAL CHEMISTRY, 2001, 73 (06) : 1277 - 1285
  • [32] Supercritical CO2 Assisted Extraction and LC-MS Identification of Picroside I and Picroside II from Picrorhiza kurroa
    Patil, Ajit A.
    Sachin, Bhusari S.
    Shinde, Devanand B.
    Wakte, Pravin S.
    PHYTOCHEMICAL ANALYSIS, 2013, 24 (02) : 97 - 104
  • [33] 2D-LC as an on-line desalting tool allowing peptide identification directly from MS unfriendly HPLC methods
    Luo, Hao
    Zhong, Wendy
    Yang, Jiong
    Zhuang, Ping
    Meng, Fanyu
    Caldwell, John
    Mao, Bing
    Welch, Christopher J.
    JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2017, 137 : 139 - 145
  • [34] Confident software-driven identification of intact glycopeptides by integrating complementary peptide- and glycan-first search strategies in efficient mining of MS2 data
    Chien, Yu-Chun
    Kuo, Chu-Wei
    Zeng, Wen-Feng
    Khoo, Kay-Hooi
    GLYCOBIOLOGY, 2021, 31 (12) : 1725 - 1726
  • [35] Correlation-Based Deconvolution (CorrDec) To Generate High-Quality MS2 Spectra from Data-Independent Acquisition in Multisample Studies
    Tada, Ipputa
    Chaleckis, Romanas
    Tsugawa, Hiroshi
    Meister, Isabel
    Zhang, Pei
    Lazarinis, Nikolaos
    Dahlen, Barbro
    Wheelock, Craig E.
    Arita, Masanori
    ANALYTICAL CHEMISTRY, 2020, 92 (16) : 11310 - 11317
  • [36] compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets
    Edmands, William M. B.
    Petrick, Lauren
    Barupal, Dinesh K.
    Scalbert, Augustin
    Wilson, Mark J.
    Wickliffe, Jeffrey K.
    Rappaport, Stephen M.
    ANALYTICAL CHEMISTRY, 2017, 89 (07) : 3919 - 3928
  • [37] Ameliorated membranous nephropathy activities of two ethanol extracts from corn silk and identification of flavonoid active compounds by LC-MS2
    Wang, Xizhu
    Yuan, Liyan
    Dong, Yifei
    Bao, Zhijie
    Ma, Tiecheng
    Lin, Songyi
    FOOD & FUNCTION, 2021, 12 (20) : 9669 - 9679
  • [38] Isolation and Identification of Dipeptidyl Peptidase IV-Inhibitory Peptides from Trypsin/Chymotrypsin-Treated Goat Milk Casein Hydrolysates by 2D-TLC and LC-MS/MS
    Zhang, Ying
    Chen, Ran
    Ma, Huiqin
    Chen, Shangwu
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2015, 63 (40) : 8819 - 8828
  • [39] LC-MS guided identification of dimeric 2-(2-phenylethyl)chromones and sesquiterpene-2-(2-phenylethyl)chromone conjugates from agarwood of Aquilaria crassna and their cytotoxicity
    Xia, Lulu
    Li, Wei
    Wang, Hao
    Chen, Huiqin
    Cai, Caihong
    Yang, Li
    Jiang, Bei
    Yang, Yiling
    Mei, Wenli
    Dai, Haofu
    FITOTERAPIA, 2019, 138
  • [40] DIA-MS2pep: a library-free framework for comprehensive peptide identification from data-independent acquisition data
    Junjie Hou
    Jifeng Wang
    Fuquan Yang
    Tao Xu
    Biophysics Reports, 2022, 8 (Z1) : 253 - 268