Quality evaluation of metabolite annotation based on comprehensive simulation of MS/MS data from high-resolution mass spectrometry (HRMS) and similarity scoring

被引:0
作者
Shi, Yingjiao [1 ]
Yang, Ji [2 ]
Yang, Qianxu [2 ]
Zhang, Yipeng [2 ]
Zeng, Zhongda [2 ]
机构
[1] Dalian Univ, Coll Environm & Chem Engn, Dalian 116622, Peoples R China
[2] Technol Ctr China Tobacco Yunnan Ind Co Ltd, Kunming 650231, Peoples R China
关键词
High-resolution mass spectrometry; MS/MS data simulation; Virtual database; False-positive metabolite annotation; Discovery metabolomics; SPECTRAL LIBRARY SEARCH; IDENTIFICATION; MS; INSTRUMENTS; ALGORITHMS;
D O I
10.1007/s00216-025-05847-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Ion Fusion of High-Resolution LC MS-Based Metabolomics Data to Discover More Reliable Biomarkers
    Zeng, Zhongda
    Liu, Xinyu
    Dai, Weidong
    Yin, Peiyuan
    Zhou, Lina
    Huang, Qiang
    Lin, Xiaohui
    Xu, Guowang
    ANALYTICAL CHEMISTRY, 2014, 86 (08) : 3793 - 3800
  • [32] Introducing AAA-MS, a Rapid and Sensitive Method for Amino Acid Analysis Using Isotope Dilution and High-Resolution Mass Spectrometry
    Louwagie, Mathilde
    Kieffer-Jaquinod, Sylvie
    Dupierris, Veronique
    Coute, Yohann
    Bruley, Christophe
    Garin, Jerome
    Dupuis, Alain
    Jaquinod, Michel
    Brun, Virginie
    JOURNAL OF PROTEOME RESEARCH, 2012, 11 (07) : 3929 - 3936
  • [33] Nontarget screening of production waste samples from Leuckart amphetamine synthesis using liquid chromatography - high-resolution mass spectrometry as a complementary method to GC-MS impurity profiling
    Greif, Maximilian
    Koeke, Niklas
    Puetz, Michael
    Roessler, Thorsten
    Knepper, Thomas P.
    Froemel, Tobias
    DRUG TESTING AND ANALYSIS, 2022, 14 (03) : 450 - 461
  • [34] LipidMatch: an automated workflow for rule-based lipid identification using untargeted high-resolution tandem mass spectrometry data
    Koelmel, Jeremy P.
    Kroeger, Nicholas M.
    Ulmer, Candice Z.
    Bowden, John A.
    Patterson, Rainey E.
    Cochran, Jason A.
    Beecher, Christopher W. W.
    Garrett, Timothy J.
    Yost, Richard A.
    BMC BIOINFORMATICS, 2017, 18
  • [35] A Study on Tissue-Specific Metabolite Variations in Polygonum cuspidatum by High-Resolution Mass Spectrometry-Based Metabolic Profiling
    Wu, Zhijun
    Wang, Xiaowei
    Chen, Mo
    Hu, Hongyan
    Cao, Jie
    Chai, Tuanyao
    Wang, Hong
    MOLECULES, 2019, 24 (06)
  • [36] An Extended Markov Blanket Approach to Proteomic Biomarker Detection From High-Resolution Mass Spectrometry Data
    Oh, Jung Hun
    Gumani, Prem
    Schorge, John
    Rosenblatt, Kevin P.
    Gao, Jean X.
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (02): : 195 - 206
  • [37] Uncovering immobilized trypsin digestion features from large-scale proteome data generated by high-resolution mass spectrometry
    Sun, Liangliang
    Zhu, Guijie
    Yan, Xiaojing
    Mou, Si
    Dovichi, Norman J.
    JOURNAL OF CHROMATOGRAPHY A, 2014, 1337 : 40 - 47
  • [38] Development of a metabolomics-based data analysis approach for identifying drug metabolites based on high-resolution mass spectrometry
    Ting, Hsiao-Hsien
    Chiou, Yi-Shiou
    Chang, Tien-Yi
    Lin, Guan-Yu
    Li, Pei-Jhen
    Shih, Chia-Lung
    JOURNAL OF FOOD AND DRUG ANALYSIS, 2023, 31 (01) : 152 - 164
  • [39] Quantitative Protein Topography Analysis and High-Resolution Structure Prediction Using Hydroxyl Radical Labeling and Tandem-Ion Mass Spectrometry (MS)
    Kaur, Parminder
    Kiselar, Janna
    Yang, Sichun
    Chance, Mark R.
    MOLECULAR & CELLULAR PROTEOMICS, 2015, 14 (04) : 1159 - 1168
  • [40] Screening and quantification of emerging contaminants in Periyar River, Kerala (India) by using high-resolution mass spectrometry (LC-Q-ToF-MS)
    Nejumal K. Khalid
    Dineep Devadasan
    Usha K. Aravind
    Charuvila T. Aravindakumar
    Environmental Monitoring and Assessment, 2018, 190