Quality evaluation of metabolite annotation based on comprehensive simulation of MS/MS data from high-resolution mass spectrometry (HRMS) and similarity scoring
Cited by: 0
Authors:
Shi, Yingjiao [1]
Yang, Ji [2]
Yang, Qianxu [2]
Zhang, Yipeng [2]
Zeng, Zhongda [2]
Affiliations:
[1] Dalian Univ, Coll Environm & Chem Engn, Dalian 116622, Peoples R China
[2] Technol Ctr China Tobacco Yunnan Ind Co Ltd, Kunming 650231, Peoples R China
Keywords:
High-resolution mass spectrometry;
MS/MS data simulation;
Virtual database;
False-positive metabolite annotation;
Discovery metabolomics;
SPECTRAL LIBRARY SEARCH;
IDENTIFICATION;
MS;
INSTRUMENTS;
ALGORITHMS;
DOI:
10.1007/s00216-025-05847-7
Chinese Library Classification (CLC) number:
Q5 [Biochemistry];
Discipline classification codes:
071010 ;
081704 ;
Abstract:
Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. 
This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.
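The abstract names entropy similarity (ES) and weighted dot product (W/DP) among the eight scoring methods but does not give their formulas. As a minimal sketch only, the Python below shows one common textbook form of each. Everything in it is an assumption rather than the authors' implementation: the function names, the 0.01 m/z matching tolerance, the greedy two-pointer peak alignment, the unweighted entropy variant, and the default exponents in the weighted dot product are all illustrative choices.

```python
import math


def match_peaks(a, b, tol=0.01):
    """Align two peak lists of (mz, intensity) pairs.

    Greedy two-pointer alignment (an assumed simplification): peaks whose
    m/z values differ by at most `tol` are paired; unpaired peaks get
    intensity 0.0 on the other side. Returns (mz, int_a, int_b) rows.
    """
    a, b = sorted(a), sorted(b)
    i = j = 0
    rows = []
    while i < len(a) and j < len(b):
        (mz_a, int_a), (mz_b, int_b) = a[i], b[j]
        if abs(mz_a - mz_b) <= tol:
            rows.append(((mz_a + mz_b) / 2, int_a, int_b))
            i += 1
            j += 1
        elif mz_a < mz_b:
            rows.append((mz_a, int_a, 0.0))
            i += 1
        else:
            rows.append((mz_b, 0.0, int_b))
            j += 1
    rows.extend((mz, inten, 0.0) for mz, inten in a[i:])
    rows.extend((mz, 0.0, inten) for mz, inten in b[j:])
    return rows


def shannon_entropy(p):
    """Shannon entropy (natural log) of a normalized intensity vector."""
    return -sum(x * math.log(x) for x in p if x > 0)


def entropy_similarity(a, b, tol=0.01):
    """Unweighted entropy similarity: 1 - (2*S_mix - S_a - S_b) / ln(4)."""
    rows = match_peaks(a, b, tol)
    sum_a = sum(r[1] for r in rows)
    sum_b = sum(r[2] for r in rows)
    if sum_a == 0 or sum_b == 0:
        return 0.0
    p_a = [r[1] / sum_a for r in rows]
    p_b = [r[2] / sum_b for r in rows]
    p_mix = [(x + y) / 2 for x, y in zip(p_a, p_b)]  # 1:1 merged spectrum
    gap = 2 * shannon_entropy(p_mix) - shannon_entropy(p_a) - shannon_entropy(p_b)
    return 1 - gap / math.log(4)


def weighted_dot_product(a, b, tol=0.01, int_power=0.5, mz_power=1.0):
    """Cosine score on weights intensity**int_power * mz**mz_power.

    The exponent defaults are placeholders, not values from the paper.
    """
    rows = match_peaks(a, b, tol)
    w_a = [int_a ** int_power * mz ** mz_power for mz, int_a, _ in rows]
    w_b = [int_b ** int_power * mz ** mz_power for mz, _, int_b in rows]
    norm = math.sqrt(sum(x * x for x in w_a)) * math.sqrt(sum(y * y for y in w_b))
    if norm == 0:
        return 0.0
    return sum(x * y for x, y in zip(w_a, w_b)) / norm
```

Under these assumptions, both functions score two identical spectra at 1 and two spectra with no shared peaks at 0; partially overlapping spectra fall in between.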
Pages: 17