CFM-ID 3.0: Significantly Improved ESI-MS/MS Prediction and Compound Identification

被引:194
作者
Djoumbou-Feunang, Yannick [1 ,5 ]
Pon, Allison [2 ]
Karu, Naama [1 ,6 ]
Zheng, Jiamin [1 ]
Li, Carin [1 ]
Arndt, David [1 ]
Gautam, Maheswor [1 ]
Allen, Felicity [3 ]
Wishart, David S. [1 ,4 ]
机构
[1] Univ Alberta, Dept Biol Sci, Edmonton, AB T6G 2E9, Canada
[2] OMx Personal Hlth Analyt, Edmonton, AB T5J 1B9, Canada
[3] Wellcome Sanger Inst, Wellcome Trust Genome Campus, Hinxton CB10 1SA, England
[4] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
[5] Corteva Agrisci, 9300 Zionsville Rd, Indianapolis, IN 46268 USA
[6] Leiden Univ, Leiden Acad Ctr Drug Res, NL-2300 RA Leiden, Netherlands
关键词
mass spectrometry; liquid chromatography; MS spectral prediction; metabolite identification; structure-based chemical classification; rule-based fragmentation; combinatorial fragmentation; METABOLITE IDENTIFICATION; MASS-SPECTRA; DATABASE; FRAGMENTATION; ANNOTATION; RESOURCE;
D O I
10.3390/metabo9040072
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolite identification for untargeted metabolomics is often hampered by the lack of experimentally collected reference spectra from tandem mass spectrometry (MS/MS). To circumvent this problem, Competitive Fragmentation Modeling-ID (CFM-ID) was developed to accurately predict electrospray ionization-MS/MS (ESI-MS/MS) spectra from chemical structures and to aid in compound identification via MS/MS spectral matching. While earlier versions of CFM-ID performed very well, CFM-ID's performance for predicting the MS/MS spectra of certain classes of compounds, including many lipids, was quite poor. Furthermore, CFM-ID's compound identification capabilities were limited because it did not use experimentally available MS/MS spectra nor did it exploit metadata in its spectral matching algorithm. Here, we describe significant improvements to CFM-ID's performance and speed. These include (1) the implementation of a rule-based fragmentation approach for lipid MS/MS spectral prediction, which greatly improves the speed and accuracy of CFM-ID; (2) the inclusion of experimental MS/MS spectra and other metadata to enhance CFM-ID's compound identification abilities; (3) the development of new scoring functions that improves CFM-ID's accuracy by 21.1%; and (4) the implementation of a chemical classification algorithm that correctly classifies unknown chemicals (based on their MS/MS spectra) in >80% of the cases. This improved version called CFM-ID 3.0 is freely available as a web server. Its source code is also accessible online.
引用
收藏
页数:23
相关论文
共 42 条
  • [1] Integration of Molecular Networking and In-Silico MS/MS Fragmentation for Natural Products Dereplication
    Allard, Pierre-Marie
    Peresse, Tiphaine
    Bisson, Jonathan
    Gindro, Katia
    Marcourt, Laurence
    Van Cuong Pham
    Roussi, Fanny
    Litaudon, Marc
    Wolfender, Jean-Luc
    [J]. ANALYTICAL CHEMISTRY, 2016, 88 (06) : 3317 - 3323
  • [2] Computational Prediction of Electron Ionization Mass Spectra to Assist in GC/MS Compound Identification
    Allen, Felicity
    Pon, Allison
    Greiner, Russ
    Wishart, David
    [J]. ANALYTICAL CHEMISTRY, 2016, 88 (15) : 7689 - 7697
  • [3] Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification
    Allen, Felicity
    Greiner, Russ
    Wishart, David
    [J]. METABOLOMICS, 2015, 11 (01) : 98 - 110
  • [4] CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectra
    Allen, Felicity
    Pon, Allison
    Wilson, Michael
    Greiner, Russ
    Wishart, David
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) : W94 - W99
  • [5] Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
    Blazenovic, Ivana
    Kind, Tobias
    Torbasinovic, Hrvoje
    Obrenovic, Slobodan
    Mehta, Sajjan S.
    Tsugawa, Hiroshi
    Wermuth, Tobias
    Schauer, Nicolas
    Jahn, Martina
    Biedendieck, Rebekka
    Jahn, Dieter
    Fiehn, Oliver
    [J]. JOURNAL OF CHEMINFORMATICS, 2017, 9
  • [6] Illuminating the dark matter in metabolomics
    da Silva, Ricardo R.
    Dorrestein, Pieter C.
    Quinn, Robert A.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (41) : 12549 - 12550
  • [7] Current and Future Perspectives on the Structural Identification of Small Molecules in Biological Systems
    Dias, Daniel A.
    Jones, Oliver A. H.
    Beale, David J.
    Boughton, Berin A.
    Benheim, Devin
    Kouremenos, Konstantinos A.
    Wolfender, Jean-Luc
    Wishart, David S.
    [J]. METABOLITES, 2016, 6 (04):
  • [8] BioTransformer: a comprehensive computational tool for small molecule metabolism prediction and metabolite identification
    Djoumbou-Feunang, Yannick
    Fiamoncini, Jarlei
    Gil-de-la-Fuente, Alberto
    Greiner, Russell
    Manach, Claudine
    Wishart, David S.
    [J]. JOURNAL OF CHEMINFORMATICS, 2019, 11 (1)
  • [9] Dodder N.G., ORGANIC BIOL MASS SP
  • [10] Searching molecular structure databases with tandem mass spectra using CSI:FingerID
    Duehrkop, Kai
    Shen, Huibin
    Meusel, Marvin
    Rousu, Juho
    Boecker, Sebastian
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (41) : 12580 - 12585