Dereplication of microbial metabolites through database search of mass spectra

被引:205
作者
Mohimani, Hosein [1 ,2 ]
Gurevich, Alexey [3 ]
Shlemov, Alexander [3 ]
Mikheenko, Alla [3 ]
Korobeynikov, Anton [3 ,4 ]
Cao, Liu [1 ]
Shcherbin, Egor [5 ]
Nothias, Louis-Felix [6 ]
Dorrestein, Pieter C. [6 ,7 ]
Pevzner, Pavel A. [2 ,3 ]
机构
[1] Carnegie Mellon Univ, Computat Biol Dept, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] St Petersburg State Univ, Ctr Algorithm Biotechnol, Inst Translat Biomed, St Petersburg, Russia
[4] St Petersburg State Univ, Dept Stat Modelling, St Petersburg, Russia
[5] Natl Res Univ, Higher Sch Econ, St Petersburg, Russia
[6] Univ Calif San Diego, Collaborat Mass Spectrometry Innovat Ctr, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
[7] Univ Calif San Diego, Dept Pharmacol & Pediat, La Jolla, CA 92093 USA
来源
NATURE COMMUNICATIONS | 2018年 / 9卷
基金
美国安德鲁·梅隆基金会; 俄罗斯科学基金会; 美国国家卫生研究院;
关键词
COLLISIONALLY ACTIVATED DISSOCIATION; PEPTIDIC NATURAL-PRODUCTS; PROTEIN IDENTIFICATION; STRUCTURE ELUCIDATION; MOLECULAR NETWORKING; DRUG DISCOVERY; GENE CLUSTERS; SPECTROMETRY; BIOSYNTHESIS; FRAGMENTATION;
D O I
10.1038/s41467-018-06082-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Natural products have traditionally been rich sources for drug discovery. In order to clear the road toward the discovery of unknown natural products, biologists need dereplication strategies that identify known ones. Here we report DEREPLICATOR+, an algorithm that improves on the previous approaches for identifying peptidic natural products, and extends them for identification of polyketides, terpenes, benzenoids, alkaloids, flavonoids, and other classes of natural products. We show that DEREPLICATOR+ can search all spectra in the recently launched Global Natural Products Social molecular network and identify an order of magnitude more natural products than previous dereplication efforts. We further demonstrate that DEREPLICATOR+ enables cross-validation of genome-mining and peptidogenomics/glycogenomics results.
引用
收藏
页数:12
相关论文
共 74 条
  • [1] Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification
    Allen, Felicity
    Greiner, Russ
    Wishart, David
    [J]. METABOLOMICS, 2015, 11 (01) : 98 - 110
  • [2] Protein identification by spectral networks analysis
    Bandeira, Nuno
    Tsur, Dekel
    Frank, Ari
    Pevzner, Pavel A.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) : 6140 - 6145
  • [3] Coupled-cluster theory in quantum chemistry
    Bartlett, Rodney J.
    Musial, Monika
    [J]. REVIEWS OF MODERN PHYSICS, 2007, 79 (01) : 291 - 352
  • [4] Fast metabolite identification with Input Output Kernel Regression
    Brouard, Celine
    Shen, Huibin
    Duehrkop, Kai
    d'Alche-Buc, Florence
    Boecker, Sebastian
    Rousu, Juho
    [J]. BIOINFORMATICS, 2016, 32 (12) : 28 - 36
  • [5] Chen SX, 1997, STAT SINICA, V7, P875
  • [6] Searching molecular structure databases with tandem mass spectra using CSI:FingerID
    Duehrkop, Kai
    Shen, Huibin
    Meusel, Marvin
    Rousu, Juho
    Boecker, Sebastian
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (41) : 12580 - 12585
  • [7] Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species
    Duncan, Katherine R.
    Cruesemann, Max
    Lechner, Anna
    Sarkar, Anindita
    Li, Jie
    Ziemert, Nadine
    Wang, Mingxun
    Bandeira, Nuno
    Moore, Bradley S.
    Dorrestein, Pieter C.
    Jensen, Paul R.
    [J]. CHEMISTRY & BIOLOGY, 2015, 22 (04): : 460 - 471
  • [8] Structure and biosynthesis of the jamaicamides, new mixed polyketide-peptide neurotoxins from the marine cyanobacterium Lyngbya majuscula
    Edwards, DJ
    Marquez, BL
    Nogle, LM
    McPhail, K
    Goeger, DE
    Roberts, MA
    Gerwick, WH
    [J]. CHEMISTRY & BIOLOGY, 2004, 11 (06): : 817 - 833
  • [9] AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE
    ENG, JK
    MCCORMACK, AL
    YATES, JR
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) : 976 - 989
  • [10] ClassyFire: automated chemical classification with a comprehensive, computable taxonomy
    Feunang, Yannick Djoumbou
    Eisner, Roman
    Knox, Craig
    Chepelev, Leonid
    Hastings, Janna
    Owen, Gareth
    Fahy, Eoin
    Steinbeck, Christoph
    Subramanian, Shankar
    Bolton, Evan
    Greiner, Russell
    Wishart, David S.
    [J]. JOURNAL OF CHEMINFORMATICS, 2016, 8 : 1 - 20