Small Molecule Accurate Recognition Technology (SMART) to Enhance Natural Products Research

被引:75
作者
Zhang, Chen [1 ]
Idelbayev, Yerlan [2 ]
Roberts, Nicholas [2 ]
Tao, Yiwen [3 ,4 ]
Nannapaneni, Yashwanth [2 ]
Duggan, Brendan M. [5 ]
Min, Jie [6 ]
Lin, Eugene C. [7 ,8 ]
Gerwick, Erik C. [9 ]
Cottrell, Garrison W. [2 ]
Gerwick, William H. [3 ,5 ]
机构
[1] Univ Calif San Diego, Dept Nanoengn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] Scripps Inst Oceanog, Ctr Marine Biotechnol & Biomed, La Jolla, CA 92037 USA
[4] Guangzhou Med Univ, Sch Pharmaceut Sci, Guangzhou 511436, Guangdong, Peoples R China
[5] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
[6] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[7] Vanderbilt Univ, Inst Imaging Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
[8] Vanderbilt Univ, Dept Radiol & Radiol Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
[9] Univ Gottingen, Phys Inst, Friedrich Hund Pl 1, D-37077 Gottingen, Germany
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
美国国家科学基金会;
关键词
CYCLIC DEPSIPEPTIDES; NEURAL-NETWORKS; NMR-SPECTRA; MARINE; SPECTROSCOPY; DERIVATIVES; SAPONINS; ROOTS; C-13;
D O I
10.1038/s41598-017-13923-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Various algorithms comparing 2D NMR spectra have been explored for their ability to dereplicate natural products as well as determine molecular structures. However, spectroscopic artefacts, solvent effects, and the interactive effect of functional group(s) on chemical shifts combine to hinder their effectiveness. Here, we leveraged Non-Uniform Sampling (NUS) 2D NMR techniques and deep Convolutional Neural Networks (CNNs) to create a tool, SMART, that can assist in natural products discovery efforts. First, an NUS heteronuclear single quantum coherence (HSQC) NMR pulse sequence was adapted to a state-of-the-art nuclear magnetic resonance (NMR) instrument, and data reconstruction methods were optimized, and second, a deep CNN with contrastive loss was trained on a database containing over 2,054 HSQC spectra as the training set. To demonstrate the utility of SMART, several newly isolated compounds were automatically located with their known analogues in the embedded clustering space, thereby streamlining the discovery pipeline for new natural products.
引用
收藏
页数:17
相关论文
共 72 条
  • [1] Isolation of swinholide A and related glycosylated derivatives from two field collections of marine cyanobacteria
    Andrianasolo, EH
    Gross, H
    Goeger, D
    Musafija-Girt, M
    McPhail, KP
    Leal, RM
    Mooberry, SL
    Gerwick, WH
    [J]. ORGANIC LETTERS, 2005, 7 (07) : 1375 - 1378
  • [2] [Anonymous], NMR DATA PROCESSING
  • [3] [Anonymous], ARTIFICIAL INTELLIGE
  • [4] [Anonymous], SPIN DYNAMICS BASICS
  • [5] [Anonymous], SPIN DYNAMICS BASICS
  • [6] [Anonymous], ARTIFICIAL INTELLIGE
  • [7] [Anonymous], P INT C MACH LEARN L
  • [8] [Anonymous], DIGITAL IMAGE PROCES
  • [9] [Anonymous], 2016, ABS160502688 ARXIV
  • [10] [Anonymous], 2014, Bokeh: Python library for interactive visualization