Small Molecule Accurate Recognition Technology (SMART) to Enhance Natural Products Research

被引:75
作者
Zhang, Chen [1 ]
Idelbayev, Yerlan [2 ]
Roberts, Nicholas [2 ]
Tao, Yiwen [3 ,4 ]
Nannapaneni, Yashwanth [2 ]
Duggan, Brendan M. [5 ]
Min, Jie [6 ]
Lin, Eugene C. [7 ,8 ]
Gerwick, Erik C. [9 ]
Cottrell, Garrison W. [2 ]
Gerwick, William H. [3 ,5 ]
机构
[1] Univ Calif San Diego, Dept Nanoengn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] Scripps Inst Oceanog, Ctr Marine Biotechnol & Biomed, La Jolla, CA 92037 USA
[4] Guangzhou Med Univ, Sch Pharmaceut Sci, Guangzhou 511436, Guangdong, Peoples R China
[5] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
[6] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[7] Vanderbilt Univ, Inst Imaging Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
[8] Vanderbilt Univ, Dept Radiol & Radiol Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
[9] Univ Gottingen, Phys Inst, Friedrich Hund Pl 1, D-37077 Gottingen, Germany
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
美国国家科学基金会;
关键词
CYCLIC DEPSIPEPTIDES; NEURAL-NETWORKS; NMR-SPECTRA; MARINE; SPECTROSCOPY; DERIVATIVES; SAPONINS; ROOTS; C-13;
D O I
10.1038/s41598-017-13923-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Various algorithms comparing 2D NMR spectra have been explored for their ability to dereplicate natural products as well as determine molecular structures. However, spectroscopic artefacts, solvent effects, and the interactive effect of functional group(s) on chemical shifts combine to hinder their effectiveness. Here, we leveraged Non-Uniform Sampling (NUS) 2D NMR techniques and deep Convolutional Neural Networks (CNNs) to create a tool, SMART, that can assist in natural products discovery efforts. First, an NUS heteronuclear single quantum coherence (HSQC) NMR pulse sequence was adapted to a state-of-the-art nuclear magnetic resonance (NMR) instrument, and data reconstruction methods were optimized, and second, a deep CNN with contrastive loss was trained on a database containing over 2,054 HSQC spectra as the training set. To demonstrate the utility of SMART, several newly isolated compounds were automatically located with their known analogues in the embedded clustering space, thereby streamlining the discovery pipeline for new natural products.
引用
收藏
页数:17
相关论文
共 72 条
  • [21] Lyngbyabellins K-N from Two Palmyra Atoll Collections of the Marine Cyanobacterium Moorea bouillonii
    Choi, Hyukjae
    Mevers, Emily
    Byrum, Tara
    Valeriote, Frederick A.
    Gerwick, William H.
    [J]. EUROPEAN JOURNAL OF ORGANIC CHEMISTRY, 2012, 2012 (27) : 5141 - 5150
  • [22] Learning a similarity metric discriminatively, with application to face verification
    Chopra, S
    Hadsell, R
    LeCun, Y
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 539 - 546
  • [23] NMRPIPE - A MULTIDIMENSIONAL SPECTRAL PROCESSING SYSTEM BASED ON UNIX PIPES
    DELAGLIO, F
    GRZESIEK, S
    VUISTER, GW
    ZHU, G
    PFEIFER, J
    BAX, A
    [J]. JOURNAL OF BIOMOLECULAR NMR, 1995, 6 (03) : 277 - 293
  • [24] DONOHO DL, 1992, J ROY STAT SOC B MET, V54, P41
  • [25] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [26] STRUCTURE OF CURACIN-A, A NOVEL ANTIMITOTIC, ANTIPROLIFERATIVE, AND BRINE SHRIMP TOXIC NATURAL PRODUCT FROM THE MARINE CYANOBACTERIUM LYNGBYA-MAJUSCULA
    GERWICK, WH
    PROTEAU, PJ
    NAGLE, DG
    HAMEL, E
    BLOKHIN, A
    SLATE, DL
    [J]. JOURNAL OF ORGANIC CHEMISTRY, 1994, 59 (06) : 1243 - 1245
  • [27] Glorot X, P INT C ART INT STAT
  • [28] Hadsell R., 2006, IEEE C COMP VIS PATT, P1735, DOI DOI 10.1109/CVPR.2006.100
  • [29] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [30] Hinneburg A., 2007, J INTEGR BIOINFORMAT, V4, P64, DOI [10.1515/JIB-2007-53, DOI 10.1515/JIB-2007-53]