Similarity search for multi-dimensional NMR-spectra of natural products

被引:0
|
作者
Wolfram, Karina [1 ]
Porzel, Andrea [1 ]
Hinneburg, Alexander [1 ]
机构
[1] Univ Halle Wittenberg, Inst Comp Sci, Halle, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring products is an important task to investigate new potentially useful chemical compounds. We develop a set-based similarity function, which, however, does not sufficiently capture more abstract aspects of similarity. NMR-spectra are like documents, but consists of continuous multi-dimensional points instead of words. Probabilistic semantic indexing (PLSI) is an retrieval method, which learns hidden topics. We develop several mappings from continuous NMR-spectra to discrete text-like data. The new mappings include redundancies into the discrete data, which proofs helpful for the PLSI-model used afterwards. Our experiments show that PLSI, which is designed for text data created by humans, can effectively handle the mapped NMR-data originating from natural products. Additionally, PLSI combined with the new mappings is able to find meaningful "topics" in the NMR-data.
引用
收藏
页码:650 / 658
页数:9
相关论文
共 50 条
  • [31] ANALYSIS OF GENERALIZED TWO-DIMENSIONAL HOMONUCLEAR NMR-SPECTRA
    BAIN, AD
    BORNAIS, J
    BROWNSTEIN, S
    CANADIAN JOURNAL OF CHEMISTRY-REVUE CANADIENNE DE CHIMIE, 1981, 59 (04): : 723 - 730
  • [32] Fast multi-dimensional NMR by minimal sampling
    Kupce, Eriks
    Freeman, Ray
    JOURNAL OF MAGNETIC RESONANCE, 2008, 191 (01) : 164 - 168
  • [33] ENHANCEMENT OF GLOBAL SYMMETRIES IN 2-DIMENSIONAL NMR-SPECTRA
    NEIDIG, KP
    KALBITZER, HR
    JOURNAL OF MAGNETIC RESONANCE, 1991, 91 (01): : 155 - 164
  • [34] XIPP: multi-dimensional NMR analysis software
    Daniel S. Garrett
    Mengli Cai
    G. Marius Clore
    Journal of Biomolecular NMR, 2020, 74 : 9 - 25
  • [35] XIPP: multi-dimensional NMR analysis software
    Garrett, Daniel S.
    Cai, Mengli
    Clore, G. Marius
    JOURNAL OF BIOMOLECULAR NMR, 2020, 74 (01) : 9 - 25
  • [36] Is liquid state multi-dimensional NMR analysis of natural organic matter the whole solution?
    Cook, RL
    Foerster, H
    Althoff, G
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U1207 - U1207
  • [37] Similarity-Based Segmentation of Multi-Dimensional Signals
    Machne, Rainer
    Murray, Douglas B.
    Stadler, Peter F.
    SCIENTIFIC REPORTS, 2017, 7
  • [39] Similarity solutions for a multi-dimensional replicator dynamics equation
    Papanicolaou, Vassilis G.
    Smyrlis, George
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2009, 71 (7-8) : 3185 - 3196
  • [40] Similarity-Based Segmentation of Multi-Dimensional Signals
    Rainer Machné
    Douglas B. Murray
    Peter F. Stadler
    Scientific Reports, 7