Similarity search for multi-dimensional NMR-spectra of natural products

被引:0
|
作者
Wolfram, Karina [1 ]
Porzel, Andrea [1 ]
Hinneburg, Alexander [1 ]
机构
[1] Univ Halle Wittenberg, Inst Comp Sci, Halle, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring products is an important task to investigate new potentially useful chemical compounds. We develop a set-based similarity function, which, however, does not sufficiently capture more abstract aspects of similarity. NMR-spectra are like documents, but consists of continuous multi-dimensional points instead of words. Probabilistic semantic indexing (PLSI) is an retrieval method, which learns hidden topics. We develop several mappings from continuous NMR-spectra to discrete text-like data. The new mappings include redundancies into the discrete data, which proofs helpful for the PLSI-model used afterwards. Our experiments show that PLSI, which is designed for text data created by humans, can effectively handle the mapped NMR-data originating from natural products. Additionally, PLSI combined with the new mappings is able to find meaningful "topics" in the NMR-data.
引用
收藏
页码:650 / 658
页数:9
相关论文
共 50 条
  • [41] Estimating the significance of a signal in a multi-dimensional search
    Vitells, Ofer
    Gross, Eilam
    ASTROPARTICLE PHYSICS, 2011, 35 (05) : 230 - 234
  • [42] C-13 NMR-SPECTRA OF SOME POLYCHLORINATED TELOMERIZATION PRODUCTS
    VELICHKO, FK
    DOSTOVALOVA, VI
    KUZMINA, NA
    FEDIN, EI
    FREIDLINA, RK
    ORGANIC MAGNETIC RESONANCE, 1975, 7 (01): : 46 - 50
  • [43] NATURAL-ABUNDANCE O-17 NMR-SPECTRA OF OZONIDES
    HOCK, F
    BALL, V
    DONG, Y
    GUTSCHE, SH
    HILSS, M
    SCHLINDWEIN, K
    GRIESBAUM, K
    JOURNAL OF MAGNETIC RESONANCE SERIES A, 1994, 111 (02) : 150 - 154
  • [44] NATURAL ABUNDANCE C-13 NMR-SPECTRA OF INTACT MUSCLE
    DOYLE, DD
    CHALOVICH, JM
    BARANY, M
    FEBS LETTERS, 1981, 131 (01) : 147 - 150
  • [45] USING SIMILARITY SEARCHES OVER DATABASES OF ESTIMATED C-13 NMR-SPECTRA FOR STRUCTURE IDENTIFICATION OF NATURAL PRODUCT COMPOUNDS
    TSIPOURAS, A
    ONDEYKA, J
    DUFRESNE, C
    LEE, S
    SALITURO, G
    TSOU, N
    GOETZ, M
    SINGH, SB
    KEARSLEY, SK
    ANALYTICA CHIMICA ACTA, 1995, 316 (02) : 161 - 171
  • [46] FOLDING AND PATTERN-RECOGNITION IN TWO-DIMENSIONAL NMR-SPECTRA
    EGGENBERGER, U
    PFANDLER, P
    BODENHAUSEN, G
    JOURNAL OF MAGNETIC RESONANCE, 1988, 77 (01): : 192 - 196
  • [47] TWO-DIMENSIONAL NMR-SPECTRA OF POLY(N-VINYLCARBAZOLE)
    NATANSOHN, A
    JOURNAL OF POLYMER SCIENCE PART A-POLYMER CHEMISTRY, 1989, 27 (13) : 4257 - 4265
  • [48] COMPUTERIZED ANALYSIS OF 2-DIMENSIONAL DOUBLE QUANTUM NMR-SPECTRA
    DUNKEL, R
    MAYNE, CL
    PUGMIRE, RJ
    GRANT, DM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1990, 199 : 40 - ANYL
  • [49] IMPROVED REPRESENTATION OF 2-DIMENSIONAL NMR-SPECTRA BY LOCAL RESCALING
    NEIDIG, KP
    KALBITZER, HR
    JOURNAL OF MAGNETIC RESONANCE, 1990, 88 (01): : 155 - 160
  • [50] TWO-DIMENSIONAL NMR-SPECTRA OF POLY(N-VINYLCARBAZOLE)
    NATANSOHN, A
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1988, 195 : 61 - POLY