Similarity search for multi-dimensional NMR-spectra of natural products

被引:0
|
作者
Wolfram, Karina [1 ]
Porzel, Andrea [1 ]
Hinneburg, Alexander [1 ]
机构
[1] Univ Halle Wittenberg, Inst Comp Sci, Halle, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring products is an important task to investigate new potentially useful chemical compounds. We develop a set-based similarity function, which, however, does not sufficiently capture more abstract aspects of similarity. NMR-spectra are like documents, but consists of continuous multi-dimensional points instead of words. Probabilistic semantic indexing (PLSI) is an retrieval method, which learns hidden topics. We develop several mappings from continuous NMR-spectra to discrete text-like data. The new mappings include redundancies into the discrete data, which proofs helpful for the PLSI-model used afterwards. Our experiments show that PLSI, which is designed for text data created by humans, can effectively handle the mapped NMR-data originating from natural products. Additionally, PLSI combined with the new mappings is able to find meaningful "topics" in the NMR-data.
引用
收藏
页码:650 / 658
页数:9
相关论文
共 50 条
  • [1] An evaluation of text retrieval methods for similarity search of multi-dimensional NMR-spectra
    Hinneburg, Alexander
    Porzel, Andrea
    Wolfram, Karina
    BIOINFORMATICS RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2007, 4414 : 424 - +
  • [2] EFFICIENT SIMILARITY SEARCH FOR MULTI-DIMENSIONAL TIME SEQUENCES
    Lee, Sangjun
    Park, Jisook
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (03) : 343 - 357
  • [3] Parallel acquisition of multi-dimensional spectra in protein NMR
    Ēriks Kupče
    Lewis E. Kay
    Journal of Biomolecular NMR, 2012, 54 : 1 - 7
  • [4] Parallel acquisition of multi-dimensional spectra in protein NMR
    Kupce, Eriks
    Kay, Lewis E.
    JOURNAL OF BIOMOLECULAR NMR, 2012, 54 (01) : 1 - 7
  • [5] Similarity Search Problem Research on Multi-dimensional Data Sets
    Shi, Yong
    Graham, Brian
    PROCEEDINGS OF THE 2013 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2013, : 573 - 577
  • [6] Indexing expensive functions for efficient multi-dimensional similarity search
    Chen, Hanxiong
    Liu, Jianquan
    Furuse, Kazutaka
    Yu, Jeffrey Xu
    Ohbo, Nobuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 27 (02) : 165 - 192
  • [7] Indexing expensive functions for efficient multi-dimensional similarity search
    Hanxiong Chen
    Jianquan Liu
    Kazutaka Furuse
    Jeffrey Xu Yu
    Nobuo Ohbo
    Knowledge and Information Systems, 2011, 27 : 165 - 192
  • [8] NMR-SPECTRA OF NATURAL COUMARIN DERIVATIVES
    PERELSON, ME
    SHEINKER, YN
    SYROVA, GP
    KHIMIYA PRIRODNYKH SOEDINENII, 1971, (05): : 576 - &
  • [9] MUNIN: A new approach to multi-dimensional NMR spectra interpretation
    Orekhov, VY
    Ibraghimov, IV
    Billeter, M
    JOURNAL OF BIOMOLECULAR NMR, 2001, 20 (01) : 49 - 60
  • [10] MUNIN: A new approach to multi-dimensional NMR spectra interpretation
    Vladislav Yu. Orekhov
    Ilghiz V. Ibraghimov
    Martin Billeter
    Journal of Biomolecular NMR, 2001, 20 : 49 - 60