Metabolome-scale prediction of intermediate compounds in multistep metabolic pathways with a recursive supervised approach

被引:11
作者
Kotera, Masaaki [1 ]
Tabei, Yasuo [2 ]
Yamanishi, Yoshihiro [3 ,4 ]
Muto, Ai [5 ]
Moriya, Yuki [6 ]
Tokimatsu, Toshiaki [6 ]
Goto, Susumu [6 ]
机构
[1] Tokyo Inst Technol, Grad Sch Biosci & Biotechnol, Meguro Ku, Tokyo 1528550, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Kawaguchi, Saitama 3320012, Japan
[3] Kyushu Univ, Med Inst Bioregulat, Div Syst Cohort, Higashi Ku, Fukuoka 8128582, Japan
[4] Kyushu Univ, Inst Adv Study, Higashi Ku, Fukuoka 8128581, Japan
[5] Nara Inst Sci & Technol NAIST, Grad Sch Biol Sci, Nara 6300192, Japan
[6] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Uji, Kyoto 6110011, Japan
基金
日本科学技术振兴机构; 日本学术振兴会;
关键词
RECONSTRUCTION; SUBSTRUCTURE; LOGIC; SETS; KEGG;
D O I
10.1093/bioinformatics/btu265
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Metabolic pathway analysis is crucial not only in metabolic engineering but also in rational drug design. However, the biosynthetic/ biodegradation pathways are known only for a small portion of metabolites, and a vast amount of pathways remain uncharacterized. Therefore, an important challenge in metabolomics is the de novo reconstruction of potential reaction networks on a metabolome-scale. Results: In this article, we develop a novel method to predict the multistep reaction sequences for de novo reconstruction of metabolic pathways in the reaction-filling framework. We propose a supervised approach to learn what we refer to as 'multistep reaction sequence likeness', i.e. whether a compound-compound pair is possibly converted to each other by a sequence of enzymatic reactions. In the algorithm, we propose a recursive procedure of using step-specific classifiers to predict the intermediate compounds in the multistep reaction sequences, based on chemical substructure fingerprints/ descriptors of compounds. We further demonstrate the usefulness of our proposed method on the prediction of enzymatic reaction networks from a metabolome-scale compound set and discuss characteristic features of the extracted chemical substructure transformation patterns in multistep reaction sequences. Our comprehensively predicted reaction networks help to fill the metabolic gap and to infer new reaction sequences in metabolic pathways.
引用
收藏
页码:165 / 174
页数:10
相关论文
共 25 条
  • [1] KNApSAcK Family Databases: Integrated Metabolite-Plant Species Databases for Multifaceted Plant Research
    Afendi, Farit Mochamad
    Okada, Taketo
    Yamazaki, Mami
    Hirai-Morita, Aki
    Nakamura, Yukiko
    Nakamura, Kensuke
    Ikeda, Shun
    Takahashi, Hiroki
    Altaf-Ul-Amin, Md.
    Darusman, Latifah K.
    Saito, Kazuki
    Kanaya, Shigehiko
    [J]. PLANT AND CELL PHYSIOLOGY, 2012, 53 (02) : e1
  • [2] Reconstruction of amino acid biosynthesis pathways from the complete genome sequence
    Bono, H
    Ogata, H
    Goto, S
    Kanehisa, M
    [J]. GENOME RESEARCH, 1998, 8 (03) : 203 - 210
  • [3] PREDICTING METABOLIC PATHWAYS BY LOGIC PROGRAMMING
    DARVAS, F
    [J]. JOURNAL OF MOLECULAR GRAPHICS, 1988, 6 (02): : 80 - 86
  • [4] The University of Minnesota pathway prediction system: predicting metabolic logic
    Ellis, Lynda B. M.
    Gao, Junfeng
    Fenner, Kathrin
    Wackett, Lawrence P.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : W427 - W432
  • [5] Knowledge-based expert systems for toxicity and metabolism prediction: DEREK, StAR and METEOR
    Greene, N
    Judson, PN
    Langowski, JJ
    Marchant, CA
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 1999, 10 (2-3) : 299 - +
  • [6] Exploring the diversity of complex metabolic networks
    Hatzimanikatis, V
    Li, CH
    Ionita, JA
    Henry, CS
    Jankowski, MD
    Broadbelt, LJ
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1603 - 1609
  • [7] KEGG for integration and interpretation of large-scale molecular data sets
    Kanehisa, Minoru
    Goto, Susumu
    Sato, Yoko
    Furumichi, Miho
    Tanabe, Mao
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D109 - D114
  • [8] Computational assignment of the EC numbers for genomic-scale analysis of enzymatic reactions
    Kotera, M
    Okuno, Y
    Hattori, M
    Goto, S
    Kanehisa, M
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2004, 126 (50) : 16487 - 16498
  • [9] Functional Group and Substructure Searching as a Tool in Metabolomics
    Kotera, Masaaki
    McDonald, Andrew G.
    Boyce, Sinead
    Tipton, Keith F.
    [J]. PLOS ONE, 2008, 3 (02):
  • [10] KCF-S: KEGG Chemical Function and Substructure for improved interpretability and prediction in chemical bioinformatics
    Kotera, Masaaki
    Tabei, Yasuo
    Yamanishi, Yoshihiro
    Moriya, Yuki
    Tokimatsu, Toshiaki
    Kanehisa, Minoru
    Goto, Susumu
    [J]. BMC SYSTEMS BIOLOGY, 2013, 7