Combined network analysis and machine learning allows the prediction of metabolic pathways from tomato metabolomics data

Cited by: 47

Authors
Toubiana, David [1]
Puzis, Rami [2]
Wen, Lingling [3]
Sikron, Noga [3]
Kurmanbayeva, Assylay [3]
Soltabayeva, Aigerim [3]
Wilhelmi, Maria del Mar Rubio [1]
Sade, Nir [3, 4]
Fait, Aaron [3]
Sagi, Moshe [3]
Blumwald, Eduardo [1]
Elovici, Yuval [2]
Affiliations
[1] Univ Calif Davis, Dept Plant Sci, Davis, CA 95616 USA
[2] Ben Gurion Univ Negev, Dept Software & Informat Syst Engn, Telekom Innovat Labs, Beer Sheva, Israel
[3] Ben Gurion Univ Negev, French Associates Inst Agr & Biotechnol Drylands, Jacob Blaustein Inst Desert Res, Sede Boqer, Israel
[4] Tel Aviv Univ, Sch Plant Sci & Food Secur, Tel Aviv, Israel
Keywords
IDENTIFICATION; PURIFICATION; DATABASES; GENOMICS; GENES; LIVER; PCR;
DOI
10.1038/s42003-019-0440-4
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The identification and understanding of metabolic pathways is a key aspect in crop improvement and drug design. The common approach for their detection is based on gene annotation and ontology. Correlation-based network analysis, in which metabolites are arranged into a network, is used as a complementary tool. Here, we demonstrate the detection of metabolic pathways based on correlation-based network analysis combined with machine-learning techniques. Metabolites of known tomato pathways, non-tomato pathways, and random sets of metabolites were mapped as subgraphs onto metabolite correlation networks of the tomato pericarp. Network features were computed for each subgraph, generating a machine-learning model. The model predicted the presence of the β-alanine-degradation-I, tryptophan-degradation-VII-via-indole-3-pyruvate (yet unknown to plants), the β-alanine-biosynthesis-III, and the melibiose-degradation pathway, although melibiose was not part of the networks. In vivo assays validated the presence of the melibiose-degradation pathway. For the remaining pathways, only some of the genes encoding regulatory enzymes were detected.
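The subgraph-mapping step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the correlation threshold of 0.8, the chosen features (density, mean degree), the function names, and the toy abundance profiles are all assumptions made for illustration. In a full workflow, feature vectors like the one below would be computed for known-pathway, non-tomato-pathway, and random metabolite sets and then passed to a classifier, which is not shown here.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_network(profiles, threshold=0.8):
    """Connect two metabolites when |r| >= threshold (threshold is an assumption)."""
    edges = set()
    for a, b in combinations(sorted(profiles), 2):
        if abs(pearson(profiles[a], profiles[b])) >= threshold:
            edges.add((a, b))
    return edges


def subgraph_features(edges, metabolite_set, nodes):
    """Map a metabolite set onto the network and compute simple subgraph features.

    Metabolites absent from the network are dropped from the mapping.
    """
    members = sorted(m for m in metabolite_set if m in nodes)
    n = len(members)
    inner = [(a, b) for a, b in edges if a in members and b in members]
    possible = n * (n - 1) / 2
    return {
        "n_mapped": n,
        "n_edges": len(inner),
        "density": len(inner) / possible if possible else 0.0,
        "mean_degree": 2 * len(inner) / n if n else 0.0,
    }


# Toy abundance profiles (hypothetical values, five samples per metabolite).
profiles = {
    "alanine": [1.0, 2.0, 3.0, 4.0, 5.0],
    "beta_alanine": [2.1, 3.9, 6.2, 8.0, 9.9],
    "tryptophan": [5.0, 4.1, 2.9, 2.0, 1.1],
    "sucrose": [1.0, 3.0, 1.0, 3.0, 1.0],
}

network = correlation_network(profiles, threshold=0.8)

# A hypothetical pathway set; "melibiose" is not measured, so it is not mapped.
features = subgraph_features(
    network, {"alanine", "beta_alanine", "melibiose"}, set(profiles)
)
print(features)
```

Note that a pathway can still yield a feature vector when one of its metabolites is missing from the network, which is the situation the abstract reports for melibiose.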
Pages: 13