The era of big data: Genome-scale modelling meets machine learning

被引:56
作者
Antonakoudis, Athanasios [1 ]
Barbosa, Rodrigo [1 ]
Kotidis, Pavlos [1 ]
Kontoravdi, Cleo [1 ]
机构
[1] Imperial Coll London, Dept Chem Engn, London SW7 2AZ, England
基金
英国工程与自然科学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
Flux balance analysis; Cell metabolism; Strain optimisation; Chinese hamster ovary cells; Hybrid modelling; Principal component analysis; Recombinant protein production; METABOLIC FLUX ANALYSIS; RECOMBINANT PROTEIN-PRODUCTION; GENE-EXPRESSION DATA; FEATURE-SELECTION; BALANCE ANALYSIS; IMPROVED PREDICTION; CROSS-VALIDATION; CLASSIFICATION; GROWTH; FRAMEWORK;
D O I
10.1016/j.csbj.2020.10.011
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With omics data being generated at an unprecedented rate, genome-scale modelling has become pivotal in its organisation and analysis. However, machine learning methods have been gaining ground in cases where knowledge is insufficient to represent the mechanisms underlying such data or as a means for data curation prior to attempting mechanistic modelling. We discuss the latest advances in genome-scale modelling and the development of optimisation algorithms for network and error reduction, intracellular constraining and applications to strain design. We further review applications of supervised and unsupervised machine learning methods to omics datasets from microbial and mammalian cell systems and present efforts to harness the potential of both modelling approaches through hybrid modelling. (C) 2020 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
引用
收藏
页码:3287 / 3300
页数:14
相关论文
共 163 条
[1]   Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information [J].
Acencio, Marcio L. ;
Lemke, Ney .
BMC BIOINFORMATICS, 2009, 10 :290
[2]   Towards dynamic metabolic flux analysis in CHO cell cultures [J].
Ahn, Woo Suk ;
Antoniewicz, Maciek R. .
BIOTECHNOLOGY JOURNAL, 2012, 7 (01) :61-74
[3]  
Alam MT, 2016, NAT MICROBIOL, V1, DOI [10.1038/NMICROBIOL.2015.30, 10.1038/nmicrobiol.2015.30]
[4]   Global transcription machinery engineering: A new approach for improving cellular phenotype [J].
Alper, Hal ;
Stephanopoulos, Gregory .
METABOLIC ENGINEERING, 2007, 9 (03) :258-267
[5]   Determination of growth-coupling strategies and their underlying principles [J].
Alter, Tobias B. ;
Ebert, Birgitta E. .
BMC BIOINFORMATICS, 2019, 20 (01)
[6]   Towards creating an extended metabolic model (EMM) for E. coli using enzyme promiscuity prediction and metabolomics data [J].
Amin, Sara A. ;
Chavez, Elizabeth ;
Porokhin, Vladimir ;
Nair, Nikhil U. ;
Hassoun, Soha .
MICROBIAL CELL FACTORIES, 2019, 18 (1)
[7]   Integrating splice-isoform expression into genome-scale models characterizes breast cancer metabolism [J].
Angione, Claudio .
BIOINFORMATICS, 2018, 34 (03) :494-501
[8]   Predictive analytics of environmental adaptability in multi-omic network models [J].
Angione, Claudio ;
Lio, Pietro .
SCIENTIFIC REPORTS, 2015, 5
[9]   A Hybrid of Metabolic Flux Analysis and Bayesian Factor Modeling for Multiomic Temporal Pathway Activation [J].
Angione, Claudio ;
Pratanwanich, Naruemon ;
Lio, Pietro .
ACS SYNTHETIC BIOLOGY, 2015, 4 (08) :880-889
[10]   DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data [J].
Arisdakessian, Cedric ;
Poirion, Olivier ;
Yunits, Breck ;
Zhu, Xun ;
Garmire, Lana X. .
GENOME BIOLOGY, 2019, 20 (01)