iLSGRN: inference of large-scale gene regulatory networks based on multi-model fusion

被引:2
作者
Wu, Yiming [1 ]
Qian, Bing [1 ]
Wang, Anqi [2 ]
Dong, Heng [1 ]
Zhu, Enqiang [3 ]
Ma, Baoshan [1 ]
机构
[1] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong 999077, Peoples R China
[3] Guangzhou Univ, Inst Comp Sci & Technol, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
COEXPRESSION; GENERATION;
D O I
10.1093/bioinformatics/btad619
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene regulatory networks (GRNs) are a way of describing the interaction between genes, which contribute to revealing the different biological mechanisms in the cell. Reconstructing GRNs based on gene expression data has been a central computational problem in systems biology. However, due to the high dimensionality and non-linearity of large-scale GRNs, accurately and efficiently inferring GRNs is still a challenging task. Results: In this article, we propose a new approach, iLSGRN, to reconstruct large-scale GRNs from steady-state and time-series gene expression data based on non-linear ordinary differential equations. Firstly, the regulatory gene recognition algorithm calculates the Maximal Information Coefficient between genes and excludes redundant regulatory relationships to achieve dimensionality reduction. Then, the feature fusion algorithm constructs a model leveraging the feature importance derived from XGBoost (eXtreme Gradient Boosting) and RF (Random Forest) models, which can effectively train the non-linear ordinary differential equations model of GRNs and improve the accuracy and stability of the inference algorithm. The extensive experiments on different scale datasets show that our method makes sensible improvement compared with the state-of-the-art methods. Furthermore, we perform cross-validation experiments on the real gene datasets to validate the robustness and effectiveness of the proposed method. Availability and implementation: The proposed method is written in the Python language, and is available at: https://github.com/lab319/ iLSGRN.
引用
收藏
页数:9
相关论文
共 47 条
[1]   NCBI GEO: archive for functional genomics data sets-update [J].
Barrett, Tanya ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Evangelista, Carlos ;
Kim, Irene F. ;
Tomashevsky, Maxim ;
Marshall, Kimberly A. ;
Phillippy, Katherine H. ;
Sherman, Patti M. ;
Holko, Michelle ;
Yefanov, Andrey ;
Lee, Hyeseung ;
Zhang, Naigong ;
Robertson, Cynthia L. ;
Serova, Nadezhda ;
Davis, Sean ;
Soboleva, Alexandra .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D991-D995
[2]   The Inferelator:: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo [J].
Bonneau, Richard ;
Reiss, David J. ;
Shannon, Paul ;
Facciotti, Marc ;
Hood, Leroy ;
Baliga, Nitin S. ;
Thorsson, Vesteinn .
GENOME BIOLOGY, 2006, 7 (05)
[3]   Next generation sequencing technology: Advances and applications [J].
Buermans, H. P. J. ;
den Dunnen, J. T. .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 2014, 1842 (10) :1932-1941
[4]   Multi-study inference of regulatory networks for more accurate models of gene regulation [J].
Castro, Dayanne M. ;
de Veaux, Nicholas R. ;
Miraldi, Emily R. ;
Bonneau, Richard .
PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (01)
[5]   A review on the computational approaches for gene regulatory network construction [J].
Chai, Lian En ;
Loh, Swee Kuan ;
Low, Swee Thing ;
Mohamad, Mohd Saberi ;
Denis, Safaai ;
Zakaria, Zalmiyah .
COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 48 :55-65
[6]   Computational methods for Gene Regulatory Networks reconstruction and analysis: A review [J].
Delgado, Fernando M. ;
Gomez-Vela, Francisco .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 95 :133-145
[7]   Computational Analysis of the Global Effects of Ly6E in the Immune Response to Coronavirus Infection Using Gene Networks [J].
Delgado-Chaves, Fernando M. ;
Gomez-Vela, Francisco ;
Divina, Federico ;
Garcia-Torres, Miguel ;
Rodriguez-Baena, Domingo S. .
GENES, 2020, 11 (07) :1-33
[8]   Gene regulatory networks and their applications: understanding biological and medical problems in terms of networks [J].
Emmert-Streib, Frank ;
Dehmer, Matthias ;
Haibe-Kains, Benjamin .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2014, 2
[9]   Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles [J].
Faith, Jeremiah J. ;
Hayete, Boris ;
Thaden, Joshua T. ;
Mogno, Ilaria ;
Wierzbowski, Jamey ;
Cottarel, Guillaume ;
Kasif, Simon ;
Collins, James J. ;
Gardner, Timothy S. .
PLOS BIOLOGY, 2007, 5 (01) :54-66
[10]   Inferring Large-Scale Gene Regulatory Networks Using a Randomized Algorithm Based on Singular Value Decomposition [J].
Fan, Anjing ;
Wang, Haitao ;
Xiang, Hua ;
Zou, Xiufen .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (06) :1997-2008