ENNET: inferring large gene regulatory networks from expression data using gradient boosting

被引:30
|
作者
Slawek, Janusz [1 ]
Arodz, Tomasz [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
Gene regulatory networks; Network inference; Ensemble learning; Boosting; TRANSCRIPTIONAL REGULATION; INFERENCE; RECONSTRUCTION; GENERATION; ALGORITHM; BENCHMARK;
D O I
10.1186/1752-0509-7-106
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The regulation of gene expression by transcription factors is a key determinant of cellular phenotypes. Deciphering genome-wide networks that capture which transcription factors regulate which genes is one of the major efforts towards understanding and accurate modeling of living systems. However, reverse-engineering the network from gene expression profiles remains a challenge, because the data are noisy, high dimensional and sparse, and the regulation is often obscured by indirect connections. Results: We introduce a gene regulatory network inference algorithm ENNET, which reverse-engineers networks of transcriptional regulation from a variety of expression profiles with a superior accuracy compared to the state-of-the-art methods. The proposed method relies on the boosting of regression stumps combined with a relative variable importance measure for the initial scoring of transcription factors with respect to each gene. Then, we propose a technique for using a distribution of the initial scores and information about knockouts to refine the predictions. We evaluated the proposed method on the DREAM3, DREAM4 and DREAM5 data sets and achieved higher accuracy than the winners of those competitions and other established methods. Conclusions: Superior accuracy achieved on the three different benchmark data sets shows that ENNET is a top contender in the task of network inference. It is a versatile method that uses information about which gene was knocked-out in which experiment if it is available, but remains the top performer even without such information. ENNET is available for download from https://github.com/slawekj/ennet under the GNU GPLv3 license.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Inferring circadian gene regulatory relationships from gene expression data with a hybrid framework
    Hu, Shuwen
    Jing, Yi
    Li, Tao
    Wang, You-Gan
    Liu, Zhenyu
    Gao, Jing
    Tian, Yu-Chu
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [32] Inferring circadian gene regulatory relationships from gene expression data with a hybrid framework
    Shuwen Hu
    Yi Jing
    Tao Li
    You-Gan Wang
    Zhenyu Liu
    Jing Gao
    Yu-Chu Tian
    BMC Bioinformatics, 24
  • [33] Inferring stable gene regulatory networks from steady-state data
    Larvie, Joy E.
    Gorji, Mohammad S.
    Homaifar, Abdollah
    2015 41ST ANNUAL NORTHEAST BIOMEDICAL ENGINEERING CONFERENCE (NEBEC), 2015,
  • [34] Inferring gene regulatory networks from raw data - A molecular epistemics approach
    Kightley, DA
    Chandra, N
    Elliston, K
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2004, 2003, : 510 - 520
  • [35] Inferring gene regulatory networks from classified microarray data: Initial results
    Stuart Aitken
    Thanyaluk Jirapech-Umpai
    Ronan Daly
    BMC Bioinformatics, 6 (Suppl 3)
  • [36] Inferring gene regulatory networks from classified microarray data: Initial results
    不详
    BMC BIOINFORMATICS, 2005, 6
  • [37] Learning gene regulatory networks from gene expression data using weighted consensus
    Fujii, Chisato
    Kuwahara, Hiroyuki
    Yu, Ge
    Guo, Lili
    Gao, Xin
    NEUROCOMPUTING, 2017, 220 : 23 - 33
  • [38] Inferring gene regulatory networks from time series data using the minimum description length principle
    Zhao, Wentao
    Serpedin, Erchin
    Dougherty, Edward R.
    BIOINFORMATICS, 2006, 22 (17) : 2129 - 2135
  • [39] Inferring Gene Regulatory Networks From Single-Cell Transcriptomic Data Using Bidirectional RNN
    Gan, Yanglan
    Hu, Xin
    Zou, Guobing
    Yan, Cairong
    Xu, Guangwei
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [40] Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite
    Cassan, Oceane
    Lebre, Sophie
    Martin, Antoine
    BMC GENOMICS, 2021, 22 (01)