Developmental gene regulatory network connections predicted by machine learning from gene expression data alone

被引:3
|
作者
Zhang, Jingyi [1 ]
Ibrahim, Farhan [1 ]
Najmulski, Emily [2 ]
Katholos, George [2 ]
Altarawy, Doaa [1 ,3 ]
Heath, Lenwood S. [1 ]
Tulin, Sarah L. [2 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[2] Canisius Coll, Dept Biol, Buffalo, NY 14208 USA
[3] Alexandria Univ, Comp & Syst Engn Dept, Alexandria, Egypt
来源
PLOS ONE | 2021年 / 16卷 / 12期
关键词
MESSENGER-RNAS; SPECIFICATION; VISUALIZATION; TRANSCRIPTOME; GENOME;
D O I
10.1371/journal.pone.0261926
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene regulatory network (GRN) inference can now take advantage of powerful machine learning algorithms to complement traditional experimental methods in building gene networks. However, the dynamical nature of embryonic development-representing the timedependent interactions between thousands of transcription factors, signaling molecules, and effector genes-is one of the most challenging arenas for GRN prediction. In this work, we show that successful GRN predictions for a developmental network from gene expression data alone can be obtained with the Priors Enriched Absent Knowledge (PEAK) network inference algorithm. PEAK is a noise-robust method that models gene expression dynamics via ordinary differential equations and selects the best network based on information-theoretic criteria coupled with the machine learning algorithm Elastic Net. We test our GRN prediction methodology using two gene expression datasets for the purple sea urchin, Stronglyocentrotus purpuratus, and cross-check our results against existing GRN models that have been constructed and validated by over 30 years of experimental results. Our results find a remarkably high degree of sensitivity in identifying known gene interactions in the network (maximum 81.58%). We also generate novel predictions for interactions that have not yet been described, which provide a resource for researchers to use to further complete the sea urchin GRN. Published ChIPseq data and spatial co-expression analysis further support a subset of the top novel predictions. We conclude that GRN predictions that match known gene interactions can be produced using gene expression data alone from developmental time series experiments.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] The Impact of Gene Expression Variation on the Robustness and Evolvability of a Developmental Gene Regulatory Network
    Garfield, David A.
    Runcie, Daniel E.
    Babbitt, Courtney C.
    Haygood, Ralph
    Nielsen, William J.
    Wray, Gregory A.
    PLOS BIOLOGY, 2013, 11 (10)
  • [2] Learning the structure of gene regulatory networks from time series gene expression data
    Haoni Li
    Nan Wang
    Ping Gong
    Edward J Perkins
    Chaoyang Zhang
    BMC Genomics, 12
  • [3] Learning gene regulatory networks from gene expression data using weighted consensus
    Fujii, Chisato
    Kuwahara, Hiroyuki
    Yu, Ge
    Guo, Lili
    Gao, Xin
    NEUROCOMPUTING, 2017, 220 : 23 - 33
  • [4] Learning the structure of gene regulatory networks from time series gene expression data
    Li, Haoni
    Wang, Nan
    Gong, Ping
    Perkins, Edward J.
    Zhang, Chaoyang
    BMC GENOMICS, 2011, 12
  • [5] Reconstructing directed gene regulatory network by only gene expression data
    Lu Zhang
    Xi Kang Feng
    Yen Kaow Ng
    Shuai Cheng Li
    BMC Genomics, 17
  • [6] Integrating heterogeneous gene expression data for gene regulatory network modelling
    Alina Sîrbu
    Heather J. Ruskin
    Martin Crane
    Theory in Biosciences, 2012, 131 : 95 - 102
  • [7] Integrating heterogeneous gene expression data for gene regulatory network modelling
    Sirbu, Alina
    Ruskin, Heather J.
    Crane, Martin
    THEORY IN BIOSCIENCES, 2012, 131 (02) : 95 - 102
  • [8] Reconstructing directed gene regulatory network by only gene expression data
    Zhang, Lu
    Feng, Xi Kang
    Ng, Yen Kaow
    Li, Shuai Cheng
    BMC GENOMICS, 2016, 17
  • [9] Reconstructing directed gene regulatory network by only gene expression data
    Zhang, Lu
    Ng, Yen Kaow
    Li, ShuaiCheng
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 163 - 170
  • [10] A Machine Learning Approach to Simulate Gene Expression and Infer Gene Regulatory Networks
    Zito, Francesco
    Cutello, Vincenzo
    Pavone, Mario
    ENTROPY, 2023, 25 (08)