Developmental gene regulatory network connections predicted by machine learning from gene expression data alone

被引:3
|
作者
Zhang, Jingyi [1 ]
Ibrahim, Farhan [1 ]
Najmulski, Emily [2 ]
Katholos, George [2 ]
Altarawy, Doaa [1 ,3 ]
Heath, Lenwood S. [1 ]
Tulin, Sarah L. [2 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[2] Canisius Coll, Dept Biol, Buffalo, NY 14208 USA
[3] Alexandria Univ, Comp & Syst Engn Dept, Alexandria, Egypt
来源
PLOS ONE | 2021年 / 16卷 / 12期
关键词
MESSENGER-RNAS; SPECIFICATION; VISUALIZATION; TRANSCRIPTOME; GENOME;
D O I
10.1371/journal.pone.0261926
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene regulatory network (GRN) inference can now take advantage of powerful machine learning algorithms to complement traditional experimental methods in building gene networks. However, the dynamical nature of embryonic development-representing the timedependent interactions between thousands of transcription factors, signaling molecules, and effector genes-is one of the most challenging arenas for GRN prediction. In this work, we show that successful GRN predictions for a developmental network from gene expression data alone can be obtained with the Priors Enriched Absent Knowledge (PEAK) network inference algorithm. PEAK is a noise-robust method that models gene expression dynamics via ordinary differential equations and selects the best network based on information-theoretic criteria coupled with the machine learning algorithm Elastic Net. We test our GRN prediction methodology using two gene expression datasets for the purple sea urchin, Stronglyocentrotus purpuratus, and cross-check our results against existing GRN models that have been constructed and validated by over 30 years of experimental results. Our results find a remarkably high degree of sensitivity in identifying known gene interactions in the network (maximum 81.58%). We also generate novel predictions for interactions that have not yet been described, which provide a resource for researchers to use to further complete the sea urchin GRN. Published ChIPseq data and spatial co-expression analysis further support a subset of the top novel predictions. We conclude that GRN predictions that match known gene interactions can be produced using gene expression data alone from developmental time series experiments.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Modeling Gene Regulatory Subnetworks from Time Course Gene Expression Data
    Liang, Xi-Jun
    Xia, Zhonghang
    Zhang, Li-Wei
    Wu, Fang-Xiang
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 216 - 221
  • [32] Qualitative reasoning of dynamic gene regulatory interactions from gene expression data
    Chen, Yu
    Park, Byungkyu
    Han, Kyungsook
    BMC GENOMICS, 2010, 11
  • [33] Identification of gene regulatory networks from time course gene expression data
    Wu, Fang-Xiang
    Liu, Li-Zhi
    Xia, Zhang-Hang
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 795 - 798
  • [34] Inference of gene regulatory subnetworks from time course gene expression data
    Liang, Xi-Jun
    Xia, Zhonghang
    Zhang, Li-Wei
    Wu, Fang-Xiang
    BMC BIOINFORMATICS, 2012, 13
  • [35] Qualitative reasoning of dynamic gene regulatory interactions from gene expression data
    Yu Chen
    Byungkyu Park
    Kyungsook Han
    BMC Genomics, 11
  • [36] Gene expression data analysis: a statistical and machine learning perspective
    Chattopadhyay, Amrita
    BIOMETRICS, 2023, 79 (01) : 526 - 528
  • [37] Learning the Regulatory Code of Gene Expression
    Zrimec, Jan
    Buric, Filip
    Kokina, Mariia
    Garcia, Victor
    Zelezniak, Aleksej
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [38] Bayesian Data Fusion of Gene Expression and Histone Modification Profiles for Inference of Gene Regulatory Network
    Chen, Haifen
    Maduranga, D. A. K.
    Mundra, Piyushkumar A.
    Zheng, Jie
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 516 - 525
  • [39] LogBTF: gene regulatory network inference using Boolean threshold network model from single-cell gene expression data
    Li, Lingyu
    Sun, Liangjie
    Chen, Guangyi
    Wong, Chi-Wing
    Ching, Wai-Ki
    Liu, Zhi-Ping
    BIOINFORMATICS, 2023, 39 (05)
  • [40] Reconstruction of the Regulatory Network for Bacillus subtilis and Reconciliation with Gene Expression Data
    Faria, Jose P.
    Overbeek, Ross
    Taylor, Ronald C.
    Conrad, Neal
    Vonstein, Veronika
    Goelzer, Anne
    Fromion, Vincent
    Rocha, Miguel
    Rocha, Isabel
    Henry, Christopher S.
    FRONTIERS IN MICROBIOLOGY, 2016, 7