Dynamic Bayesian Network Learning to Infer Sparse Models From Time Series Gene Expression Data

被引:9
|
作者
Ajmal, Hamda B. [1 ]
Madden, Michael G. [1 ]
机构
[1] Natl Univ Ireland, Sch Comp Sci, Galway H91 TK33, Ireland
关键词
Gene expression; Data models; Biological system modeling; Bayes methods; Biology; Computational modeling; Regulation; Computational biology; bioinformatics; Bayesian networks; gene regulatory networks; gene expression; INFORMATION CRITERIA; REGULATORY NETWORKS; MUTUAL INFORMATION; LINEAR-MODELS; SELECTION; TRANSCRIPTION; MICROARRAY; GENERATION; CHALLENGES; DIMENSION;
D O I
10.1109/TCBB.2021.3092879
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
One of the key challenges in systems biology is to derive gene regulatory networks (GRNs) from complex high-dimensional sparse data. Bayesian networks (BNs) and dynamic Bayesian networks (DBNs) have been widely applied to infer GRNs from gene expression data. GRNs are typically sparse but traditional approaches of BN structure learning to elucidate GRNs often produce many spurious (false positive) edges. We present two new BN scoring functions, which are extensions to the Bayesian Information Criterion (BIC) score, with additional penalty terms and use them in conjunction with DBN structure search methods to find a graph structure that maximises the proposed scores. Our BN scoring functions offer better solutions for inferring networks with fewer spurious edges compared to the BIC score. The proposed methods are evaluated extensively on auto regressive and DREAM4 benchmarks. We found that they significantly improve the precision of the learned graphs, relative to the BIC score. The proposed methods are also evaluated on three real time series gene expression datasets. The results demonstrate that our algorithms are able to learn sparse graphs from high-dimensional time series data. The implementation of these algorithms is open source and is available in form of an R package on GitHub at https://github.com/HamdaBinteAjmal/DBN4GRN, along with the documentation and tutorials.
引用
收藏
页码:2794 / 2805
页数:12
相关论文
共 50 条
  • [1] Bayesian Data Fusion of Gene Expression and Histone Modification Profiles for Inference of Gene Regulatory Network
    Chen, Haifen
    Maduranga, D. A. K.
    Mundra, Piyushkumar A.
    Zheng, Jie
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 516 - 525
  • [2] Investigating the Effects of Imputation Methods for Modelling Gene Networks Using a Dynamic Bayesian Network from Gene Expression Data
    Chai, Lian En
    Law, Chow Kuan
    Mohamad, Mohd Saberi
    Chong, Chuii Khim
    Choon, Yee Wen
    Deris, Safaai
    Illias, Rosli Md
    MALAYSIAN JOURNAL OF MEDICAL SCIENCES, 2014, 21 (02): : 20 - 27
  • [3] Using gene expression programming to infer gene regulatory networks from time-series data
    Zhang, Yongqing
    Pu, Yifei
    Zhang, Haisen
    Su, Yabo
    Zhang, Lifang
    Zhou, Jiliu
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2013, 47 : 198 - 206
  • [4] BAYESIAN SPARSE GRAPHICAL MODELS FOR CLASSIFICATION WITH APPLICATION TO PROTEIN EXPRESSION DATA
    Baladandayuthapani, Veerabhadran
    Talluri, Rajesh
    Ji, Yuan
    Coombes, Kevin R.
    Lu, Yiling
    Hennessy, Bryan T.
    Davies, Michael A.
    Mallick, Bani K.
    ANNALS OF APPLIED STATISTICS, 2014, 8 (03) : 1443 - 1468
  • [5] A dynamic time order network for time-series gene expression data analysis
    Zhang, Pengyue
    Mourad, Raphael
    Xiang, Yang
    Huang, Kun
    Huang, Tim
    Nephew, Kenneth
    Liu, Yunlong
    Li, Lang
    BMC SYSTEMS BIOLOGY, 2012, 6
  • [6] Bayesian models for gene expression with DNA microarray data
    Ibrahim, JG
    Chen, MH
    Gray, RJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 88 - 99
  • [7] A Bayesian network classification methodology for gene expression data
    Helman, P
    Veroff, R
    Atlas, SR
    Willman, C
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (04) : 581 - 615
  • [8] CSI: a nonparametric Bayesian approach to network inference from multiple perturbed time series gene expression data
    Penfold, Christopher A.
    Shifaz, Ahmed
    Brown, Paul E.
    Nicholson, Ann
    Wild, David L.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2015, 14 (03) : 307 - 310
  • [9] Combined mRMR Filter and Sparse Bayesian Classifier for Analysis of Gene Expression Data
    Soltani, Mehran
    Shammakhi, Mohammad Hasan
    Khorram, Saeed
    Sheikhzadeh, Hamid
    2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 123 - 127
  • [10] A Sparse Bayesian Learning Method for Structural Equation Model-Based Gene Regulatory Network Inference
    Li, Yan
    Liu, Dayou
    Chu, Jianfeng
    Zhu, Yungang
    Liu, Jie
    Cheng, Xiaochun
    IEEE ACCESS, 2020, 8 : 40067 - 40080