Dynamic Bayesian Network Learning to Infer Sparse Models From Time Series Gene Expression Data

被引:10
作者
Ajmal, Hamda B. [1 ]
Madden, Michael G. [1 ]
机构
[1] Natl Univ Ireland, Sch Comp Sci, Galway H91 TK33, Ireland
关键词
Gene expression; Data models; Biological system modeling; Bayes methods; Biology; Computational modeling; Regulation; Computational biology; bioinformatics; Bayesian networks; gene regulatory networks; gene expression; INFORMATION CRITERIA; REGULATORY NETWORKS; MUTUAL INFORMATION; LINEAR-MODELS; SELECTION; TRANSCRIPTION; MICROARRAY; GENERATION; CHALLENGES; DIMENSION;
D O I
10.1109/TCBB.2021.3092879
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
One of the key challenges in systems biology is to derive gene regulatory networks (GRNs) from complex high-dimensional sparse data. Bayesian networks (BNs) and dynamic Bayesian networks (DBNs) have been widely applied to infer GRNs from gene expression data. GRNs are typically sparse but traditional approaches of BN structure learning to elucidate GRNs often produce many spurious (false positive) edges. We present two new BN scoring functions, which are extensions to the Bayesian Information Criterion (BIC) score, with additional penalty terms and use them in conjunction with DBN structure search methods to find a graph structure that maximises the proposed scores. Our BN scoring functions offer better solutions for inferring networks with fewer spurious edges compared to the BIC score. The proposed methods are evaluated extensively on auto regressive and DREAM4 benchmarks. We found that they significantly improve the precision of the learned graphs, relative to the BIC score. The proposed methods are also evaluated on three real time series gene expression datasets. The results demonstrate that our algorithms are able to learn sparse graphs from high-dimensional time series data. The implementation of these algorithms is open source and is available in form of an R package on GitHub at https://github.com/HamdaBinteAjmal/DBN4GRN, along with the documentation and tutorials.
引用
收藏
页码:2794 / 2805
页数:12
相关论文
共 90 条
[1]  
Ajmal H., 2017, PROC 14 UAI BAYESIAN, P1
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   The curse(s) of dimensionality [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2018, 15 (06) :399-400
[4]   Analysis of time-series gene expression data: Methods, challenges, and opportunities [J].
Androulakis, I. P. ;
Yang, E. ;
Almon, R. R. .
ANNUAL REVIEW OF BIOMEDICAL ENGINEERING, 2007, 9 :205-228
[5]   Approximations and consistency of Bayes factors as model dimension grows [J].
Berger, JO ;
Ghosh, JK ;
Mukhopadhyay, N .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 112 (1-2) :241-258
[6]  
Bernard A, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P459
[7]   Improving algorithms for structure learning in Bayesian Networks using a new implicit score [J].
Bouchaala, Lobna ;
Masmoudi, Afif ;
Gargouri, Faiez ;
Rebai, Ahmed .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (07) :5470-5475
[8]  
Brenner E., 2013, P 20 9 C UNCERTAINTY, P112
[9]   A generalization of BIC for the general exponential family [J].
Chakrabarti, Arijit ;
Ghosh, Jayanta K. .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2006, 136 (09) :2847-2872
[10]   Building gene networks with time-delayed regulations [J].
Chaturvedi, Iti ;
Rajapakse, Jagath C. .
PATTERN RECOGNITION LETTERS, 2010, 31 (14) :2133-2137