Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data

Cited by: 20
Authors
Gao, Shouguo [1, 2]
Wang, Xujing [1, 2]
Affiliations
[1] Univ Alabama Birmingham, Dept Phys, Birmingham, AL 35294 USA
[2] Univ Alabama Birmingham, Comprehens Diabet Ctr, Birmingham, AL 35294 USA
Source
BMC BIOINFORMATICS | 2011 / Volume 12
Keywords
SACCHAROMYCES-CEREVISIAE; REGULATORY NETWORKS; CELL-CYCLE; YEAST; FRAMEWORK; ONTOLOGY; DESIGN;
DOI
10.1186/1471-2105-12-359
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Bayesian Network (BN) modeling is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data by itself suffers from high noise and lack of power. Incorporating prior biological knowledge can improve performance. Because each type of prior knowledge on its own may be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to utilize their consensus is desirable. Results: We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses a Naive Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included co-citation in PubMed and semantic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation, the Markov Chain Monte Carlo sampling algorithm is adopted; at each iteration it samples from this reservoir to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data, including data from a yeast cell cycle study and a mouse pancreas development/growth study. Incorporating prior knowledge led to a ~2-fold increase in the number of known transcription regulations recovered, without significant change in the false positive rate. In contrast, without the prior knowledge, BN modeling is not always better than random selection, demonstrating the necessity in network modeling to supplement the gene expression data with additional information. Conclusions: Our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves performance.
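The edge reservoir described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the gene names, the linkage scores, the `scale` parameter, and the function names `build_edge_reservoir` and `propose_edge` are all hypothetical, and the rounding rule for copy numbers is an assumption. The sketch only shows that drawing uniformly from a reservoir whose copy numbers are proportional to linkage likelihood proposes edges in proportion to that likelihood.

```python
import random
from collections import Counter


def build_edge_reservoir(linkage_scores, scale=100):
    """Build a list in which each edge appears round(score * scale) times.

    linkage_scores maps an edge (gene pair) to an estimated likelihood of
    functional linkage; `scale` is an illustrative assumption, not a value
    taken from the paper.
    """
    reservoir = []
    for edge, score in linkage_scores.items():
        copies = round(score * scale)
        reservoir.extend([edge] * copies)
    return reservoir


def propose_edge(reservoir, rng):
    """Draw one candidate edge uniformly from the reservoir."""
    return rng.choice(reservoir)


# Hypothetical linkage likelihoods for three gene pairs.
linkage_scores = {
    ("G1", "G2"): 0.60,
    ("G1", "G3"): 0.30,
    ("G2", "G3"): 0.10,
}

reservoir = build_edge_reservoir(linkage_scores)
rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Simulate many proposal steps and count how often each edge is drawn.
draws = Counter(propose_edge(reservoir, rng) for _ in range(10_000))
for edge, count in draws.most_common():
    print(edge, count)
```

In a full sampler, each proposed edge would be added to or removed from the current network, and the move accepted or rejected according to the network score on the expression data; that step is omitted here.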
Pages: 13
Related papers
50 in total
  • [1] Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data
    Shouguo Gao
    Xujing Wang
    BMC Bioinformatics, 12
  • [2] Bayesian network prior: network analysis of biological data using external knowledge
    Isci, Senol
    Dogan, Haluk
    Ozturk, Cengizhan
    Otu, Hasan H.
    BIOINFORMATICS, 2014, 30 (06) : 860 - 867
  • [3] Integration of gene expression data with prior knowledge for network analysis and validation
    Ante M.
    Wingender E.
    Fuchs M.
    BMC Research Notes, 4 (1)
  • [4] Incorporating literature knowledge in Bayesian Network for inferring gene networks with gene expression data
    Almasri, Eyad
    Larsen, Peter
    Chen, Guanrao
    Dai, Yang
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2008, 4983 : 184 - 195
  • [5] Gene Network Reconstruction by Integration of Prior Biological Knowledge
    Li, Yupeng
    Jackson, Scott A.
    G3-GENES GENOMES GENETICS, 2015, 5 (06) : 1075 - 1079
  • [6] Reconstruction of Biological Networks by Incorporating Prior Knowledge into Bayesian Network Models
    Pei, Baikang
    Shin, Dong-Guk
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (12) : 1324 - 1334
  • [7] NEURAL NETWORK CLASSIFICATION WITH PRIOR KNOWLEDGE FOR ANALYSIS OF BIOLOGICAL DATA
    Abbate, D.
    Guarracino, M. R.
    Chinchuluun, A.
    Pardalos, P. M.
    BIOMAT 2008, 2009, : 223 - +
  • [8] Reconstructing gene regulatory networks with Bayesian networks by combining expression data with multiple sources of prior knowledge
    Werhli, Adriano V.
    Husmeier, Dirk
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2007, 6
  • [9] Modeling semantics of inconsistent qualitative knowledge for quantitative Bayesian network inference
    Chang, Rui
    Brauer, Wilfried
    Stetter, Martin
    NEURAL NETWORKS, 2008, 21 (2-3) : 182 - 192
  • [10] A Hybrid Neural Network Approach for Lung Cancer Classification with Gene Expression Dataset and Prior Biological Knowledge
    Azzawi, Hasseeb
    Hou, Jingyu
    Alanni, Russul
    Xiang, Yong
    MACHINE LEARNING FOR NETWORKING, 2019, 11407 : 279 - 293