Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels)

被引:45
|
作者
Rodin, AS
Boerwinkle, E
机构
[1] Univ Texas, Sch Publ Hlth, Ctr Human Genet, Hlth Sci Ctr, Houston, TX 77225 USA
[2] Univ Texas, Inst Mol Med, Hlth Sci Ctr, Houston, TX 77225 USA
关键词
D O I
10.1093/bioinformatics/bti505
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The wealth of single nucleotide polymorphism (SNP) data within candidate genes and anticipated across the genome poses enormous analytical problems for studies of genotype-to-phenotype relationships, and modern data mining methods may be particularly well suited to meet the swelling challenges. In this paper, we introduce the method of Belief (Bayesian) networks to the domain of genotype-to-phenotype analyses and provide an example application. Results: A Belief network is a graphical model of a probabilistic nature that represents a joint multivariate probability distribution and reflects conditional independences between variables. Given the data, optimal network topology can be estimated with the assistance of heuristic search algorithms and scoring criteria. Statistical significance of edge strengths can be evaluated using Bayesian methods and bootstrapping. As an example application, the method of Belief networks was applied to 20 SNPs in the apolipoprotein (apo) E gene and plasma apoE levels in a sample of 702 individuals from Jackson, MS. Plasma apoE level was the primary target variable. These analyses indicate that the edge between SNP 4075, coding for the well-known epsilon 2 allele, and plasma apoE level was strong. Belief networks can effectively describe complex uncertain processes and can both learn from data and incorporate prior knowledge. Availability: Various alternative and supplemental networks (not given in the text) as well as source code extensions, are available from the authors. Contact: arodin@uth.tmc.edu Supplementary information: http://bioinformatics.oxfordjournals.org
引用
收藏
页码:3273 / 3278
页数:6
相关论文
共 50 条
  • [21] Reverse engineering of genetic networks with Bayesian networks
    Husmeier, D
    BIOCHEMICAL SOCIETY TRANSACTIONS, 2003, 31 : 1516 - 1518
  • [22] Collective Mining of Bayesian Networks from Distributed Heterogeneous Data
    R. Chen
    K. Sivakumar
    H. Kargupta
    Knowledge and Information Systems, 2004, 6 : 164 - 187
  • [23] Bayesian neural networks with confidence estimations applied to data mining
    Orre, R
    Lansner, A
    Bate, A
    Lindquist, M
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 34 (04) : 473 - 493
  • [24] Parallel data mining of Bayesian Networks from Telecommunications Network data
    Sterritt, R
    Adamson, K
    Shapcott, CM
    Curran, EP
    PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2000, 1800 : 415 - 422
  • [25] Bayesian inference of genetic regulatory networks from time series microarray data using dynamic Bayesian networks
    Huang, Yufei
    Wang, Jianyin
    Zhang, Jianqiu
    Sanchez, Maribel
    Wang, Yufeng
    Journal of Multimedia, 2007, 2 (03): : 46 - 56
  • [26] Collective mining of Bayesian networks from distributed heterogeneous data
    R. Chen
    K. Sivakumar
    H. Kargupta
    Knowledge and Information Systems, 2004, 6 (2) : 164 - 187
  • [27] Collective mining of Bayesian networks from distributed heterogeneous data
    Chen, R
    Sivakumar, K
    Kargupta, H
    KNOWLEDGE AND INFORMATION SYSTEMS, 2004, 6 (02) : 164 - 187
  • [28] Mining Bayesian networks out of ontologies
    Andrea, Bellandi
    Franco, Turini
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (02) : 507 - 532
  • [29] Bridging text mining and Bayesian Networks
    Raghuram, Sandeep
    Xia, Yuni
    Palakal, Mathew
    Jones, Josette
    Pecenka, Dave
    Tinsley, Eric
    Bandos, Jean
    Geesaman, Jerry
    2009 INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS, 2009, : 298 - +
  • [30] Mining Bayesian networks out of ontologies
    Bellandi Andrea
    Turini Franco
    Journal of Intelligent Information Systems, 2012, 38 : 507 - 532