Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels)

被引:45
|
作者
Rodin, AS
Boerwinkle, E
机构
[1] Univ Texas, Sch Publ Hlth, Ctr Human Genet, Hlth Sci Ctr, Houston, TX 77225 USA
[2] Univ Texas, Inst Mol Med, Hlth Sci Ctr, Houston, TX 77225 USA
关键词
D O I
10.1093/bioinformatics/bti505
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The wealth of single nucleotide polymorphism (SNP) data within candidate genes and anticipated across the genome poses enormous analytical problems for studies of genotype-to-phenotype relationships, and modern data mining methods may be particularly well suited to meet the swelling challenges. In this paper, we introduce the method of Belief (Bayesian) networks to the domain of genotype-to-phenotype analyses and provide an example application. Results: A Belief network is a graphical model of a probabilistic nature that represents a joint multivariate probability distribution and reflects conditional independences between variables. Given the data, optimal network topology can be estimated with the assistance of heuristic search algorithms and scoring criteria. Statistical significance of edge strengths can be evaluated using Bayesian methods and bootstrapping. As an example application, the method of Belief networks was applied to 20 SNPs in the apolipoprotein (apo) E gene and plasma apoE levels in a sample of 702 individuals from Jackson, MS. Plasma apoE level was the primary target variable. These analyses indicate that the edge between SNP 4075, coding for the well-known epsilon 2 allele, and plasma apoE level was strong. Belief networks can effectively describe complex uncertain processes and can both learn from data and incorporate prior knowledge. Availability: Various alternative and supplemental networks (not given in the text) as well as source code extensions, are available from the authors. Contact: arodin@uth.tmc.edu Supplementary information: http://bioinformatics.oxfordjournals.org
引用
收藏
页码:3273 / 3278
页数:6
相关论文
共 50 条
  • [31] Application of Bayesian networks to generate synthetic health data
    Kaur, Dhamanpreet
    Sobiesk, Matthew
    Patil, Shubham
    Liu, Jin
    Bhagat, Puran
    Gupta, Amar
    Markuzon, Natasha
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (04) : 801 - 811
  • [32] The Application of Bayesian Belief Networks
    Krumay, Barbara
    Brandtweiner, Roman
    24TH BLED ECONFERENCE: EFUTURE: CREATING SOLUTIONS FOR THE INDIVIDUAL, ORGANISATIONS AND SOCIETY, 2011, : 507 - 516
  • [33] Bayesian Networks for Data Prediction
    Peng, Qingsong
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 101 - 102
  • [34] Application of a data-mining method based on Bayesian networks to lesion-deficit analysis
    Herskovits, EH
    Gerring, JP
    NEUROIMAGE, 2003, 19 (04) : 1664 - 1673
  • [35] A tutorial on data mining for Bayesian networks, with a specific focus on IoT for agriculture
    Krause, Paul J.
    Bokinala, Vivek
    INTERNET OF THINGS, 2023, 22 : 1 - 15
  • [36] Bayesian Networks in Equine Genetic Counseling
    Detilleux, Johann C.
    JOURNAL OF EQUINE VETERINARY SCIENCE, 2019, 73 : 115 - 120
  • [37] A fault mining method based on Bayesian networks
    Zhang, J. (jian_fei_zhang@163.com), 1600, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (07):
  • [38] Bayesian Approach to Learn Bayesian Networks Using Data and Constraints
    Gao Xiao-guang
    Yang Yu
    Guo Zhi-gao
    Chen Da-qing
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3667 - 3672
  • [39] Learning Bayesian Networks from Ordinal Data - The Bayesian Way
    Grzegorczyk, Marco
    DEVELOPMENTS IN STATISTICAL MODELLING, IWSM 2024, 2024, : 7 - 13
  • [40] Application of Bayesian Networks in Reliability Evaluation
    Cai, Baoping
    Kong, Xiangdi
    Liu, Yonghong
    Lin, Jing
    Yuan, Xiaobing
    Xu, Hongqi
    Ji, Renjie
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (04) : 2146 - 2157