Genetic studies of complex human diseases: Characterizing SNP-disease associations using Bayesian networks

被引:44
作者
Han, Bing [2 ]
Chen, Xue-wen [1 ]
Talebizadeh, Zohreh [3 ,4 ]
Xu, Hua [5 ]
机构
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
[2] Univ Kansas, Bioinformat & Computat Life Sci Lab, ITTC, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[3] Childrens Mercy Hosp, Kansas City, MO 64108 USA
[4] Univ Missouri, Sch Med, Kansas City, MO 64108 USA
[5] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX 77030 USA
基金
美国国家科学基金会;
关键词
GENOME-WIDE ASSOCIATION; EPISTATIC INTERACTIONS; DIAGNOSTIC INTERVIEW; INDUCTION; ALLELES; CAUSAL;
D O I
10.1186/1752-0509-6-S3-S14
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Detecting epistatic interactions plays a significant role in improving pathogenesis, prevention, diagnosis, and treatment of complex human diseases. Applying machine learning or statistical methods to epistatic interaction detection will encounter some common problems, e.g., very limited number of samples, an extremely high search space, a large number of false positives, and ways to measure the association between disease markers and the phenotype. Results: To address the problems of computational methods in epistatic interaction detection, we propose a score-based Bayesian network structure learning method, EpiBN, to detect epistatic interactions. We apply the proposed method to both simulated datasets and three real disease datasets. Experimental results on simulation data show that our method outperforms some other commonly-used methods in terms of power and sample-efficiency, and is especially suitable for detecting epistatic interactions with weak or no marginal effects. Furthermore, our method is scalable to real disease data. Conclusions: We propose a Bayesian network-based method, EpiBN, to detect epistatic interactions. In EpiBN, we develop a new scoring function, which can reflect higher-order epistatic interactions by estimating the model complexity from data, and apply a fast Branch-and-Bound algorithm to learn the structure of a two-layer Bayesian network containing only one target node. To make our method scalable to real data, we propose the use of a Markov chain Monte Carlo (MCMC) method to perform the screening process. Applications of the proposed method to some real GWAS (genome-wide association studies) datasets may provide helpful insights into understanding the genetic basis of Age-related Macular Degeneration, late-onset Alzheimer's disease, and autism.
引用
收藏
页数:12
相关论文
共 52 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Aliferis CF, 2003, METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, P371
[3]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P235
[4]   Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database [J].
Bertram, Lars ;
McQueen, Matthew B. ;
Mullin, Kristina ;
Blacker, Deborah ;
Tanzi, Rudolph E. .
NATURE GENETICS, 2007, 39 (01) :17-23
[5]   Mutations in the inosine monophosphate dehydrogenase 1 gene (IMPDH1) cause the RP10 form of autosomal dominant retinitis pigmentosa [J].
Bowne, SJ ;
Sullivan, LS ;
Blanton, SH ;
Cepko, CL ;
Blackshaw, S ;
Birch, DG ;
Hughbanks-Wheaton, D ;
Heckenlively, JR ;
Daiger, SP .
HUMAN MOLECULAR GENETICS, 2002, 11 (05) :559-568
[6]  
Burnham KP., 2002, MODEL SELECTION MULT, DOI DOI 10.1007/B97636
[7]   Comparative analysis of methods for detecting interacting loci [J].
Chen, Li ;
Yu, Guoqiang ;
Langefeld, Carl D. ;
Miller, David J. ;
Guy, Richard T. ;
Raghuram, Jayaram ;
Yuan, Xiguo ;
Herrington, David M. ;
Wang, Yue .
BMC GENOMICS, 2011, 12
[8]   A support vector machine approach for detecting gene-gene interaction [J].
Chen, Shyh-Huei ;
Sun, Jielin ;
Dimitrov, Latchezar ;
Turner, Aubrey R. ;
Adams, Tamara S. ;
Meyers, Deborah A. ;
Chang, Bao-Li ;
Zheng, S. Lilly ;
Groenberg, Henrik ;
Xu, Jianfeng ;
Hsu, Fang-Chi .
GENETIC EPIDEMIOLOGY, 2008, 32 (02) :152-167
[9]   Improving Bayesian network structure learning with mutual information-based node ordering in the K2 algorithm [J].
Chen, Xue-Wen ;
Anantha, Gopalakrishna ;
Lin, Xiaotong .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (05) :628-640
[10]   An effective structure learning method for constructing gene networks [J].
Chen, Xue-wen ;
Anantha, Gopalakrishna ;
Wang, Xinkun .
BIOINFORMATICS, 2006, 22 (11) :1367-1374