Cuckoo search epistasis: a new method for exploring significant genetic interactions

被引:41
作者
Aflakparast, M. [1 ,2 ]
Salimi, H. [3 ]
Gerami, A. [4 ]
Dube, M-P [5 ]
Visweswaran, S. [6 ]
Masoudi-Nejad, A. [1 ]
机构
[1] Univ Tehran, Inst Biochem & Biophys, Lab Syst Biol & Bioinformat LBB, Tehran 131451365, Iran
[2] Vrije Univ Amsterdam, Fac Sci, Dept Math, Amsterdam, Netherlands
[3] Univ Tehran, Dept Comp Sci, Tehran 131451365, Iran
[4] Islamic Azad Univ, Dept Math & Stat, Qazvin Branch, Qazvin, Iran
[5] Univ Montreal, Fac Med, Dept Med, Montreal, PQ H3C 3J7, Canada
[6] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA USA
关键词
single-nucleotide polymorphism; epistatic interactions; dimensionality reduction; cuckoo search; DIMENSIONALITY REDUCTION METHOD; WHOLE-GENOME ASSOCIATION; ALZHEIMERS-DISEASE; WIDE ASSOCIATION; RISK; ALGORITHM; SELECTION; DATASETS; REVEALS; ALLELES;
D O I
10.1038/hdy.2014.4
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The advent of high-throughput sequencing technology has resulted in the ability to measure millions of single-nucleotide polymorphisms (SNPs) from thousands of individuals. Although these high-dimensional data have paved the way for better understanding of the genetic architecture of common diseases, they have also given rise to challenges in developing computational methods for learning epistatic relationships among genetic markers. We propose a new method, named cuckoo search epistasis (CSE) for identifying significant epistatic interactions in population-based association studies with a case-control design. This method combines a computationally efficient Bayesian scoring function with an evolutionary-based heuristic search algorithm, and can be efficiently applied to high-dimensional genome-wide SNP data. The experimental results from synthetic data sets show that CSE outperforms existing methods including multifactorial dimensionality reduction and Bayesian epistasis association mapping. In addition, on a real genome-wide data set related to Alzheimer's disease, CSE identified SNPs that are consistent with previously reported results, and show the utility of CSE for application to genome-wide data.
引用
收藏
页码:666 / 674
页数:9
相关论文
共 55 条
[1]  
[Anonymous], 2008, IEEE Intell Inf Bull
[2]   A MATHEMATICAL THEORY OF ADAPTIVE CONTROL PROCESSES [J].
BELLMAN, R ;
KALABA, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1959, 45 (08) :1288-1290
[3]  
Bush WS, 2009, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2009, P368
[4]   Alternative contingency table measures improve the power and detection of multifactor dimensionality reduction [J].
Bush, William S. ;
Edwards, Todd L. ;
Dudek, Scott M. ;
McKinney, Brett A. ;
Ritchie, Marylyn D. .
BMC BIOINFORMATICS, 2008, 9 (1)
[5]   Comparative analysis of methods for detecting interacting loci [J].
Chen, Li ;
Yu, Guoqiang ;
Langefeld, Carl D. ;
Miller, David J. ;
Guy, Richard T. ;
Raghuram, Jayaram ;
Yuan, Xiguo ;
Herrington, David M. ;
Wang, Yue .
BMC GENOMICS, 2011, 12
[6]   Odds ratio based multifactor-dimensionality reduction method for detecting gene-gene interactions [J].
Chung, Yujin ;
Lee, Seung Yeoun ;
Elston, Robert C. ;
Park, Taesung .
BIOINFORMATICS, 2007, 23 (01) :71-76
[7]   Epistasis in sporadic Alzheimer's disease [J].
Combarros, Onofre ;
Cortina-Borja, Mario ;
Smith, A. David ;
Lehmann, Donald J. .
NEUROBIOLOGY OF AGING, 2009, 30 (09) :1333-1349
[8]   A high-density whole-genome association study reveals that APOE is the major susceptibility gene for sporadic late-onset Alzheimer's disease [J].
Coon, Keith D. ;
Myers, Amanda J. ;
Craig, David W. ;
Webster, Jennifer A. ;
Pearson, John V. ;
Lince, Diane Hu ;
Zismann, Victoria L. ;
Beach, Thomas G. ;
Leung, Doris ;
Bryden, Leslie ;
Halperin, Rebecca F. ;
Marlowe, Lauren ;
Kaleem, Mona ;
Walker, Douglas G. ;
Ravid, Rivka ;
Heward, Christopher B. ;
Rogers, Joseph ;
Papassotiropoulos, Andreas ;
Reiman, Eric M. ;
Hardy, John ;
Stephan, Dietrich A. .
JOURNAL OF CLINICAL PSYCHIATRY, 2007, 68 (04) :613-618
[9]   A BAYESIAN METHOD FOR THE INDUCTION OF PROBABILISTIC NETWORKS FROM DATA [J].
COOPER, GF ;
HERSKOVITS, E .
MACHINE LEARNING, 1992, 9 (04) :309-347
[10]   Detecting gene-gene interactions that underlie human diseases [J].
Cordell, Heather J. .
NATURE REVIEWS GENETICS, 2009, 10 (06) :392-404