Review: High-performance computing to detect epistasis in genome scale data sets

被引:33
作者
Upton, Alex [1 ]
Trelles, Oswaldo [2 ]
Antonio Cornejo-Garcia, Jose [3 ]
Richard Perkins, James [3 ]
机构
[1] Univ Malaga, Dept Comp Architecture, Bitlab Res Grp, E-29071 Malaga, Spain
[2] Univ Malaga, Dept Comp Architecture, E-29071 Malaga, Spain
[3] Reg Univ Hosp Malaga, IBIMA Res Lab, Malaga, Spain
关键词
epistasis; SNP-interactions; high-performance computing; disease marker; biomarker; genome sequencing; genotyping; GENE-GENE INTERACTIONS; MULTIFACTOR-DIMENSIONALITY REDUCTION; SNP-SNP INTERACTIONS; EXACERBATED RESPIRATORY-DISEASE; ASSOCIATION INTERACTION NETWORK; WIDE ASSOCIATION; EVOLUTIONAL PROPERTIES; LOGISTIC-REGRESSION; VARIABLE SELECTION; SURVIVAL PROGNOSIS;
D O I
10.1093/bib/bbv058
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
It is becoming clear that most human diseases have a complex etiology that cannot be explained by single nucleotide polymorphisms (SNPs) or simple additive combinations; the general consensus is that they are caused by combinations of multiple genetic variations. The limited success of some genome-wide association studies is partly a result of this focus on single genetic markers. A more promising approach is to take into account epistasis, by considering the association of multiple SNP interactions with disease. However, as genomic data continues to grow in resolution, and genome and exome sequencing become more established, the number of combinations of variants to consider increases rapidly. Two potential solutions should be considered: the use of high-performance computing, which allows us to consider a larger number of variables, and heuristics to make the solution more tractable, essential in the case of genome sequencing. In this review, we look at different computational methods to analyse epistatic interactions within disease-related genetic data sets created by microarray technology. We also review efforts to use epistatic analysis results to produce biomarkers for diagnostic tests and give our views on future directions in this field in light of advances in sequencing technology and variants in non-coding regions.
引用
收藏
页码:368 / 379
页数:12
相关论文
共 126 条
[61]   Comparison of multivariate adaptive regression splines and logistic regression in detecting SNP-SNP interactions and their application in prostate cancer [J].
Lin, Hui-Yi ;
Wang, Wenquan ;
Liu, Yung-Hsin ;
Soong, Seng-Jaw ;
York, Timothy P. ;
Myers, Leann ;
Hu, Jennifer J. .
JOURNAL OF HUMAN GENETICS, 2008, 53 (09) :802-811
[62]   Variable selection in logistic regression for detecting SNP-SNP interactions: the rheumatoid arthritis example [J].
Lin, Hui-Yi ;
Desmond, Renee ;
Bridges, S. Louis, Jr. ;
Soong, Seng-jaw .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2008, 16 (06) :735-741
[63]   An Exhaustive Epistatic SNP Association Analysis on Expanded Wellcome Trust Data [J].
Lippert, Christoph ;
Listgarten, Jennifer ;
Davidson, Robert I. ;
Baxter, Scott ;
Poong, Hoifung ;
Kadie, Carl M. ;
Heckerman, David .
SCIENTIFIC REPORTS, 2013, 3
[64]  
Liu Y., 2011, P 2011 INT ACM WORKS, P7
[65]   mbmdr: an R package for exploring gene-gene interactions associated with binary or quantitative traits [J].
Luz Calle, M. ;
Urrea, Victor ;
Malats, Nuria ;
Van Steen, Kristel .
BIOINFORMATICS, 2010, 26 (17) :2198-2199
[66]   Why epistasis is important for tackling complex human disease genetics [J].
Mackay, Trudy F. C. ;
Moore, Jason H. .
GENOME MEDICINE, 2014, 6
[67]   A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic [J].
Madsen, Bo Eskerod ;
Browning, Sharon R. .
PLOS GENETICS, 2009, 5 (02)
[68]   Finding the missing heritability of complex diseases [J].
Manolio, Teri A. ;
Collins, Francis S. ;
Cox, Nancy J. ;
Goldstein, David B. ;
Hindorff, Lucia A. ;
Hunter, David J. ;
McCarthy, Mark I. ;
Ramos, Erin M. ;
Cardon, Lon R. ;
Chakravarti, Aravinda ;
Cho, Judy H. ;
Guttmacher, Alan E. ;
Kong, Augustine ;
Kruglyak, Leonid ;
Mardis, Elaine ;
Rotimi, Charles N. ;
Slatkin, Montgomery ;
Valle, David ;
Whittemore, Alice S. ;
Boehnke, Michael ;
Clark, Andrew G. ;
Eichler, Evan E. ;
Gibson, Greg ;
Haines, Jonathan L. ;
Mackay, Trudy F. C. ;
McCarroll, Steven A. ;
Visscher, Peter M. .
NATURE, 2009, 461 (7265) :747-753
[69]   AN OVERVIEW OF MESSAGE-PASSING ENVIRONMENTS [J].
MCBRYAN, OA .
PARALLEL COMPUTING, 1994, 20 (04) :417-444
[70]   A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility [J].
Moore, Jason H. ;
Gilbert, Joshua C. ;
Tsai, Chia-Ti ;
Chiang, Fu-Tien ;
Holden, Todd ;
Barney, Nate ;
White, Bill C. .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 241 (02) :252-261