Review: High-performance computing to detect epistasis in genome scale data sets

Times cited: 32
Authors
Upton, Alex [1]
Trelles, Oswaldo [2]
Cornejo-Garcia, Jose Antonio [3]
Perkins, James Richard [3]
Affiliations
[1] Univ Malaga, Dept Comp Architecture, Bitlab Res Grp, E-29071 Malaga, Spain
[2] Univ Malaga, Dept Comp Architecture, E-29071 Malaga, Spain
[3] Reg Univ Hosp Malaga, IBIMA Res Lab, Malaga, Spain
Keywords
epistasis; SNP-interactions; high-performance computing; disease marker; biomarker; genome sequencing; genotyping; GENE-GENE INTERACTIONS; MULTIFACTOR-DIMENSIONALITY REDUCTION; SNP-SNP INTERACTIONS; EXACERBATED RESPIRATORY-DISEASE; ASSOCIATION INTERACTION NETWORK; WIDE ASSOCIATION; EVOLUTIONAL PROPERTIES; LOGISTIC-REGRESSION; VARIABLE SELECTION; SURVIVAL PROGNOSIS
DOI
10.1093/bib/bbv058
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
It is becoming clear that most human diseases have a complex etiology that cannot be explained by single nucleotide polymorphisms (SNPs) or simple additive combinations; the general consensus is that they are caused by combinations of multiple genetic variations. The limited success of some genome-wide association studies is partly a result of this focus on single genetic markers. A more promising approach is to take into account epistasis, by considering the association of multiple SNP interactions with disease. However, as genomic data continues to grow in resolution, and genome and exome sequencing become more established, the number of combinations of variants to consider increases rapidly. Two potential solutions should be considered: the use of high-performance computing, which allows us to consider a larger number of variables, and heuristics to make the solution more tractable, essential in the case of genome sequencing. In this review, we look at different computational methods to analyse epistatic interactions within disease-related genetic data sets created by microarray technology. We also review efforts to use epistatic analysis results to produce biomarkers for diagnostic tests and give our views on future directions in this field in light of advances in sequencing technology and variants in non-coding regions.
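The abstract's point that the number of variant combinations "increases rapidly" can be made concrete with a little arithmetic. The following is a minimal sketch, assuming an exhaustive search that tests every unordered set of SNPs of a given interaction order; the function name `interaction_tests` and the marker counts are illustrative assumptions, not values taken from the review.

```python
from math import comb


def interaction_tests(n_snps: int, order: int) -> int:
    """Count the distinct SNP sets of size `order` among `n_snps` markers.

    This is the binomial coefficient C(n_snps, order), i.e. the number of
    tests an exhaustive scan of that interaction order would have to run.
    """
    return comb(n_snps, order)


# Hypothetical marker counts, chosen only to show how fast the search space grows.
for n_snps in (1_000, 100_000, 1_000_000):
    pairs = interaction_tests(n_snps, 2)
    triples = interaction_tests(n_snps, 3)
    print(f"{n_snps:>9,} SNPs: {pairs:.3e} pairs, {triples:.3e} triples")
```

Pairwise counts grow roughly with the square of the number of markers and three-way counts with the cube, which is why the abstract points to both high-performance computing and heuristics.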
Pages: 368-379
Page count: 12