High-throughput analysis of epistasis in genome-wide association studies with BiForce

被引:39
|
作者
Gyenesei, Attila [2 ,3 ]
Moody, Jonathan [1 ]
Semple, Colin A. M. [1 ]
Haley, Chris S. [1 ]
Wei, Wen-Hua [1 ]
机构
[1] Univ Edinburgh, Western Gen Hosp, Inst Genet & Mol Med, MRC Human Genet Unit, Edinburgh EH4 2XU, Midlothian, Scotland
[2] Univ Turku, Turku Ctr Biotechnol, Finnish Microarray & Sequencing Ctr, FIN-20520 Turku, Finland
[3] Abo Akad Univ, FIN-20520 Turku, Finland
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
GENE-GENE INTERACTIONS; MISSING HERITABILITY; COMPLEX DISEASES; LOCI; TRAITS; SUSCEPTIBILITY; POPULATION; STRATEGIES; MODELS; ERAP1;
D O I
10.1093/bioinformatics/bts304
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene-gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case-control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, > 4500 individuals) and two disease traits in another (> 340 000 SNPs, > 1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits.
引用
收藏
页码:1957 / 1964
页数:8
相关论文
共 50 条
  • [21] Optimized high-throughput screening of non-coding variants identified from genome-wide association studies
    Morova, Tunc
    Ding, Yi
    Huang, Chia-Chi F.
    Sar, Funda
    Schwarz, Tommer
    Giambartolomei, Claudia
    Baca, Sylvan C.
    Grishin, Dennis
    Hach, Faraz
    Gusev, Alexander
    Freedman, Matthew L.
    Pasaniuc, Bogdan
    Lack, Nathan A.
    NUCLEIC ACIDS RESEARCH, 2023, 51 (03) : E18 - E18
  • [22] Data mining for high throughput data from genome-wide association studies
    Park, Taesung
    Ott, Jurg
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2012, 6 (05)
  • [23] High-throughput approaches to genome-wide analysis of genetic variation in polyploid wheat
    Akhunov, E.
    Chao, S.
    Saintenac, C.
    Kiani, S.
    See, D.
    Brown-Guedira, G.
    Sorrells, M.
    Akhunova, A.
    Dubcovsky, J.
    Cavanagh, C.
    Hayden, M.
    CANADIAN JOURNAL OF PLANT SCIENCE, 2012, 92 (03) : 596 - 596
  • [24] Genome-wide high-throughput screening of T cell epitopes
    Arunima Singh
    Nature Methods, 2019, 16 : 953 - 953
  • [25] Genome-wide high-throughput screening of T cell epitopes
    Singh, Arunima
    NATURE METHODS, 2019, 16 (10) : 953 - 953
  • [26] High-throughput screening using genome-wide siRNA libraries
    Liang, ZC
    IDRUGS, 2005, 8 (11) : 924 - 926
  • [27] Use of genome-wide high-throughput technologies in biomarker development
    Classen, Sabine
    Staratschek-Jox, Andrea
    Schultze, Joachim L.
    BIOMARKERS IN MEDICINE, 2008, 2 (05) : 509 - 523
  • [28] Cellaxess®HT: high-throughput transfection for genome-wide RNAi
    Johan Pihl
    Marie-Louise Johansson
    Daniel Granfeldt
    Michal Tokarz
    Mattias Karlsson
    Jon Sinclair
    Nature Methods, 2008, 5 (6) : i - ii
  • [29] Human Genomic Loci Important in Common Infectious Diseases: Role of High-Throughput Sequencing and Genome-Wide Association Studies
    Mboowa, Gerald
    Sserwadda, Ivan
    Amujal, Marion
    Namatovu, Norah
    CANADIAN JOURNAL OF INFECTIOUS DISEASES & MEDICAL MICROBIOLOGY, 2018, 2018
  • [30] PLATE-Seq for genome-wide regulatory network analysis of high-throughput screens
    Bush, Erin C.
    Ray, Forest
    Alvarez, Mariano J.
    Realubit, Ronald
    Li, Hai
    Karan, Charles
    Califano, Andrea
    Sims, Peter A.
    NATURE COMMUNICATIONS, 2017, 8