High-throughput analysis of epistasis in genome-wide association studies with BiForce

被引:39
|
作者
Gyenesei, Attila [2 ,3 ]
Moody, Jonathan [1 ]
Semple, Colin A. M. [1 ]
Haley, Chris S. [1 ]
Wei, Wen-Hua [1 ]
机构
[1] Univ Edinburgh, Western Gen Hosp, Inst Genet & Mol Med, MRC Human Genet Unit, Edinburgh EH4 2XU, Midlothian, Scotland
[2] Univ Turku, Turku Ctr Biotechnol, Finnish Microarray & Sequencing Ctr, FIN-20520 Turku, Finland
[3] Abo Akad Univ, FIN-20520 Turku, Finland
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
GENE-GENE INTERACTIONS; MISSING HERITABILITY; COMPLEX DISEASES; LOCI; TRAITS; SUSCEPTIBILITY; POPULATION; STRATEGIES; MODELS; ERAP1;
D O I
10.1093/bioinformatics/bts304
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene-gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case-control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, > 4500 individuals) and two disease traits in another (> 340 000 SNPs, > 1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits.
引用
收藏
页码:1957 / 1964
页数:8
相关论文
共 50 条
  • [31] PLATE-Seq for genome-wide regulatory network analysis of high-throughput screens
    Erin C. Bush
    Forest Ray
    Mariano J. Alvarez
    Ronald Realubit
    Hai Li
    Charles Karan
    Andrea Califano
    Peter A. Sims
    Nature Communications, 8
  • [32] Integrative analysis of genome-wide experiments in the context of a large high-throughput data compendium
    Tanay, Amos
    Steinfeld, Israel
    Kupiec, Martin
    Shamir, Ron
    MOLECULAR SYSTEMS BIOLOGY, 2005, 1 (1) : 2005.0002
  • [33] An Analysis Pipeline for Genome-wide Association Studies
    Stefanov, Stefan
    Lautenberger, James
    Gold, Bert
    CANCER INFORMATICS, 2008, 6 : 455 - +
  • [34] REPLICABILITY ANALYSIS FOR GENOME-WIDE ASSOCIATION STUDIES
    Heller, Ruth
    Yekutieli, Daniel
    ANNALS OF APPLIED STATISTICS, 2014, 8 (01): : 481 - 498
  • [35] Power analysis for genome-wide association studies
    Robert J Klein
    BMC Genetics, 8
  • [36] Power analysis for genome-wide association studies
    Klein, Robert J.
    BMC GENETICS, 2007, 8 (1)
  • [37] Genome-wide association study of stem structural characteristics that extracted by a high-throughput phenotypic analysis "LabelmeP rice" in rice
    Li, Jianguo
    Yang, Mingchong
    He, Dandan
    Luo, Zixuan
    Li, Bo
    Huang, Xiaojin
    Wu, Fangxi
    Xie, Guosheng
    Fan, Chuchuan
    Sun, Wenqiang
    Yu, Sibin
    Wang, Lingqiang
    PLANT JOURNAL, 2024, 119 (04): : 2080 - 2095
  • [38] Genome-wide association analysis of sucrose concentration in soybean (Glycine maxL.) seed based on high-throughput sequencing
    Sui, Meinan
    Wang, Yue
    Bao, Yuyue
    Wang, Xi
    Li, Ruiqiong
    Lv, Yan
    Yan, Ming
    Quan, Chao
    Li, Chunxia
    Teng, Weili
    Li, Wenbin
    Zhao, Xue
    Han, Yingpeng
    PLANT GENOME, 2020, 13 (03):
  • [39] Pathway discovery by genome-wide, high-throughput, quantitative mass spectrometry
    Jin, Shuangshuang
    Suleiman, Atef
    Daly, Donald
    Springer, David
    Miller, John
    2008 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2008, : 3 - +
  • [40] iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies
    Jittima Piriyapongsa
    Chumpol Ngamphiw
    Apichart Intarapanich
    Supasak Kulawonganunchai
    Anunchai Assawamakin
    Chaiwat Bootchai
    Philip J Shaw
    Sissades Tongsima
    BMC Genomics, 13