A Simple and Fast Two-Locus Quality Control Test to Detect False Positives Due to Batch Effects in Genome-Wide Association Studies

Cited by: 10
Authors
Lee, Sang Hong [1 ]
Nyholt, Dale R. [1 ]
Macgregor, Stuart [1 ]
Henders, Anjali K. [1 ]
Zondervan, Krina T. [2 ]
Montgomery, Grant W. [1 ]
Visscher, Peter M. [1 ]
Affiliations
[1] Queensland Inst Med Res, Herston, Qld 4006, Australia
[2] Univ Oxford, John Radcliffe Hosp, Nuffield Dept Obstet & Gynaecol, Oxford OX3 9DU, England
Funding
Wellcome Trust (UK);
Keywords
genome-wide association study; batch effects; genotyping errors; linear model-based quality control; GENOTYPING ERRORS; LINKAGE; ENDOMETRIOSIS; HERITABILITY; FAMILIES; RISK;
DOI
10.1002/gepi.20541
Chinese Library Classification
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Erroneous genotypes that pass standard quality control (QC) can have a severe impact on genome-wide association studies, genotype imputation, estimation of heritability, and prediction of genetic risk based on single nucleotide polymorphisms (SNPs). To detect such genotyping errors, a simple two-locus QC method, based on the difference in the association test statistic between single SNPs and pairs of SNPs, was developed and applied. In real data, the proposed approach detected many problematic SNPs with statistical significance even when standard single-SNP QC analyses failed to detect them. Depending on the data set used, the number of erroneous SNPs that were not filtered out by standard single-SNP QC but were detected by the proposed approach varied from a few hundred to thousands. Simulated data showed that the proposed method was powerful and outperformed the other existing methods tested. The power of the proposed approach to detect erroneous genotypes was approximately 80% for a 3% error rate per SNP. This novel QC approach is easy to implement and computationally efficient, and can lead to better-quality genotypes for subsequent genotype-phenotype investigations. Genet. Epidemiol. 34:854-862, 2010. © 2010 Wiley-Liss, Inc.
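The abstract does not give the exact statistic, so the following is only an illustrative sketch of the general idea of comparing a single-SNP association statistic with a two-SNP one under a linear model. The function names `model_chi2` and `two_locus_difference`, the use of n times R-squared as the test statistic, and the choice to subtract the larger of the two single-SNP statistics are all assumptions made for illustration, not the authors' published procedure.

```python
import numpy as np

def model_chi2(y, predictors):
    """Return n * R^2 for a least-squares fit of y on the predictors plus an intercept.

    Hypothetical helper: n * R^2 is used here as a simple association statistic.
    """
    y = np.asarray(y, dtype=float)
    cols = [np.ones(len(y))] + [np.asarray(p, dtype=float) for p in predictors]
    X = np.column_stack(cols)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)  # handles collinear columns
    resid = y - X @ beta
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return len(y) * (1.0 - ss_res / ss_tot)

def two_locus_difference(y, snp_a, snp_b):
    """Two-SNP statistic minus the larger single-SNP statistic (assumed form)."""
    single = max(model_chi2(y, [snp_a]), model_chi2(y, [snp_b]))
    pair = model_chi2(y, [snp_a, snp_b])
    return pair - single

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 500
    snp_a = rng.binomial(2, 0.3, n)   # genotypes coded 0/1/2
    snp_b = rng.binomial(2, 0.4, n)
    status = rng.integers(0, 2, n)    # case/control coded 0/1
    print(two_locus_difference(status, snp_a, snp_b))
```

In this sketch, an unusually large difference for a pair of SNPs would mark the pair for closer inspection. How such a difference is calibrated for significance is not described in the abstract and is left out here.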
Pages: 854-862
Page count: 9