Methods for testing association between uncertain genotypes and quantitative traits

被引:29
作者
Kutalik, Zoltan [1 ,2 ]
Johnson, Toby [1 ,2 ,3 ]
Bochud, Murielle [3 ]
Mooser, Vincent [4 ]
Vollenweider, Peter [5 ]
Waeber, Gerard [5 ]
Waterworth, Dawn [4 ]
Beckmann, Jacques S. [1 ,6 ]
Bergmann, Sven [1 ,2 ]
机构
[1] Univ Lausanne, Dept Med Genet, CH-1005 Lausanne, Switzerland
[2] Swiss Inst Bioinformat, CH-1015 Lausanne, Switzerland
[3] CHU Vaudois, Inst Social & Prevent Med, CH-1005 Lausanne, Switzerland
[4] GlaxoSmithKline, Med Genet, King Of Prussia, PA 19406 USA
[5] CHU Vaudois, Dept Internal Med, CH-1011 Lausanne, Switzerland
[6] CHU Vaudois, Serv Med Genet, CH-1011 Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
Genetic association; GWAS; Imputed genotypes; Statistical genetics; Uncertain genotypes; GENOME-WIDE ASSOCIATION; STATISTICAL-METHODS; MAXIMUM-LIKELIHOOD; SCORE TESTS; IMPUTATION; HAPLOTYPES; INFERENCE; COVERAGE; RISK; LOCI;
D O I
10.1093/biostatistics/kxq039
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interpretability and power of genome-wide association studies can be increased by imputing unobserved genotypes, using a reference panel of individuals genotyped at higher marker density. For many markers, genotypes cannot be imputed with complete certainty, and the uncertainty needs to be taken into account when testing for association with a given phenotype. In this paper, we compare currently available methods for testing association between uncertain genotypes and quantitative traits. We show that some previously described methods offer poor control of the false-positive rate (FPR), and that satisfactory performance of these methods is obtained only by using ad hoc filtering rules or by using a harsh transformation of the trait under study. We propose new methods that are based on exact maximum likelihood estimation and use a mixture model to accommodate nonnormal trait distributions when necessary. The new methods adequately control the FPR and also have equal or better power compared to all previously described methods. We provide a fast software implementation of all the methods studied here; our new method requires computation time of less than one computer-day for a typical genome-wide scan, with 2.5 M single nucleotide polymorphisms and 5000 individuals.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 42 条
[41]   Asymptotic equivalence between two score tests for haplotype-specific risk in general linear models [J].
Xie, RR ;
Stram, DO .
GENETIC EPIDEMIOLOGY, 2005, 29 (02) :166-170
[42]   Testing association of statistically inferred haplotypes with discrete and continuous traits in samples of unrelated individuals [J].
Zaykin, DV ;
Westfall, PH ;
Young, SS ;
Karnoub, MA ;
Wagner, MJ ;
Ehm, MG .
HUMAN HEREDITY, 2002, 53 (02) :79-91