Methods for testing association between uncertain genotypes and quantitative traits

被引:29
作者
Kutalik, Zoltan [1 ,2 ]
Johnson, Toby [1 ,2 ,3 ]
Bochud, Murielle [3 ]
Mooser, Vincent [4 ]
Vollenweider, Peter [5 ]
Waeber, Gerard [5 ]
Waterworth, Dawn [4 ]
Beckmann, Jacques S. [1 ,6 ]
Bergmann, Sven [1 ,2 ]
机构
[1] Univ Lausanne, Dept Med Genet, CH-1005 Lausanne, Switzerland
[2] Swiss Inst Bioinformat, CH-1015 Lausanne, Switzerland
[3] CHU Vaudois, Inst Social & Prevent Med, CH-1005 Lausanne, Switzerland
[4] GlaxoSmithKline, Med Genet, King Of Prussia, PA 19406 USA
[5] CHU Vaudois, Dept Internal Med, CH-1011 Lausanne, Switzerland
[6] CHU Vaudois, Serv Med Genet, CH-1011 Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
Genetic association; GWAS; Imputed genotypes; Statistical genetics; Uncertain genotypes; GENOME-WIDE ASSOCIATION; STATISTICAL-METHODS; MAXIMUM-LIKELIHOOD; SCORE TESTS; IMPUTATION; HAPLOTYPES; INFERENCE; COVERAGE; RISK; LOCI;
D O I
10.1093/biostatistics/kxq039
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interpretability and power of genome-wide association studies can be increased by imputing unobserved genotypes, using a reference panel of individuals genotyped at higher marker density. For many markers, genotypes cannot be imputed with complete certainty, and the uncertainty needs to be taken into account when testing for association with a given phenotype. In this paper, we compare currently available methods for testing association between uncertain genotypes and quantitative traits. We show that some previously described methods offer poor control of the false-positive rate (FPR), and that satisfactory performance of these methods is obtained only by using ad hoc filtering rules or by using a harsh transformation of the trait under study. We propose new methods that are based on exact maximum likelihood estimation and use a mixture model to accommodate nonnormal trait distributions when necessary. The new methods adequately control the FPR and also have equal or better power compared to all previously described methods. We provide a fast software implementation of all the methods studied here; our new method requires computation time of less than one computer-day for a typical genome-wide scan, with 2.5 M single nucleotide polymorphisms and 5000 individuals.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 42 条
[1]   Evaluating the effects of imputation on the power, coverage, and cost efficiency of genome-wide SNP platforms [J].
Anderson, Carl A. ;
Pettersson, Fredrik H. ;
Barrett, Jeffrey C. ;
Zhuang, Joanna J. ;
Ragoussis, Jiannis ;
Cardon, Lon R. ;
Morris, Andrew P. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 83 (01) :112-119
[2]  
[Anonymous], 2001, In all likelihood: statistical modelling and inference using likelihood
[3]  
[Anonymous], NUMERICAL RECIPES C
[4]   A tutorial on statistical methods for population association studies [J].
Balding, David J. .
NATURE REVIEWS GENETICS, 2006, 7 (10) :781-791
[5]   Evaluating coverage of genome-wide association studies [J].
Barrett, Jeffrey C. ;
Cardon, Lon R. .
NATURE GENETICS, 2006, 38 (06) :659-662
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[8]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[9]   A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission [J].
Clayton, D .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (04) :1170-1177
[10]   QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data [J].
Colella, Stefano ;
Yau, Christopher ;
Taylor, Jennifer M. ;
Mirza, Ghazala ;
Butler, Helen ;
Clouston, Penny ;
Bassett, Anne S. ;
Seller, Anneke ;
Holmes, Christopher C. ;
Ragoussis, Jiannis .
NUCLEIC ACIDS RESEARCH, 2007, 35 (06) :2013-2025