A Generalized Kruskal-Wallis Test Incorporating Group Uncertainty with Application to Genetic Association Studies

被引:42
作者
Acar, Elif F. [1 ]
Sun, Lei [1 ,2 ]
机构
[1] Univ Toronto, Dept Stat, Toronto, ON M5S 1A1, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5S 1A1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Genome-wide association studies; Imputation; k-sample problem; Nonparametric test; Probabilistic data; Ranks; GENOME-WIDE ASSOCIATION; GENOTYPE IMPUTATION;
D O I
10.1111/biom.12006
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivated by genetic association studies of SNPs with genotype uncertainty, we propose a generalization of the Kruskal-Wallis test that incorporates group uncertainty when comparing k samples. The extended test statistic is based on probability-weighted rank-sums and follows an asymptotic chi-square distribution with k - 1 degrees of freedom under the null hypothesis. Simulation studies confirm the validity and robustness of the proposed test in finite samples. Application to a genome-wide association study of type 1 diabetic complications further demonstrates the utilities of this generalized Kruskal-Wallis test for studies with group uncertainty. The method has been implemented as an open-resource R program, GKW.
引用
收藏
页码:427 / 435
页数:9
相关论文
共 23 条
[1]  
[Anonymous], SELECTED TABLES MATH
[2]  
[Anonymous], 1999, Theory of Rank Tests
[3]   ProbABEL package for genome-wide association analysis of imputed data [J].
Aulchenko, Yurii S. ;
Struchalin, Maksim V. ;
van Duijn, Cornelia M. .
BMC BIOINFORMATICS, 2010, 11
[4]   High-throughput, pooled sequencing identifies mutations in NUBPL and FOXRED1 in human complex I deficiency [J].
Calvo, Sarah E. ;
Tucker, Elena J. ;
Compton, Alison G. ;
Kirby, Denise M. ;
Crawford, Gabriel ;
Burtt, Noel P. ;
Rivas, Manuel ;
Guiducci, Candace ;
Bruno, Damien L. ;
Goldberger, Olga A. ;
Redman, Michelle C. ;
Wiltshire, Esko ;
Wilson, Callum J. ;
Altshuler, David ;
Gabriel, Stacey B. ;
Daly, Mark J. ;
Thorburn, David R. ;
Mootha, Vamsi K. .
NATURE GENETICS, 2010, 42 (10) :851-+
[5]   Quantifying uncertainty in genotype calls [J].
Carvalho, Benilton S. ;
Louis, Thomas A. ;
Irizarry, Rafael A. .
BIOINFORMATICS, 2010, 26 (02) :242-249
[6]  
Fraser D. A. S., 1957, NONPARAMETRIC METHOD
[7]   Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs [J].
Korn, Joshua M. ;
Kuruvilla, Finny G. ;
McCarroll, Steven A. ;
Wysoker, Alec ;
Nemesh, James ;
Cawley, Simon ;
Hubbell, Earl ;
Veitch, Jim ;
Collins, Patrick J. ;
Darvishi, Katayoon ;
Lee, Charles ;
Nizzari, Marcia M. ;
Gabriel, Stacey B. ;
Purcell, Shaun ;
Daly, Mark J. ;
Altshuler, David .
NATURE GENETICS, 2008, 40 (10) :1253-1260
[8]   USE OF RANKS IN ONE-CRITERION VARIANCE ANALYSIS [J].
KRUSKAL, WH ;
WALLIS, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1952, 47 (260) :583-621
[9]   A NONPARAMETRIC TEST FOR THE SEVERAL SAMPLE PROBLEM [J].
KRUSKAL, WH .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :525-540
[10]   Methods for testing association between uncertain genotypes and quantitative traits [J].
Kutalik, Zoltan ;
Johnson, Toby ;
Bochud, Murielle ;
Mooser, Vincent ;
Vollenweider, Peter ;
Waeber, Gerard ;
Waterworth, Dawn ;
Beckmann, Jacques S. ;
Bergmann, Sven .
BIOSTATISTICS, 2011, 12 (01) :1-17