Small Sample Kernel Association Tests for Human Genetic and Microbiome Association Studies

被引:42
作者
Chen, Jun [1 ]
Chen, Wenan [1 ]
Zhao, Ni [2 ]
Wu, Michael C. [2 ]
Schaid, Daniel J. [1 ]
机构
[1] Mayo Clin, Dept Hlth Sci Res, Div Biomed Stat & Informat, Rochester, MN 55905 USA
[2] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98104 USA
关键词
kernel machine based association tests; small sample problem; exact tests; overdispersion; SEQUENCING DATA; MIXED MODELS; REGRESSION; RARE; PATHWAY; DISEASE; TRAITS;
D O I
10.1002/gepi.21934
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Kernel machine based association tests (KAT) have been increasingly used in testing the association between an outcome and a set of biological measurements due to its power to combine multiple weak signals of complex relationship with the outcome through the specification of a relevant kernel. Human genetic and microbiome association studies are two important applications of KAT. However, the classic KAT framework relies on large sample theory, and conservativeness has been observed for small sample studies, especially for microbiome association studies. The common approach for addressing the small sample problem relies on computationally intensive resampling methods. Here, we derive an exact test for KAT with continuous traits, which resolve the small sample conservatism of KAT without the need for resampling. The exact test has significantly improved power to detect association for microbiome studies. For binary traits, we propose a similar approximate test, and we show that the approximate test is very powerful for a wide range of kernels including common variant- and microbiome-based kernels, and the approximate test controls the type I error well for these kernels. In contrast, the sequence kernel association tests have slightly inflated genomic inflation factors after small sample adjustment. Extensive simulations and application to a real microbiome association study are used to demonstrate the utility of our method. (C) 2015 Wiley Periodicals, Inc.
引用
收藏
页码:5 / 19
页数:15
相关论文
共 22 条
[1]   QIIME allows analysis of high-throughput community sequencing data [J].
Caporaso, J. Gregory ;
Kuczynski, Justin ;
Stombaugh, Jesse ;
Bittinger, Kyle ;
Bushman, Frederic D. ;
Costello, Elizabeth K. ;
Fierer, Noah ;
Pena, Antonio Gonzalez ;
Goodrich, Julia K. ;
Gordon, Jeffrey I. ;
Huttley, Gavin A. ;
Kelley, Scott T. ;
Knights, Dan ;
Koenig, Jeremy E. ;
Ley, Ruth E. ;
Lozupone, Catherine A. ;
McDonald, Daniel ;
Muegge, Brian D. ;
Pirrung, Meg ;
Reeder, Jens ;
Sevinsky, Joel R. ;
Tumbaugh, Peter J. ;
Walters, William A. ;
Widmann, Jeremy ;
Yatsunenko, Tanya ;
Zaneveld, Jesse ;
Knight, Rob .
NATURE METHODS, 2010, 7 (05) :335-336
[2]   Disordered Microbial Communities in the Upper Respiratory Tract of Cigarette Smokers [J].
Charlson, Emily S. ;
Chen, Jun ;
Custers-Allen, Rebecca ;
Bittinger, Kyle ;
Li, Hongzhe ;
Sinha, Rohini ;
Hwang, Jennifer ;
Bushman, Frederic D. ;
Collman, Ronald G. .
PLOS ONE, 2010, 5 (12)
[3]   Kernel Methods for Regression Analysis of Microbiome Compositional Data [J].
Chen, Jun ;
Li, Hongzhe .
TOPICS IN APPLIED STATISTICS, 2013, 55 :191-201
[4]   Associating microbiome composition with environmental covariates using generalized UniFrac distances [J].
Chen, Jun ;
Bittinger, Kyle ;
Charlson, Emily S. ;
Hoffmann, Christian ;
Lewis, James ;
Wu, Gary D. ;
Collman, Ronald G. ;
Bushman, Frederic D. ;
Li, Hongzhe .
BIOINFORMATICS, 2012, 28 (16) :2106-2113
[5]   APPLICATIONS OF NEXT-GENERATION SEQUENCING The human microbiome: at the interface of health and disease [J].
Cho, Ilseung ;
Blaser, Martin J. .
NATURE REVIEWS GENETICS, 2012, 13 (04) :260-270
[6]  
Davies R. B., 1980, J ROYAL STAT SOC C, V29, P323, DOI DOI 10.2307/2346911
[7]   Computing the distribution of quadratic forms: Further comparisons between the Liu-Tang-Zhang approximation and exact methods [J].
Duchesne, Pierre ;
De Micheaux, Pierre Lafaye .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (04) :858-862
[8]   Exome sequencing of extreme phenotypes identifies DCTN4 as a modifier of chronic Pseudomonas aeruginosa infection in cystic fibrosis [J].
Emond, Mary J. ;
Louie, Tin ;
Emerson, Julia ;
Zhao, Wei ;
Mathias, Rasika A. ;
Knowles, Michael R. ;
Wright, Fred A. ;
Rieder, Mark J. ;
Tabor, Holly K. ;
Nickerson, Deborah A. ;
Barnes, Kathleen C. ;
Go, Lung ;
Gibson, Ronald L. ;
Bamshad, Michael J. .
NATURE GENETICS, 2012, 44 (08) :886-+
[9]   Sequence Kernel Association Tests for the Combined Effect of Rare and Common Variants [J].
Ionita-Laza, Iuliana ;
Lee, Seunggeun ;
Makarov, Vlad ;
Buxbaum, Joseph D. ;
Lin, Xihong .
AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 92 (06) :841-853
[10]   A powerful and flexible multilocus association test for quantitative traits [J].
Kwee, Lydia Coulter ;
Liu, Dawei ;
Lin, Xihong ;
Ghosh, Debashis ;
Epstein, Michael P. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (02) :386-397