A Nonparametric Regression Approach to Control for Population Stratification in Rare Variant Association Studies

被引:8
作者
Sha, Qiuying [1 ]
Zhang, Kui [1 ]
Zhang, Shuanglin [1 ]
机构
[1] Michigan Technol Univ, Dept Math Sci, Houghton, MI 49931 USA
基金
美国国家卫生研究院;
关键词
FAST-LMM-SELECT; DETECTING ASSOCIATION; PRINCIPAL-COMPONENTS; SPATIAL STRUCTURE; COMMON VARIANTS; GENOMIC CONTROL; SEQUENCE; DISEASES; GENES; TESTS;
D O I
10.1038/srep37444
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recently, there is increasing interest to detect associations between rare variants and complex traits. Rare variant association studies usually need large sample sizes due to the rarity of the variants, and large sample sizes typically require combining information from different geographic locations within and across countries. Although several statistical methods have been developed to control for population stratification in common variant association studies, these methods are not necessarily controlling for population stratification in rare variant association studies. Thus, new statistical methods that can control for population stratification in rare variant association studies are needed. In this article, we propose a principal component based nonparametric regression (PC-nonp) approach to control for population stratification in rare variant association studies. Our simulations show that the proposed PC-nonp can control for population stratification well in all scenarios, while existing methods cannot control for population stratification at least in some scenarios. Simulations also show that PC-nonp's robustness to population stratification will not reduce power. Furthermore, we illustrate our proposed method by using whole genome sequencing data from genetic analysis workshop 18 (GAW18).
引用
收藏
页数:12
相关论文
共 42 条
[1]   A METHOD FOR QUANTIFYING DIFFERENTIATION BETWEEN POPULATIONS AT MULTI-ALLELIC LOCI AND ITS IMPLICATIONS FOR INVESTIGATING IDENTITY AND PATERNITY [J].
BALDING, DJ ;
NICHOLS, RA .
GENETICA, 1995, 96 (1-2) :3-12
[2]   Robust and Powerful Tests for Rare Variants Using Fisher's Method to Combine Evidence of Association From Two or More Complementary Tests [J].
Derkach, Andriy ;
Lawless, Jerry F. ;
Sun, Lei .
GENETIC EPIDEMIOLOGY, 2013, 37 (01) :110-121
[3]   Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[4]   Genomic control, a new approach to genetic-based association studies [J].
Devlin, B ;
Roeder, K ;
Wasserman, L .
THEORETICAL POPULATION BIOLOGY, 2001, 60 (03) :155-166
[5]   Adapting to unknown smoothness via wavelet shrinkage [J].
Donoho, DL ;
Johnstone, IM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1200-1224
[6]   A Permutation Procedure to Correct for Confounders in Case-Control Studies, Including Tests of Rare Variation [J].
Epstein, Michael P. ;
Duncan, Richard ;
Jiang, Yunxuan ;
Conneely, Karen N. ;
Allen, Andrew S. ;
Satten, Glen A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (02) :215-223
[7]  
FAN J, 1993, THE ANNALS OF STATIS, P196
[8]   Local linear estimation in partly linear models [J].
Hamilton, SA ;
Truong, YK .
JOURNAL OF MULTIVARIATE ANALYSIS, 1997, 60 (01) :1-19
[9]   A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants [J].
Han, Fang ;
Pan, Wei .
HUMAN HEREDITY, 2010, 70 (01) :42-54
[10]  
Hart J., 2013, Nonparametric Smoothing and Lack-Of-Fit Tests