Model-free Estimation of Recent Genetic Relatedness

被引:243
作者
Conomos, Matthew P. [1 ]
Reiner, Alexander P. [2 ,3 ]
Weir, Bruce S. [1 ]
Thornton, Timothy A. [1 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Washington, Dept Epidemiol, Seattle, WA 98195 USA
[3] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
关键词
POPULATION STRATIFICATION; ASSOCIATION; ANCESTRY; INFERENCE; HERITABILITY; COEFFICIENT; SELECTION; IDENTITY; COMMON; TOOL;
D O I
10.1016/j.ajhg.2015.11.022
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genealogical inference from genetic data is essential for a variety of applications in human genetics. In genome-wide and sequencing association studies, for example, accurate inference on both recent genetic relatedness, such as family structure, and more distant genetic relatedness, such as population structure, is necessary for protection against spurious associations. Distinguishing familial relatedness from population structure with genotype data, however, is difficult because both manifest as genetic similarity through the sharing of alleles. Existing approaches for inference on recent genetic relatedness have limitations in the presence of population structure, where they either (1) make strong and simplifying assumptions about population structure, which are often untenable, or (2) require correct specification of and appropriate reference population panels for the ancestries in the sample, which might be unknown or not well defined. Here, we propose PC-Relate, a model-free approach for estimating commonly used measures of recent genetic relatedness, such as kinship coefficients and IBD sharing probabilities, in the presence of unspecified structure. PC-Relate uses principal components calculated from genome-screen data to partition genetic correlations among sampled individuals due to the sharing of recent ancestors and more distant common ancestry into two separate components, without requiring specification of the ancestral populations or reference population panels. In simulation studies with population structure, including admixture, we demonstrate that PC-Relate provides accurate estimates of genetic relatedness and improved relationship classification over widely used approaches. We further demonstrate the utility of PC-Relate in applications to three ancestrally diverse samples that vary in both size and genealogical complexity.
引用
收藏
页码:127 / 148
页数:22
相关论文
共 47 条
[41]  
WRIGHT S, 1951, ANN EUGENIC, V15, P323
[42]   A Comparison of Association Methods Correcting for Population Stratification in Case-Control Studies [J].
Wu, Chengqing ;
DeWan, Andrew ;
Hoh, Josephine ;
Wang, Zuoheng .
ANNALS OF HUMAN GENETICS, 2011, 75 :418-427
[43]   Advantages and pitfalls in the application of mixed-model association methods [J].
Yang, Jian ;
Zaitlen, Noah A. ;
Goddard, Michael E. ;
Visscher, Peter M. ;
Price, Alkes L. .
NATURE GENETICS, 2014, 46 (02) :100-106
[44]   GCTA: A Tool for Genome-wide Complex Trait Analysis [J].
Yang, Jian ;
Lee, S. Hong ;
Goddard, Michael E. ;
Visscher, Peter M. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2011, 88 (01) :76-82
[45]   Common SNPs explain a large proportion of the heritability for human height [J].
Yang, Jian ;
Benyamin, Beben ;
McEvoy, Brian P. ;
Gordon, Scott ;
Henders, Anjali K. ;
Nyholt, Dale R. ;
Madden, Pamela A. ;
Heath, Andrew C. ;
Martin, Nicholas G. ;
Montgomery, Grant W. ;
Goddard, Michael E. ;
Visscher, Peter M. .
NATURE GENETICS, 2010, 42 (07) :565-U131
[46]  
Zheng X., 2015, THEOR POPUL BIOL
[47]   Genome-wide efficient mixed-model analysis for association studies [J].
Zhou, Xiang ;
Stephens, Matthew .
NATURE GENETICS, 2012, 44 (07) :821-U136