Model-free Estimation of Recent Genetic Relatedness

Cited by: 243
Authors
Conomos, Matthew P. [1]
Reiner, Alexander P. [2, 3]
Weir, Bruce S. [1]
Thornton, Timothy A. [1]
Affiliations
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Washington, Dept Epidemiol, Seattle, WA 98195 USA
[3] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
Keywords
POPULATION STRATIFICATION; ASSOCIATION; ANCESTRY; INFERENCE; HERITABILITY; COEFFICIENT; SELECTION; IDENTITY; COMMON; TOOL
DOI
10.1016/j.ajhg.2015.11.022
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Genealogical inference from genetic data is essential for a variety of applications in human genetics. In genome-wide and sequencing association studies, for example, accurate inference on both recent genetic relatedness, such as family structure, and more distant genetic relatedness, such as population structure, is necessary for protection against spurious associations. Distinguishing familial relatedness from population structure with genotype data, however, is difficult because both manifest as genetic similarity through the sharing of alleles. Existing approaches for inference on recent genetic relatedness have limitations in the presence of population structure, where they either (1) make strong and simplifying assumptions about population structure, which are often untenable, or (2) require correct specification of and appropriate reference population panels for the ancestries in the sample, which might be unknown or not well defined. Here, we propose PC-Relate, a model-free approach for estimating commonly used measures of recent genetic relatedness, such as kinship coefficients and IBD sharing probabilities, in the presence of unspecified structure. PC-Relate uses principal components calculated from genome-screen data to partition genetic correlations among sampled individuals due to the sharing of recent ancestors and more distant common ancestry into two separate components, without requiring specification of the ancestral populations or reference population panels. In simulation studies with population structure, including admixture, we demonstrate that PC-Relate provides accurate estimates of genetic relatedness and improved relationship classification over widely used approaches. We further demonstrate the utility of PC-Relate in applications to three ancestrally diverse samples that vary in both size and genealogical complexity.
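The partitioning idea described in the abstract can be illustrated with a minimal sketch. It assumes the kinship estimate is formed by regressing each SNP on the principal components to obtain individual-specific allele frequencies, then correlating the residual genotypes. The function name `pc_relate_kinship`, the `eps` parameter, and the clipping of fitted frequencies are illustrative assumptions, not the authors' implementation, and the exact estimator used by PC-Relate may differ in detail.

```python
import numpy as np


def pc_relate_kinship(genotypes, pcs, eps=1e-3):
    """Hypothetical sketch of a PC-adjusted kinship estimate.

    genotypes: (n_individuals, n_snps) array of 0/1/2 allele counts.
    pcs:       (n_individuals, n_pcs) array of ancestry-representative PCs.
    eps:       assumed bound keeping fitted frequencies inside (0, 1).
    """
    g = np.asarray(genotypes, dtype=float)
    n = g.shape[0]

    # Design matrix: an intercept column followed by the PCs.
    x = np.column_stack([np.ones(n), np.asarray(pcs, dtype=float)])

    # Ordinary least-squares fit of every SNP on the PCs in one call.
    beta, *_ = np.linalg.lstsq(x, g, rcond=None)

    # Individual-specific allele frequency: half the fitted genotype.
    # Clipping is a simplification made here to avoid invalid frequencies.
    mu = np.clip(x @ beta / 2.0, eps, 1.0 - eps)

    # Residual genotypes carry the sharing not explained by the PCs.
    resid = g - 2.0 * mu
    sd = np.sqrt(mu * (1.0 - mu))

    # Pairwise ratio of summed residual products to summed scale terms.
    return (resid @ resid.T) / (4.0 * (sd @ sd.T))
```

Under these assumptions, the returned matrix holds one estimate per pair of individuals. Values near 0 suggest no recent relatedness beyond what the PCs explain, and a parent-offspring pair is expected to come out near 0.25.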
Pages: 127-148
Page count: 22