Model-free Estimation of Recent Genetic Relatedness

Cited by: 243
Authors
Conomos, Matthew P. [1]
Reiner, Alexander P. [2, 3]
Weir, Bruce S. [1]
Thornton, Timothy A. [1]
Affiliations
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Washington, Dept Epidemiol, Seattle, WA 98195 USA
[3] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
Keywords
POPULATION STRATIFICATION; ASSOCIATION; ANCESTRY; INFERENCE; HERITABILITY; COEFFICIENT; SELECTION; IDENTITY; COMMON; TOOL;
DOI
10.1016/j.ajhg.2015.11.022
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007 ; 090102 ;
Abstract
Genealogical inference from genetic data is essential for a variety of applications in human genetics. In genome-wide and sequencing association studies, for example, accurate inference on both recent genetic relatedness, such as family structure, and more distant genetic relatedness, such as population structure, is necessary for protection against spurious associations. Distinguishing familial relatedness from population structure with genotype data, however, is difficult because both manifest as genetic similarity through the sharing of alleles. Existing approaches for inference on recent genetic relatedness have limitations in the presence of population structure, where they either (1) make strong and simplifying assumptions about population structure, which are often untenable, or (2) require correct specification of and appropriate reference population panels for the ancestries in the sample, which might be unknown or not well defined. Here, we propose PC-Relate, a model-free approach for estimating commonly used measures of recent genetic relatedness, such as kinship coefficients and IBD sharing probabilities, in the presence of unspecified structure. PC-Relate uses principal components calculated from genome-screen data to partition genetic correlations among sampled individuals due to the sharing of recent ancestors and more distant common ancestry into two separate components, without requiring specification of the ancestral populations or reference population panels. In simulation studies with population structure, including admixture, we demonstrate that PC-Relate provides accurate estimates of genetic relatedness and improved relationship classification over widely used approaches. We further demonstrate the utility of PC-Relate in applications to three ancestrally diverse samples that vary in both size and genealogical complexity.
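The partitioning step described in the abstract can be illustrated with a minimal sketch. It assumes a simplified form of the estimator: each SNP is regressed on the supplied principal components to obtain individual-specific allele frequencies, and kinship is then estimated from the products of the ancestry-adjusted residuals. The function name `pc_relate_kinship_sketch`, the `eps` clipping bound, and the use of clipping in place of per-pair SNP filtering are assumptions made for illustration, not the authors' implementation; the principal components are taken as given.

```python
import numpy as np


def pc_relate_kinship_sketch(genotypes, pcs, eps=0.01):
    """Simplified PC-adjusted kinship estimate (illustrative sketch only).

    genotypes: (n_individuals, n_snps) array of 0/1/2 allele counts.
    pcs:       (n_individuals, n_pcs) array of ancestry-representative PCs.
    eps:       hypothetical bound that keeps fitted frequencies inside (0, 1).
    Returns an (n_individuals, n_individuals) matrix of kinship estimates.
    """
    g = np.asarray(genotypes, dtype=float)
    pcs = np.asarray(pcs, dtype=float)
    n = g.shape[0]

    # Design matrix: intercept plus the principal components.
    design = np.column_stack([np.ones(n), pcs])

    # Ordinary least squares fit of every SNP on the PCs at once.
    beta, *_ = np.linalg.lstsq(design, g, rcond=None)

    # Individual-specific allele frequencies: half the fitted genotype.
    mu = np.clip(design @ beta / 2.0, eps, 1.0 - eps)

    # Residual genotypes after removing the ancestry-predicted part.
    resid = g - 2.0 * mu

    # Per-individual, per-SNP binomial standard deviation scale.
    sd = np.sqrt(mu * (1.0 - mu))

    # Ratio of summed residual products to summed scaling terms.
    return (resid @ resid.T) / (4.0 * (sd @ sd.T))
```

Under these assumptions, a duplicated sample (or monozygotic twin pair) should give an off-diagonal estimate near 0.5 and unrelated pairs values near 0, even when the two individuals come from populations with different allele frequencies. If the principal components are themselves computed from a sample containing close relatives, they can absorb family structure and shrink the estimates, so the choice of PCs matters.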
Pages: 127-148
Page count: 22