Best linear unbiased allele-frequency estimation in complex pedigrees

Cited by: 1
Authors
McPeek, MS
Wu, XD
Ober, C
Affiliations
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
Keywords
allele-frequency estimation; BLUE; BLUP; complex pedigree; quasi-likelihood
DOI
Not available
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Many types of genetic analyses depend on estimates of allele frequencies. We consider the problem of allele-frequency estimation based on data from related individuals. The motivation for this work is data collected on the Hutterites, an isolated founder population, so we focus particularly on the case in which the relationships among the sampled individuals are specified by a large, complex pedigree for which maximum likelihood estimation is impractical. For this case, we propose to use the best linear unbiased estimator (BLUE) of allele frequency. We derive this estimator, which is equivalent to the quasi-likelihood estimator for this problem, and we describe an efficient algorithm for computing the estimate and its variance. We show that our estimator has certain desirable small-sample properties in common with the maximum likelihood estimator (MLE) for this problem. We treat both the case when parental origin of each allele is known and when it is unknown. The results are extended to prediction of allele frequency in some set of individuals S based on genotype data collected on a set of individuals R. We compare the mean-squared error of the BLUE, the commonly used naive estimator (sample frequency) and the MLE when the latter is feasible to calculate. The results indicate that although the MLE performs the best of the three, the BLUE is close in performance to the MLE and is substantially easier to calculate, making it particularly useful for large complex pedigrees in which MLE calculation is impractical or infeasible. We apply our method to allele-frequency estimation in a Hutterite data set.
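The abstract does not state the estimator's formula, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It uses the standard generalized least squares form of a BLUE for a common mean, (1' K^-1 1)^-1 1' K^-1 y, where y holds each individual's fraction of copies of the allele (0, 0.5, or 1). The matrix `k` is an assumption: it is taken here to be a known matrix proportional to the covariance of y, with 1 on the diagonal for outbred individuals and twice the kinship coefficient off the diagonal. The function names and the toy data (two full siblings plus one unrelated individual) are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def blue_allele_frequency(y, k):
    """Return the GLS estimate of a common mean and the weights it places on y.

    y: per-individual allele fractions (0, 0.5, or 1).
    k: assumed known matrix proportional to Cov(y), derived from the pedigree.
    """
    w = solve(k, [1.0] * len(y))          # K^-1 1
    total = sum(w)                        # 1' K^-1 1
    weights = [wi / total for wi in w]    # normalized so the weights sum to 1
    estimate = sum(wi * yi for wi, yi in zip(weights, y))
    return estimate, weights


# Toy example (hypothetical data): individuals 0 and 1 are full siblings
# (kinship 1/4, so the off-diagonal entry is 0.5); individual 2 is unrelated.
k = [[1.0, 0.5, 0.0],
     [0.5, 1.0, 0.0],
     [0.0, 0.0, 1.0]]
y = [1.0, 0.5, 0.0]

estimate, weights = blue_allele_frequency(y, k)
naive = sum(y) / len(y)  # sample frequency, which ignores relatedness
```

In this toy case the two siblings carry partly redundant information, so each receives a smaller weight than the unrelated individual, and the weighted estimate differs from the sample frequency.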
Pages: 359-367
Page count: 9