Performance of penalized maximum likelihood in estimation of genetic covariances matrices

被引:8
作者
Meyer, Karin [1 ]
机构
[1] Univ New England, Anim Genet & Breeding Unit, Armidale, NSW 2351, Australia
关键词
VARIANCE COMPONENT ESTIMATION; REGRESSION; SHRINKAGE; SELECTION;
D O I
10.1186/1297-9686-43-39
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Background: Estimation of genetic covariance matrices for multivariate problems comprising more than a few traits is inherently problematic, since sampling variation increases dramatically with the number of traits. This paper investigates the efficacy of regularized estimation of covariance components in a maximum likelihood framework, imposing a penalty on the likelihood designed to reduce sampling variation. In particular, penalties that "borrow strength" from the phenotypic covariance matrix are considered. Methods: An extensive simulation study was carried out to investigate the reduction in average 'loss', i.e. the deviation in estimated matrices from the population values, and the accompanying bias for a range of parameter values and sample sizes. A number of penalties are examined, penalizing either the canonical eigenvalues or the genetic covariance or correlation matrices. In addition, several strategies to determine the amount of penalization to be applied, i.e. to estimate the appropriate tuning factor, are explored. Results: It is shown that substantial reductions in loss for estimates of genetic covariance can be achieved for small to moderate sample sizes. While no penalty performed best overall, penalizing the variance among the estimated canonical eigenvalues on the logarithmic scale or shrinking the genetic towards the phenotypic correlation matrix appeared most advantageous. Estimating the tuning factor using cross-validation resulted in a loss reduction 10 to 15% less than that obtained if population values were known. Applying a mild penalty, chosen so that the deviation in likelihood from the maximum was non-significant, performed as well if not better than cross-validation and can be recommended as a pragmatic strategy. Conclusions: Penalized maximum likelihood estimation provides the means to 'make the most' of limited and precious data and facilitates more stable estimation for multi-dimensional analyses. It should become part of our everyday toolkit for multivariate estimation in quantitative genetics.
引用
收藏
页数:15
相关论文
共 31 条
[1]  
CHEN CF, 1979, J ROY STAT SOC B, V41, P235
[2]  
CHEVERUD JM, 1988, EVOLUTION, V42, P958, DOI [10.2307/2408911, 10.1111/j.1558-5646.1988.tb02514.x]
[3]   Shrinkage estimators for covariance matrices [J].
Daniels, MJ ;
Kass, RE .
BIOMETRICS, 2001, 57 (04) :1173-1184
[4]  
Evans M, 2000, STAT DISTRIBUTIONS, P34
[5]  
Green P.J., 1998, Encyclopedia of Statistical Sciences, V2, P578
[6]  
HARVILLE DA, 1977, J AM STAT ASSOC, V72, P320, DOI 10.2307/2286796
[7]   MODIFICATION OF ESTIMATES OF PARAMETERS IN THE CONSTRUCTION OF GENETIC SELECTION INDEXES (BENDING) [J].
HAYES, JF ;
HILL, WG .
BIOMETRICS, 1981, 37 (03) :483-493
[8]   PROBABILITIES OF NON-POSITIVE DEFINITE BETWEEN-GROUP OR GENETIC COVARIANCE MATRICES [J].
HILL, WG ;
THOMPSON, R .
BIOMETRICS, 1978, 34 (03) :429-439
[9]   RIDGE REGRESSION - BIASED ESTIMATION FOR NONORTHOGONAL PROBLEMS [J].
HOERL, AE ;
KENNARD, RW .
TECHNOMETRICS, 1970, 12 (01) :55-&
[10]   Covariance matrix selection and estimation via penalised normal likelihood [J].
Huang, JZ ;
Liu, NP ;
Pourahmadi, M ;
Liu, LX .
BIOMETRIKA, 2006, 93 (01) :85-98