Toward a diagnostic toolkit for linear models with Gaussian-process distributed random effects

Cited by: 5
Authors
Bose, Maitreyee [1]
Hodges, James S. [2]
Banerjee, Sudipto [3]
Affiliations
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Minnesota, Div Biostat, Minneapolis, MN 55455 USA
[3] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA);
Keywords
Added variable plot; Gaussian process; Lack of fit; Linear mixed model; Missing predictor; Spectral approximation;
DOI
10.1111/biom.12848
Chinese Library Classification (CLC) number
Q [Biological sciences];
Discipline classification codes
07; 0710; 09;
Abstract
Gaussian processes (GPs) are widely used as distributions of random effects in linear mixed models, which are fit using the restricted likelihood or the closely related Bayesian analysis. This article addresses two problems. First, we propose tools for understanding how data determine estimates in these models, using a spectral basis approximation to the GP under which the restricted likelihood is formally identical to the likelihood for a gamma-errors GLM with identity link. Second, to examine the data's support for a covariate and to understand how adding that covariate moves variation in the outcome y out of the GP and error parts of the fit, we apply a linear-model diagnostic, the added variable plot (AVP), both to the original observations and to projections of the data onto the spectral basis functions. The spectral- and observation-domain AVPs estimate the same coefficient for a covariate but emphasize low- and high-frequency data features, respectively, and thus highlight the covariate's effect on the GP and error parts of the fit. The spectral approximation applies to data observed on a regular grid; for data observed at irregular locations, we propose smoothing the data to a grid before applying our methods. The methods are illustrated using the forest-biomass data of Finley et al. (2008).
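As a rough illustration of the added variable plot mentioned in the abstract, the sketch below covers only the ordinary linear-model version of the diagnostic, not the article's spectral-domain variant or its GP model. It residualizes the outcome and a candidate covariate on the predictors already in the model, then checks that the slope through those residuals equals the covariate's coefficient in the full least-squares fit. The simulated data and the helper name `added_variable_residuals` are assumptions made for this example and do not come from the article.

```python
import numpy as np

def added_variable_residuals(y, X, z):
    """Return residuals of y and of z after least-squares regression on X.

    Plotting the first against the second gives the added variable plot
    for the candidate covariate z (hypothetical helper, illustration only).
    """
    beta_y, *_ = np.linalg.lstsq(X, y, rcond=None)
    beta_z, *_ = np.linalg.lstsq(X, z, rcond=None)
    return y - X @ beta_y, z - X @ beta_z

# Simulated data (assumed for illustration; not the forest-biomass data).
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=n)])  # intercept + one predictor
z = 0.5 * X[:, 1] + rng.normal(size=n)                 # candidate covariate
y = X @ np.array([1.0, 2.0]) + 0.7 * z + rng.normal(scale=0.3, size=n)

# AVP coordinates and the slope of the no-intercept line through them.
e_y, e_z = added_variable_residuals(y, X, z)
avp_slope = (e_z @ e_y) / (e_z @ e_z)

# Coefficient of z in the full regression of y on [X, z].
full_coef, *_ = np.linalg.lstsq(np.column_stack([X, z]), y, rcond=None)
```

Here `avp_slope` and `full_coef[-1]` agree up to floating-point error, which is the standard property that lets the plot be read as a picture of the data's support for the covariate.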
Pages: 863-873
Page count: 11