Model averaging and muddled multimodel inferences

被引:521
作者
Cade, Brian S. [1 ]
机构
[1] US Geol Survey, Ft Collins, CO 80526 USA
关键词
generalized linear models; Greater Sage-Grouse; model averaging; multicollinearity; multimodel inference; partial effects; partial standard deviations; regression coefficients; relative importance of predictors; species distribution models; zero-truncated Poisson regression; SPECIES DISTRIBUTION MODELS; RELATIVE IMPORTANCE; LINEAR-REGRESSION; SAGE-GROUSE; P VALUES; SELECTION; ECOLOGY; BIOGEOGRAPHY; COLLINEARITY; PREDICTION;
D O I
10.1890/14-1639.1
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Three flawed practices associated with model averaging coefficients for predictor variables in regression models commonly occur when making multimodel inferences in analyses of ecological data. Model-averaged regression coefficients based on Akaike information criterion (AIC) weights have been recommended for addressing model uncertainty but they are not valid, interpretable estimates of partial effects for individual predictors when there is multicollinearity among the predictor variables. Multicollinearity implies that the scaling of units in the denominators of the regression coefficients may change across models such that neither the parameters nor their estimates have common scales, therefore averaging them makes no sense. The associated sums of AIC model weights recommended to assess relative importance of individual predictors are really a measure of relative importance of models, with little information about contributions by individual predictors compared to other measures of relative importance based on effects size or variance reduction. Sometimes the model-averaged regression coefficients for predictor variables are incorrectly used to make model-averaged predictions of the response variable when the models are not linear in the parameters. I demonstrate the issues with the first two practices using the college grade point average example extensively analyzed by Burnham and Anderson. I show how partial standard deviations of the predictor variables can be used to detect changing scales of their estimates with multicollinearity. Standardizing estimates based on partial standard deviations for their variables can be used to make the scaling of the estimates commensurate across models, a necessary but not sufficient condition for model averaging of the estimates to be sensible. A unimodal distribution of estimates and valid interpretation of individual parameters are additional requisite conditions. The standardized estimates or equivalently the t statistics on unstandardized estimates also can be used to provide more informative measures of relative importance than sums of AIC weights. Finally, I illustrate how seriously compromised statistical interpretations and predictions can be for all three of these flawed practices by critiquing their use in a recent species distribution modeling technique developed for predicting Greater Sage-Grouse (Centrocercus urophasianus) distribution in Colorado, USA. These model averaging issues are common in other ecological literature and ought to be discontinued if we are to make effective scientific contributions to ecological knowledge and conservation of natural resources.
引用
收藏
页码:2370 / 2382
页数:13
相关论文
共 48 条
[21]  
Franklin J., 2009, Mapping species distributions
[22]   Species distribution models in conservation biogeography: developments and challenges [J].
Franklin, Janet .
DIVERSITY AND DISTRIBUTIONS, 2013, 19 (10) :1217-1223
[23]   Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error [J].
Freckleton, Robert P. .
BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY, 2011, 65 (01) :91-101
[24]   PARTIAL TIME REGRESSIONS AS COMPARED WITH INDIVIDUAL TRENDS [J].
Frisch, Ragnar ;
Waugh, Frederick V. .
ECONOMETRICA, 1933, 1 (04) :387-401
[25]   Ecologists overestimate the importance of predictor variables in model averaging: a plea for cautious interpretations [J].
Galipaud, Matthias ;
Gillingham, Mark A. F. ;
David, Morgan ;
Dechaume-Moncharmont, Francois-Xavier .
METHODS IN ECOLOGY AND EVOLUTION, 2014, 5 (10) :983-991
[26]   Spending degrees of freedom in a poor economy: A case study of building a sightability model for moose in northeastern minnesota [J].
Giudice, John H. ;
Fieberg, John R. ;
Lenarz, Mark S. .
JOURNAL OF WILDLIFE MANAGEMENT, 2012, 76 (01) :75-87
[27]   Confronting multicollinearity in ecological multiple regression [J].
Graham, MH .
ECOLOGY, 2003, 84 (11) :2809-2815
[28]   Estimators of relative importance in linear regression based on variance decomposition [J].
Groemping, Ulrike .
AMERICAN STATISTICIAN, 2007, 61 (02) :139-147
[29]   Multimodel inference in ecology and evolution: challenges and solutions [J].
Grueber, C. E. ;
Nakagawa, S. ;
Laws, R. J. ;
Jamieson, I. G. .
JOURNAL OF EVOLUTIONARY BIOLOGY, 2011, 24 (04) :699-711
[30]  
Harrell FE, 2001, REGRESSION MODELING, DOI DOI 10.1007/978-1-4757-3462-1