Considerations for assessing model averaging of regression coefficients

被引:83
作者
Banner, Katharine M. [1 ]
Higgs, Megan D. [1 ,2 ]
机构
[1] Montana State Univ, Dept Math Sci, Wilson Hall 2-214,POB 172400, Bozeman, MT 59717 USA
[2] Neptune & Co Inc, Bozeman, MT 59715 USA
基金
美国国家科学基金会;
关键词
Bayesian model averaging; explanatory inference; linear regression; model averaging; model selection; multimodel inference; predictive inference; MULTIMODEL INFERENCE; VARIABLE SELECTION; PREDICTION; TIME;
D O I
10.1002/eap.1419
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Model choice is usually an inevitable source of uncertainty in model-based statistical analyses. While the focus of model choice was traditionally on methods for choosing a single model, methods to formally account for multiple models within a single analysis are now accessible to many researchers. The specific technique of model averaging was developed to improve predictive ability by combining predictions from a set of models. However, it is now often used to average regression coefficients across multiple models with the ultimate goal of capturing a variable's overall effect. This use of model averaging implicitly assumes the same parameter exists across models so that averaging is sensible. While this assumption may initially seem tenable, regression coefficients associated with particular explanatory variables may not hold equivalent interpretations across all of the models in which they appear, making explanatory inference about covariates challenging. Accessibility to easily implementable software, concerns about being criticized for ignoring model uncertainty, and the chance to avoid having to justify choice of a final model have all led to the increasing popularity of model averaging in practice. We see a gap between the theoretical development of model averaging and its current use in practice, potentially leaving well-intentioned researchers with unclear inferences or difficulties justifying reasons for using (or not using) model averaging. We attempt to narrow this gap by revisiting some relevant foundations of regression modeling, suggesting more explicit notation and graphical tools, and discussing how individual model results are combined to obtain a model averaged result. Our goal is to help researchers make informed decisions about model averaging and to encourage question-focused modeling over-method-focused modeling.
引用
收藏
页码:78 / 93
页数:16
相关论文
共 50 条
  • [1] Albert J., 2014, LearnBayes: Functions for Learning Bayesian Inference
  • [2] [Anonymous], 1996, Bayesian Statistics 5: Proceedings of the Fifth Valencia International Meeting
  • [3] [Anonymous], BAS BAYESIAN ADAPTIV
  • [4] [Anonymous], 2008, Journal of Statistical Software, Code Snippets, DOI [10.18637/jss.v028.c01, DOI 10.18637/JSS.V028.C01]
  • [5] Truth, models, model sets, AIC, and multimodel inference: A Bayesian perspective
    Barker, Richard J.
    Link, William A.
    [J]. JOURNAL OF WILDLIFE MANAGEMENT, 2015, 79 (05) : 730 - 738
  • [6] Bayesian Multimodel Inference by RJMCMC: A Gibbs Sampling Approach
    Barker, Richard J.
    Link, William A.
    [J]. AMERICAN STATISTICIAN, 2013, 67 (03) : 150 - 156
  • [7] Brewer M., 2015, 10 TOP TIPS REVIEWIN
  • [8] Burnham K. P., 2002, A practical information-theoretic approach: model selection and multimodel inference
  • [9] Burnham K.P., 2015, Multimodel Inference: Understanding AIC Relative Variable Importance Values
  • [10] Model averaging and muddled multimodel inferences
    Cade, Brian S.
    [J]. ECOLOGY, 2015, 96 (09) : 2370 - 2382