Conditional vs marginal estimation of the predictive loss of hierarchical models using WAIC and cross-validation

Cited by: 29
Authors
Millar, Russell B. [1]
Affiliations
[1] Univ Auckland, Dept Stat, Private Bag 92019, Auckland, New Zealand
Keywords
Cross-validation; Hierarchical model; Importance sampling; Leave-one-out; Marginalized likelihood; Model comparison; Over-dispersed count data; Pointwise predictive loss; WAIC; CRITERIA; HABITAT; CHOICE;
DOI
10.1007/s11222-017-9736-8
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
The predictive loss of Bayesian models can be estimated using a sample from the full-data posterior by evaluating the Watanabe-Akaike information criterion (WAIC) or using an importance sampling (ISCVL) approximation to leave-one-out cross-validation loss. With hierarchical models the loss can be specified at different levels of the hierarchy, and in the published literature, it is routine for these estimators to use the conditional likelihood provided by the lowest level of model hierarchy. However, the regularity conditions underlying these estimators may not hold at this level, and the behaviour of conditional-level WAIC as an estimator of conditional-level predictive loss must be determined on a case-by-case basis. Conditional-level ISCVL does not target conditional-level predictive loss and instead is an estimator of marginal-level predictive loss. Using examples for analysis of over-dispersed count data, it is shown that conditional-level WAIC does not provide a reliable estimator of its target loss, and simulations show that it can favour the incorrect model. Moreover, conditional-level ISCVL is numerically unstable compared to marginal-level ISCVL. It is recommended that WAIC and ISCVL be evaluated using the marginalized likelihood where practicable and that the reliability of these estimators always be checked using appropriate diagnostics.
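The two estimators named in the abstract can both be computed from a matrix of pointwise log-likelihood values evaluated at draws from the full-data posterior. The following is a minimal sketch of that calculation, not the paper's own code. The function names, the `loglik[s][i]` layout, the factor of -2 (deviance scale), and the use of the posterior variance of the log-likelihood as the WAIC penalty are assumptions made for illustration. Whether the result is a conditional-level or a marginal-level estimate depends only on whether the supplied log-likelihoods come from the conditional likelihood or from the marginalized likelihood.

```python
import math
from statistics import variance


def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))


def waic(loglik):
    """WAIC on the deviance scale (assumed convention).

    loglik[s][i] is the log-likelihood of observation i under posterior draw s.
    """
    n_obs = len(loglik[0])
    total = 0.0
    for i in range(n_obs):
        column = [row[i] for row in loglik]
        lppd_i = log_mean_exp(column)   # log pointwise predictive density
        penalty_i = variance(column)    # sample variance across draws
        total += lppd_i - penalty_i
    return -2.0 * total


def iscvl(loglik):
    """Importance sampling approximation to leave-one-out loss.

    Uses raw importance weights 1 / p(y_i | theta_s), so the estimate of
    p(y_i | y_-i) is the harmonic mean of the pointwise likelihoods.
    """
    n_obs = len(loglik[0])
    total = 0.0
    for i in range(n_obs):
        negated = [-row[i] for row in loglik]
        total += -log_mean_exp(negated)  # log of the harmonic mean
    return -2.0 * total
```

Because the importance weights are reciprocals of likelihood values, a few draws with very small likelihood can dominate the harmonic mean. This sketch applies no weight diagnostics or smoothing, so its output should be checked before use.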
Pages: 375-385
Page count: 11