Bayesian criterion-based variable selection

被引:8
作者
Maity, Arnab Kumar [1 ]
Basu, Sanjib [2 ]
Ghosh, Santu [3 ]
机构
[1] Pfizer Inc, San Diego, CA USA
[2] Univ Illinois, Chicago, IL 60607 USA
[3] Augusta Univ, Augusta, GA USA
关键词
deviance information criterion; g‐ prior; highest posterior model; mis‐ selection; DEVIANCE INFORMATION CRITERION; NONLOCAL PRIOR DENSITIES; MARGINAL LIKELIHOOD; MODEL SELECTION; CROSS-VALIDATION; WEIGHT-GAIN; G-PRIORS; COMPUTATION; CONSISTENCY; MIXTURES;
D O I
10.1111/rssc.12488
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Bayesian approaches for criterion based selection include the marginal likelihood based highest posterior model (HPM) and the deviance information criterion (DIC). The DIC is popular in practice as it can often be estimated from sampling-based methods with relative ease and DIC is readily available in various Bayesian software. We find that sensitivity of DIC-based selection can be high, in the range of 90-100%. However, correct selection by DIC can be in the range of 0-2%. These performances persist consistently with increase in sample size. We establish that both marginal likelihood and DIC asymptotically disfavour under-fitted models, explaining the high sensitivities of both criteria. However, mis-selection probability of DIC remains bounded below by a positive constant in linear models with g-priors whereas mis-selection probability by marginal likelihood converges to 0 under certain conditions. A consequence of our results is that not only the DIC cannot asymptotically differentiate between the data-generating and an over-fitted model, but, in fact, it cannot asymptotically differentiate between two over-fitted models as well. We illustrate these results in multiple simulation studies and in a biomarker selection problem on cancer cachexia of non-small cell lung cancer patients. We further study the performances of HPM and DIC in generalized linear model as practitioners often choose to use DIC that is readily available in software in such non-conjugate settings.
引用
收藏
页码:835 / 857
页数:23
相关论文
共 63 条
[1]   Model selection for Bayesian linear mixed models with longitudinal data: Sensitivity to the choice of priors [J].
Ariyo, Oludare ;
Lesaffre, Emmanuel ;
Verbeke, Geert ;
Quintero, Adrian .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (04) :1591-1615
[2]   Bayesian model selection in linear mixed models for longitudinal data [J].
Ariyo, Oludare ;
Quintero, Adrian ;
Munoz, Johanna ;
Verbeke, Geert ;
Lesaffre, Emmanuel .
JOURNAL OF APPLIED STATISTICS, 2020, 47 (05) :890-913
[3]   Optimal predictive model selection [J].
Barbieri, MM ;
Berger, JO .
ANNALS OF STATISTICS, 2004, 32 (03) :870-897
[4]   Default Bayesian analysis with global-local shrinkage priors [J].
Bhadra, Anindya ;
Datta, Jyotishka ;
Polson, Nicholas G. ;
Willard, Brandon .
BIOMETRIKA, 2016, 103 (04) :955-969
[5]   Practical and theoretical implications of weight gain in advanced non-small cell lung cancer patients [J].
Bonomi, Philip ;
Batus, Marta ;
Fidler, Mary Jo ;
Borgia, Jeffrey A. .
ANNALS OF TRANSLATIONAL MEDICINE, 2017, 5 (06)
[6]   The horseshoe estimator for sparse signals [J].
Carvalho, Carlos M. ;
Polson, Nicholas G. ;
Scott, James G. .
BIOMETRIKA, 2010, 97 (02) :465-480
[7]   Objective Bayesian variable selection [J].
Casella, G ;
Moreno, E .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :157-167
[8]   CONSISTENCY OF BAYESIAN PROCEDURES FOR VARIABLE SELECTION [J].
Casella, George ;
Giron, F. Javier ;
Martinez, M. Lina ;
Moreno, Elias .
ANNALS OF STATISTICS, 2009, 37 (03) :1207-1228
[9]  
Celeux G, 2006, BAYESIAN ANAL, V1, P651, DOI 10.1214/06-BA122
[10]  
Chan Joshua C. C., 2009, International Journal of Mathematical Modelling and Numerical Optimisation, V1, P101, DOI 10.1504/IJMMNO.2009.030090