Bayesian Model Selection, the Marginal Likelihood, and Generalization

被引:0
|
作者
Lotfi, Sanae [1 ]
Izmailov, Pavel [1 ]
Benton, Gregory [1 ]
Goldblum, Micah [1 ]
Wilson, Andrew Gordon [1 ]
机构
[1] NYU, New York, NY 10003 USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022年
关键词
CHOICE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
How do we compare between hypotheses that are entirely consistent with observations? The marginal likelihood (aka Bayesian evidence), which represents the probability of generating our observations from a prior, provides a distinctive approach to this foundational question, automatically encoding Occam's razor. Although it has been observed that the marginal likelihood can overfit and is sensitive to prior assumptions, its limitations for hyperparameter learning and discrete model comparison have not been thoroughly investigated. We first revisit the appealing properties of the marginal likelihood for learning constraints and hypothesis testing. We then highlight the conceptual and practical issues in using the marginal likelihood as a proxy for generalization. Namely, we show how marginal likelihood can be negatively correlated with generalization, with implications for neural architecture search, and can lead to both underfitting and overfitting in hyperparameter learning. We provide a partial remedy through a conditional marginal likelihood, which we show is more aligned with generalization, and practically valuable for large-scale hyperparameter learning, such as in deep kernel learning.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Marginal Likelihood Computation for Model Selection and Hypothesis Testing: An Extensive Review
    Llorente, F.
    Martino, L.
    Delgado, D.
    Lopez-Santiago, J.
    SIAM REVIEW, 2023, 65 (01) : 3 - 58
  • [2] A guide to Bayesian model selection for ecologists
    Hooten, M. B.
    Hobbs, N. T.
    ECOLOGICAL MONOGRAPHS, 2015, 85 (01) : 3 - 28
  • [3] Posterior Predictive Bayesian Phylogenetic Model Selection
    Lewis, Paul O.
    Xie, Wangang
    Chen, Ming-Hui
    Fan, Yu
    Kuo, Lynn
    SYSTEMATIC BIOLOGY, 2014, 63 (03) : 309 - 321
  • [4] Pseudo-likelihood-based Bayesian information criterion for variable selection in survey data
    Xu, Chen
    Chen, Jiahua
    Mantel, Harold
    SURVEY METHODOLOGY, 2013, 39 (02) : 303 - 321
  • [5] Comparison of Bayesian predictive methods for model selection
    Piironen, Juho
    Vehtari, Aki
    STATISTICS AND COMPUTING, 2017, 27 (03) : 711 - 735
  • [6] Evaluating extensions to LCDM: an application of Bayesian model averaging and selection
    Paradiso, S.
    McGee, G.
    Percival, W. J.
    JOURNAL OF COSMOLOGY AND ASTROPARTICLE PHYSICS, 2024, (10):
  • [7] Bayesian model selection for spatial capture-recapture models
    Dey, Soumen
    Delampady, Mohan
    Gopalaswamy, Arjun M.
    ECOLOGY AND EVOLUTION, 2019, 9 (20): : 11569 - 11583
  • [8] The hydrologist's guide to Bayesian model selection, averaging and combination
    Hoege, M.
    Guthke, A.
    Nowak, W.
    JOURNAL OF HYDROLOGY, 2019, 572 : 96 - 107
  • [9] Bayesian Model Selection for Incomplete Data Using the Posterior Predictive Distribution
    Daniels, Michael J.
    Chatterjee, Arkendu S.
    Wang, Chenguang
    BIOMETRICS, 2012, 68 (04) : 1055 - 1063
  • [10] Objective Bayesian Model Selection in Generalized Additive Models With Penalized Splines
    Bove, Daniel Sabanes
    Held, Leonhard
    Kauermann, Goeran
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2015, 24 (02) : 394 - 415