Bayesian Model Selection, the Marginal Likelihood, and Generalization

Cited by: 0

Authors:

Lotfi, Sanae [1]

Izmailov, Pavel [1]

Benton, Gregory [1]

Goldblum, Micah [1]

Wilson, Andrew Gordon [1]

Affiliation:

[1] NYU, New York, NY 10003 USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022

Keywords:

CHOICE;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

How do we compare hypotheses that are entirely consistent with observations? The marginal likelihood (aka Bayesian evidence), which represents the probability of generating our observations from a prior, provides a distinctive approach to this foundational question, automatically encoding Occam's razor. Although it has been observed that the marginal likelihood can overfit and is sensitive to prior assumptions, its limitations for hyperparameter learning and discrete model comparison have not been thoroughly investigated. We first revisit the appealing properties of the marginal likelihood for learning constraints and hypothesis testing. We then highlight the conceptual and practical issues in using the marginal likelihood as a proxy for generalization. Namely, we show how the marginal likelihood can be negatively correlated with generalization, with implications for neural architecture search, and can lead to both underfitting and overfitting in hyperparameter learning. We provide a partial remedy through a conditional marginal likelihood, which we show is more aligned with generalization and practically valuable for large-scale hyperparameter learning, such as in deep kernel learning.
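
The sensitivity to prior assumptions mentioned in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy model (observations x_i ~ N(mu, noise_var) with prior mu ~ N(0, prior_var)), the data values, and the function names. The sketch uses the chain rule, log p(x_1..x_n) = sum_i log p(x_i | x_1..x_{i-1}), and reads the conditional marginal likelihood as the sum of those terms after the first m observations; the abstract does not spell out the definition, so that reading is also an assumption here.

```python
import math


def log_normal_pdf(x, mean, var):
    """Log density of N(mean, var) evaluated at x."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def sequential_log_predictives(data, prior_var, noise_var):
    """Return [log p(x_i | x_1..x_{i-1})] for the toy conjugate Gaussian model.

    Assumed model (illustrative only): x_i ~ N(mu, noise_var), mu ~ N(0, prior_var).
    """
    post_mean, post_var = 0.0, prior_var
    terms = []
    for x in data:
        # Predictive for the next point: uncertainty about mu plus observation noise.
        terms.append(log_normal_pdf(x, post_mean, post_var + noise_var))
        # Conjugate Gaussian update of the belief about mu after seeing x.
        new_var = 1.0 / (1.0 / post_var + 1.0 / noise_var)
        post_mean = new_var * (post_mean / post_var + x / noise_var)
        post_var = new_var
    return terms


def log_marginal_likelihood(data, prior_var, noise_var):
    """Log evidence as the sum of all sequential predictive terms (chain rule)."""
    return sum(sequential_log_predictives(data, prior_var, noise_var))


def conditional_log_marginal_likelihood(data, prior_var, noise_var, m):
    """Log evidence of data[m:] after conditioning on the first m observations."""
    return sum(sequential_log_predictives(data, prior_var, noise_var)[m:])


# Hypothetical data and settings, chosen only to make the effect visible.
data = [2.1, 1.9, 2.0, 2.2, 1.8, 2.05]
noise_var = 0.04

for prior_var in (4.0, 1e6):
    lml = log_marginal_likelihood(data, prior_var, noise_var)
    clml = conditional_log_marginal_likelihood(data, prior_var, noise_var, m=3)
    print(f"prior_var={prior_var:g}  log ML={lml:.3f}  conditional log ML (m=3)={clml:.3f}")
```

In this toy setup, widening the prior lowers the log marginal likelihood by several nats, almost entirely through the first predictive term, while the conditional values for the two priors nearly coincide because both posteriors are dominated by the data after a few observations.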


Pages: 25
