An Experimental and Theoretical Comparison of Model Selection Methods

被引:0
作者
Michael Kearns
Yishay Mansour
Andrew Y. Ng
Dana Ron
机构
[1] AT&T Laboratories Research,Department of Computer Science
[2] Tel Aviv University,Department of Computer Science
[3] Carnegie Mellon University,Laboratory of Computer Science
[4] MIT,undefined
来源
Machine Learning | 1997年 / 27卷
关键词
model selection; complexity regularization; cross validation; minimum description length principle; structural risk minimization; vc dimension;
D O I
暂无
中图分类号
学科分类号
摘要
We investigate the problem of model selection in the setting of supervised learning of boolean functions from independent random examples. More precisely, we compare methods for finding a balance between the complexity of the hypothesis chosen and its observed error on a random training sample of limited size, when the goal is that of minimizing the resulting generalization error. We undertake a detailed comparison of three well-known model selection methods — a variation of Vapnik's Guaranteed Risk Minimization (GRM), an instance of Rissanen's Minimum Description Length Principle (MDL), and (hold-out) cross validation (CV). We introduce a general class of model selection methods (called penalty-based methods) that includes both GRM and MDL, and provide general methods for analyzing such rules. We provide both controlled experimental evidence and formal theorems to support the following conclusions:
引用
收藏
页码:7 / 50
页数:43
相关论文
共 50 条
  • [1] An experimental and theoretical comparison of model selection methods
    Kearns, M
    Mansour, Y
    Ng, AY
    Ron, D
    MACHINE LEARNING, 1997, 27 (01) : 7 - 50
  • [2] A survey of Bayesian predictive methods for model assessment, selection and comparison
    Vehtari, Aki
    Ojanen, Janne
    STATISTICS SURVEYS, 2012, 6 : 142 - 228
  • [3] Comparison of Bayesian predictive methods for model selection
    Piironen, Juho
    Vehtari, Aki
    STATISTICS AND COMPUTING, 2017, 27 (03) : 711 - 735
  • [4] Comparison of Bayesian predictive methods for model selection
    Juho Piironen
    Aki Vehtari
    Statistics and Computing, 2017, 27 : 711 - 735
  • [5] Model selection for the North American Breeding Bird Survey: A comparison of methods
    Link, William A.
    Sauer, John R.
    Niven, Daniel K.
    CONDOR, 2017, 119 (03): : 546 - 556
  • [6] Comparison between different methods of model selection in cosmology
    Rezaei, Mehdi
    Malekjani, Mohammad
    EUROPEAN PHYSICAL JOURNAL PLUS, 2021, 136 (02)
  • [7] A note on the comparison of polynomial selection methods
    Viswanathan, M
    Wallace, C
    ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, 1999, : 169 - 177
  • [8] A Comprehensive Comparison of Model Selection Methods for Testing Factorial Invariance
    Liang, Xinya
    Luo, Yong
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2020, 27 (03) : 380 - 395
  • [9] IMPROVING MODEL SELECTION BY NONCONVERGENT METHODS
    FINNOFF, W
    HERGERT, F
    ZIMMERMANN, HG
    NEURAL NETWORKS, 1993, 6 (06) : 771 - 783
  • [10] Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression
    Wang, DL
    Zhang, WY
    Bakhai, A
    STATISTICS IN MEDICINE, 2004, 23 (22) : 3451 - 3467