Metric-based methods for adaptive model selection and regularization

被引:17
作者
Schuurmans, D [1 ]
Southey, F [1 ]
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
model selection; regularization; unlabeled examples;
D O I
10.1023/A:1013947519741
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea is to impose a metric structure on hypotheses by determining the discrepancy between their predictions across the distribution of unlabeled data. We show how this metric can be used to detect untrustworthy training error estimates, and devise novel model selection strategies that exhibit theoretical guarantees against over-fitting (while still avoiding under-fitting). We then extend the approach to derive a general training criterion for supervised learning-yielding an adaptive regularization method that uses unlabeled data to automatically set regularization parameters. This new criterion adjusts its regularization level to the specific set of training data received, and performs well on a variety of regression and conditional density estimation tasks. The only proviso for these methods is that sufficient unlabeled training data be available.
引用
收藏
页码:51 / 84
页数:34
相关论文
共 50 条
  • [41] Least third-order cumulant method with adaptive regularization parameter selection for neural networks
    Leung, CT
    Chow, TWS
    ARTIFICIAL INTELLIGENCE, 2001, 127 (02) : 169 - 197
  • [42] The covariance inflation criterion for adaptive model selection
    Tibshirani, R
    Knight, K
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 : 529 - 546
  • [43] CAViaR Model Selection via Adaptive Lasso
    Cai, Zongwu
    Fang, Ying
    Tian, Dingshi
    JOURNAL OF TIME SERIES ANALYSIS, 2024,
  • [44] Regularization and optimization in model-based clustering
    Sampaio, Raphael Araujo
    Garcia, Joaquim Dias
    Poggi, Marcus
    Vidal, Thibaut
    PATTERN RECOGNITION, 2024, 150
  • [45] Adaptive estimation of linear functionals by model selection
    Laurent, Beatrice
    Ludena, Carenne
    Prieur, Clementine
    ELECTRONIC JOURNAL OF STATISTICS, 2008, 2 : 993 - 1020
  • [46] Adaptive estimation of a quadratic functional by model selection
    Laurent, B
    Massart, P
    ANNALS OF STATISTICS, 2000, 28 (05) : 1302 - 1338
  • [47] Adaptive tests of linear hypotheses by model selection
    Baraud, Y
    Huet, S
    Laurent, B
    ANNALS OF STATISTICS, 2003, 31 (01) : 225 - 251
  • [48] Comparison of the PDE-based regularization methods and a unifying framework
    Su, Bochao
    Zhang, Xiaohua
    Liu, Wanyu
    Li, Li
    2014 FOURTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2014, : 527 - 532
  • [49] On Re-weighting, Regularization Selection, and Transient in Nuclear Norm based
    Abdalmoaty, Mohamed
    Hjalmarsson, Hakan
    IFAC PAPERSONLINE, 2015, 48 (28): : 92 - 97
  • [50] Online Model-Selection and Learning for Nonlinear Estimation Based on Multikernel Adaptive Filtering
    Toda, Osamu
    Yukawa, Masahiro
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (01): : 236 - 250