Metric-based methods for adaptive model selection and regularization

被引:17
|
作者
Schuurmans, D [1 ]
Southey, F [1 ]
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
model selection; regularization; unlabeled examples;
D O I
10.1023/A:1013947519741
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea is to impose a metric structure on hypotheses by determining the discrepancy between their predictions across the distribution of unlabeled data. We show how this metric can be used to detect untrustworthy training error estimates, and devise novel model selection strategies that exhibit theoretical guarantees against over-fitting (while still avoiding under-fitting). We then extend the approach to derive a general training criterion for supervised learning-yielding an adaptive regularization method that uses unlabeled data to automatically set regularization parameters. This new criterion adjusts its regularization level to the specific set of training data received, and performs well on a variety of regression and conditional density estimation tasks. The only proviso for these methods is that sufficient unlabeled training data be available.
引用
收藏
页码:51 / 84
页数:34
相关论文
共 50 条
  • [1] Metric-Based Methods for Adaptive Model Selection and Regularization
    Dale Schuurmans
    Finnegan Southey
    Machine Learning, 2002, 48 : 51 - 84
  • [2] Metric-based model selection for time-series forecasting
    Bengio, Y
    Chapados, N
    NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 13 - 22
  • [3] Image Segmentation Metric-Based Adaptive Method
    Berersky, Oleh
    Pitsun, Oleh
    Batryn, Natalia
    Bererska, Kateryna
    Savka, Nadiya
    Dolynyuk, Taras
    2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 554 - 557
  • [4] Fast Supervised Selection of Prototypes for Metric-Based Learning
    Belanche, Lluis A.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 577 - 586
  • [5] Metric-based upscaling
    Owhadi, Houman
    Zhang, Lei
    COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2007, 60 (05) : 675 - 723
  • [6] Nonnegative matrix factorization with Wasserstein metric-based regularization for enhanced text embedding
    Li, Mingming
    Wang, Xingjie
    Li, Chunhua
    Zeng, Anping
    PLOS ONE, 2024, 19 (12):
  • [7] Aligned Metric-Based Anisotropic Solution Adaptive Mesh Generation
    Marcum, David
    Alauzet, Frederic
    23RD INTERNATIONAL MESHING ROUNDTABLE (IMR23), 2014, 82 : 428 - 444
  • [8] A Local Geometrical Metric-based Model for Polyp Classification
    Cao, Weiguo
    Pomeroy, Marc J.
    Pickhardt, Perry J.
    Barich, Matthew A.
    Stanly, Samuel, III
    Liang, Zhengrong
    MEDICAL IMAGING 2019: COMPUTER-AIDED DIAGNOSIS, 2019, 10950
  • [9] An Empirical Study of Metric-Based Methods to Detect Obfuscated Code
    Visaggio, Corrado Aaron
    Pagin, Giuseppe Antonio
    Canfora, Gerardo
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2013, 7 (02): : 59 - 73
  • [10] A Metric-Based Evaluation Model for Applications on Mobile Phone
    Hussain, Azham
    Kutar, Maria
    Kamal, Fazillah Mohmad
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2012, 2012, : 720 - +