Model selection by bootstrap penalization for classification

被引:3
作者
Magalie Fromont
机构
[1] Université Rennes II,Laboratoire de Statistique, U.F.R. de Sciences Sociales–Département MASS
来源
Machine Learning | 2007年 / 66卷
关键词
Model selection; Classification; Bootstrap penalty; Exponential inequality; Oracle inequality; Minimax risk;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the binary classification problem. Given an i.i.d. sample drawn from the distribution of an χ×{0,1}−valued random pair, we propose to estimate the so-called Bayes classifier by minimizing the sum of the empirical classification error and a penalty term based on Efron’s or i.i.d. weighted bootstrap samples of the data. We obtain exponential inequalities for such bootstrap type penalties, which allow us to derive non-asymptotic properties for the corresponding estimators. In particular, we prove that these estimators achieve the global minimax risk over sets of functions built from Vapnik-Chervonenkis classes. The obtained results generalize Koltchinskii (2001) and Bartlett et al.’s (2002) ones for Rademacher penalties that can thus be seen as special examples of bootstrap type penalties. To illustrate this, we carry out an experimental study in which we compare the different methods for an intervals model selection problem.
引用
收藏
页码:165 / 207
页数:42
相关论文
共 50 条
[21]   On strong consistency of model selection in classification [J].
Suzuki, Joe .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (11) :4767-4774
[22]   Forecasting model selection using intermediate classification: Application to MonarchFx corporation [J].
Taghiyeh, Sajjad ;
Lengacher, David C. ;
Handfield, Robert B. .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 151
[23]   Model selection based on particle swarm optimization for omics data classification [J].
Xu, Zhao ;
Yang, Junshan .
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, :1334-1337
[24]   Model Parameters Selection for SVM Classification using Particle Swarm Optimization [J].
Hric, Martin ;
Chmulik, Michal ;
Jarina, Roman .
PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE - RADIOELEKTRONIKA 2011, 2011, :387-390
[25]   A Feature Selection and Classification Algorithm Based on Randomized Extraction of Model Populations [J].
Brankovic, Aida ;
Falsone, Alessandro ;
Prandini, Maria ;
Piroddi, Luigi .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (04) :1151-1162
[26]   Bootstrap techniques for sensitivity analysis and model selection in building thermal performance analysis [J].
Tian, Wei ;
Song, Jitian ;
Li, Zhanyong ;
de Wilde, Pieter .
APPLIED ENERGY, 2014, 135 :320-328
[27]   A CONCEPTUAL STUDY OF MODEL SELECTION IN CLASSIFICATION Multiple Local Models vs One Global Model [J].
Vilalta, R. ;
Ocegueda-Hernandez, F. ;
Bagaria, C. .
ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, :113-118
[28]   Variable Selection in Causal Inference using a Simultaneous Penalization Method [J].
Ertefaie, Ashkan ;
Asgharian, Masoud ;
Stephens, David A. .
JOURNAL OF CAUSAL INFERENCE, 2018, 6 (01)
[29]   Analysis of Classification Model and Feature Subset Selection [J].
Khan, Muhammad A. ;
Mirza, Anwar M. .
INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (10) :3325-3334
[30]   Model selection and assessment for classification using validation [J].
Jaworski, W .
ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 1, PROCEEDINGS, 2005, 3641 :481-490