PAC-Bayes risk bounds for stochastic averages and majority votes of sample-compressed classifiers

被引:0
作者
Laviolette, Francois [1 ]
Marchand, Mario [1 ]
机构
[1] Univ Laval, Dept IFT GLO, Quebec City, PQ G1K 7P4, Canada
关键词
PAC-Bayes; risk bounds; sample-compression; set covering machines; decision list machines;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a PAC-Bayes theorem for the sample-compression setting where each classifier is described by a compression subset of the training data and a message string of additional information. This setting, which is the appropriate one to describe many learning algorithms, strictly generalizes the usual data-independent setting where classifiers are represented only by data-independent message strings (or parameters taken from a continuous set). The proposed PAC-Bayes theorem for the sample-compression setting reduces to the PAC-Bayes theorem of Seeger (2002) and Langford (2005) when the compression subset of each classifier vanishes. For posteriors having all their weights on a single sample-compressed classifier, the general risk bound reduces to a bound similar to the tight sample-compression bound proposed in Laviolette et al. (2005). Finally, we extend our results to the case where each sample-compressed classifier of a data-dependent ensemble may abstain of predicting a class label.
引用
收藏
页码:1461 / 1487
页数:27
相关论文
共 26 条
[1]  
[Anonymous], J MACHINE LEARNING R
[2]  
[Anonymous], 2000, Pattern Classification
[3]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[4]  
CATONI O, 2004, PAC BAYESIAN APPROAC
[5]  
Floyd S, 1995, MACH LEARN, V21, P269
[6]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[7]  
GERMAIN P, 2004, ADV NEURAL INFORM PR, V19
[8]   PAC-Bayesian compression bounds on the prediction error of learning algorithms for classification [J].
Graepel, T ;
Herbrich, R .
MACHINE LEARNING, 2005, 59 (1-2) :55-76
[9]   Bayes point machines [J].
Herbich, R ;
Graepel, T ;
Campbell, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (04) :245-279
[10]  
LACASSE A, 2007, ADV NEURAL INFORM PR, V19