PAC-Bayes risk bounds for stochastic averages and majority votes of sample-compressed classifiers

被引：0

作者：

Laviolette, Francois ^{[1
]}

Marchand, Mario ^{[1
]}

机构：

[1] Univ Laval, Dept IFT GLO, Quebec City, PQ G1K 7P4, Canada

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2007年 / 8卷

关键词：

PAC-Bayes; risk bounds; sample-compression; set covering machines; decision list machines;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a PAC-Bayes theorem for the sample-compression setting where each classifier is described by a compression subset of the training data and a message string of additional information. This setting, which is the appropriate one to describe many learning algorithms, strictly generalizes the usual data-independent setting where classifiers are represented only by data-independent message strings (or parameters taken from a continuous set). The proposed PAC-Bayes theorem for the sample-compression setting reduces to the PAC-Bayes theorem of Seeger (2002) and Langford (2005) when the compression subset of each classifier vanishes. For posteriors having all their weights on a single sample-compressed classifier, the general risk bound reduces to a bound similar to the tight sample-compression bound proposed in Laviolette et al. (2005). Finally, we extend our results to the case where each sample-compressed classifier of a data-dependent ensemble may abstain of predicting a class label.

引用

页码：1461 / 1487

页数：27

共 26 条

[1]

[Anonymous], J MACHINE LEARNING R

[2]

[Anonymous], 2000, Pattern Classification

[3] Bagging predictors [J].

Breiman, L .

MACHINE LEARNING, 1996, 24 (02) :123-140

[4]

CATONI O, 2004, PAC BAYESIAN APPROAC

[5]

Floyd S, 1995, MACH LEARN, V21, P269

[6] A decision-theoretic generalization of on-line learning and an application to boosting [J].

Freund, Y ;

Schapire, RE .

JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139

[7]

GERMAIN P, 2004, ADV NEURAL INFORM PR, V19

[8] PAC-Bayesian compression bounds on the prediction error of learning algorithms for classification [J].

Graepel, T ;

Herbrich, R .

MACHINE LEARNING, 2005, 59 (1-2) :55-76

[9] Bayes point machines [J].

Herbich, R ;

Graepel, T ;

Campbell, C .

JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (04) :245-279

[10]

LACASSE A, 2007, ADV NEURAL INFORM PR, V19

← 1 2 3 →