Bagging predictors

被引:6435
作者
Breiman, L
机构
[1] Statistics Department, University of California, Berkeley
关键词
aggregation; bootstrap; averaging; combining;
D O I
10.1007/bf00058655
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bagging predictors is a method for generating multiple versions of a predictor and using these to gel an aggregated predictor. The aggregation averages over the versions when predicting a numerical outcome and does a plurality vote when predicting a class. The multiple versions are formed by making bootstrap replicates of the learning set and using these as new learning sets. Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy. The vital element is the instability of the prediction method. If perturbing the learning set can cause significant changes in the predictor constructed, then bagging can improve accuracy.
引用
收藏
页码:123 / 140
页数:18
相关论文
共 16 条
[11]  
KWOK S, 1990, MULTIPLE DECISION TR, V4, P327
[12]  
Michie D., 1994, Technometrics, V37, P459, DOI DOI 10.2307/1269742
[13]  
OLSHEN RA, 1985, P BERKELEY C HONOR J, P245
[14]  
SIGILLITO VG, 1989, J HOPKINS APL TECH D, V10, P262
[15]  
SMITH JW, 1988, 12TH P ANN S COMP AP, P261
[16]   MULTISURFACE METHOD OF PATTERN SEPARATION FOR MEDICAL DIAGNOSIS APPLIED TO BREAST CYTOLOGY [J].
WOLBERG, WH ;
MANGASARIAN, OL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (23) :9193-9196