A geometric approach to leveraging weak learners

被引:13
作者
Duffy, N [1 ]
Helmbold, D [1 ]
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
关键词
learning; classification; boosting; ensemble methods; gradient descent;
D O I
10.1016/S0304-3975(01)00083-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
AdaBoost is a popular and effective leveraging procedure for improving the hypotheses generated by weak learning algorithms. AdaBoost and many other leveraging algorithms can be viewed as performing a constrained gradient descent over a potential function. At each iteration the distribution over the sample given to the weak learner is proportional to the direction of steepest descent. We introduce a new leveraging algorithm based on a natural potential function. For this potential function, the direction of steepest descent can have negative components. Therefore, we provide two techniques for obtaining suitable distributions from these directions of steepest descent. The resulting algorithms have bounds that are incomparable to AdaBoost's. The analysis suggests that our algorithm is likely to perform better than AdaBoost on noisy data and with weak learners returning low confidence hypotheses. Modest experiments confirm that our algorithm can perform better than AdaBoost in these situations. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:67 / 108
页数:42
相关论文
共 38 条
[1]  
ABE N, 1991, PROCEEDINGS OF THE FOURTH ANNUAL WORKSHOP ON COMPUTATIONAL LEARNING THEORY, P277
[2]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[3]  
[Anonymous], CMUCS90100 SCH COMP
[4]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[5]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]   Prediction games and arcing algorithms [J].
Breiman, L .
NEURAL COMPUTATION, 1999, 11 (07) :1493-1517
[8]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[9]  
Breiman L, 1997, Technical Report 486
[10]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946