Scalable Ensemble Learning by Adaptive Sampling

被引:6
作者
Chen, Jianhua [1 ]
机构
[1] Louisiana State Univ, Div Comp Sci & Engn, Sch Elect Engn & Comp Sci, Baton Rouge, LA 70803 USA
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1 | 2012年
关键词
Scalable Learning; Ensemble Learning; Adaptive Sampling; Sample Size; Boosting;
D O I
10.1109/ICMLA.2012.115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scalability has become an increasingly critical problem for successful data mining and knowledge discovery applications in real world where we often encounter extremely huge data sets that will render the traditional learning algorithms infeasible. Among various approaches to scalable learning, sampling techniques can be exploited to address the issue of scalability. This paper presents a brief outline on how to utilize the new sampling method in [3] to develop a scalable ensemble learning method with Boosting. Preliminary experimental results using benchmark data sets from the UC-Irvine ML data repository are also presented confirming the efficiency and competitive prediction accuracy of the proposed adaptive boosting method.
引用
收藏
页码:622 / 625
页数:4
相关论文
共 13 条
[1]  
BALCAN MF, 2008, P 21 ANN C LEARN THE
[2]  
Chen J., 2011, P INT S METH INT SYS
[3]  
Chen X., ARXIV08091241V20MATH
[4]   A MEASURE OF ASYMPTOTIC EFFICIENCY FOR TESTS OF A HYPOTHESIS BASED ON THE SUM OF OBSERVATIONS [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :493-507
[5]  
Domingo C, 2000, LECT NOTES ARTIF INT, V1805, P317
[6]  
Domingo C., 1999, P 2 INT C DISC SCI D
[7]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[9]   QUERY SIZE ESTIMATION BY ADAPTIVE SAMPLING [J].
LIPTON, RJ ;
NAUGHTON, JF .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1995, 51 (01) :18-25
[10]   EFFICIENT SAMPLING STRATEGIES FOR RELATIONAL DATABASE OPERATIONS [J].
LIPTON, RJ ;
NAUGHTON, JF ;
SCHNEIDER, DA ;
SESHADRI, S .
THEORETICAL COMPUTER SCIENCE, 1993, 116 (01) :195-226