An Adaptive Ensemble Classifier for Mining Complex Noisy Instances in Data Streams

被引:0
作者
Karim, Md Rejaul [1 ]
Farid, Dewan Md [1 ]
机构
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
来源
2014 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) | 2014年
关键词
Data streams; decision tree; ensemble classifier; multi-class classification; noisy data; single classifier;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time data streams classification is a challenging data mining task. In real-time streaming environments concepts of instances might change at any time such as weather predictions, astronomical and intrusion detection etc. To address this issue, we present an adaptive ensemble classifier for data streams classification, which uses a set of decision trees for mining complex noisy instances in data streams. The ensemble model updates automatically so that it represents the most recent concepts in data streams. In each iteration, the ensemble model generates a new training data from original training dataset, then builds a decision tree using new training data and assigns a weight to the tree based on its classification accuracy on original training instances. Also it updates the weight of training instances in training dataset. We tested the performance of the proposed ensemble classifier against that of existing C4.5 decision tree classifier using real benchmark datasets from UCI (University of California, Irvine) machine learning repository. The experimental results prove that the proposed ensemble classifier shows great flexibility and robustness in data streams classification.
引用
收藏
页数:4
相关论文
共 19 条
[1]   Evolving Fuzzy-Rule-Based Classifiers From Data Streams [J].
Angelov, Plamen P. ;
Zhou, Xiaowei .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2008, 16 (06) :1462-1475
[2]  
[Anonymous], 2014, C4. 5: programs for machine learning
[3]  
[Anonymous], 2009, WEKA DATA MINING SOF
[4]  
[Anonymous], 2011, INT J COMPUTER APPL
[5]  
Biswas A., 2012, J COMPUTER SCI ENG, V14, P1
[6]   Interval set clustering [J].
Chen, Min ;
Miao, Duoqian .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) :2923-2932
[7]  
Farid Dewan Md, 2013, Journal of Advances in Information Technology, V4, P129, DOI 10.4304/jait.4.3.129-135
[8]   Hybrid decision tree and naive Bayes classifiers for multi-class classification tasks [J].
Farid, Dewan Md. ;
Zhang, Li ;
Rahman, Chowdhury Mofizur ;
Hossain, M. A. ;
Strachan, Rebecca .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) :1937-1946
[9]   An adaptive ensemble classifier for mining concept drifting data streams [J].
Farid, Dewan Md. ;
Zhang, Li ;
Hossain, Alamgir ;
Rahman, Chowdhury Mofizur ;
Strachan, Rebecca ;
Sexton, Graham ;
Dahal, Keshav .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) :5895-5906
[10]  
Farid DMd., 2013, Int. J. Data Min. Knowl. Manag. Process, V3, P13, DOI [10.5121/ijdkp.2013.3102, DOI 10.5121/IJDKP.2013.3102]