One-class classifiers with incremental learning and forgetting for data streams with concept drift

Cited by: 67
Authors
Krawczyk, Bartosz [1]
Wozniak, Michal [1]
Affiliations
[1] Wroclaw Univ Technol, Dept Syst & Comp Networks, PL-50370 Wroclaw, Poland
Keywords
Pattern classification; One-class classification; Data stream classification; Concept drift; Incremental learning; Forgetting; Weighted majority; Ensembles
DOI
10.1007/s00500-014-1492-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One of the most important challenges for the machine learning community is to develop efficient classifiers that can cope with data streams, especially in the presence of so-called concept drift. This phenomenon changes the characteristics of the classification task and forces the learning model to adapt to the current state of the environment. There is a strong belief that one-class classification is a promising research direction for data stream analysis: it can be used for binary classification without access to counterexamples, for decomposing a multi-class data stream, for outlier detection, or for novel class recognition. This paper reports a novel modification of the weighted one-class support vector machine, adapted to the analysis of non-stationary streaming data. The proposed method can deal with gradual concept drift, because the introduced one-class classifier can adapt its decision boundary to new, incoming data and additionally employs a forgetting mechanism that improves its ability to follow changes in the underlying concept. In this work, we propose several strategies for incremental learning and forgetting, and we evaluate them on several real data streams. The results confirm the usability of the proposed classifier for data stream classification in the presence of concept drift. Additionally, the implemented forgetting mechanism ensures limited memory consumption, because only recent and valuable examples need to be stored.
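The interplay of incremental learning and forgetting described in the abstract can be illustrated with a small sketch. It is not the authors' method: instead of a weighted one-class support vector machine it uses a much simpler weighted hypersphere (weighted centroid plus radius), and the class name `ForgettingOneClassModel`, the parameters `decay` and `min_weight`, and the chunk-wise update rule are all assumptions made purely for illustration.

```python
import math


class ForgettingOneClassModel:
    """Toy weighted-hypersphere one-class model (a stand-in, not a WOCSVM)."""

    def __init__(self, decay=0.5, min_weight=0.2):
        self.decay = decay            # assumed: weight multiplier applied per new chunk
        self.min_weight = min_weight  # assumed: examples below this weight are forgotten
        self.examples = []            # stored (vector, weight) pairs
        self.center = None
        self.radius = 0.0

    def partial_fit(self, chunk):
        # Forgetting: age every stored example and drop the ones that became too light.
        aged = [(x, w * self.decay) for x, w in self.examples]
        self.examples = [(x, w) for x, w in aged if w >= self.min_weight]
        # Incremental learning: new target-class examples enter with full weight.
        self.examples.extend((tuple(x), 1.0) for x in chunk)
        self._refit()

    def _refit(self):
        if not self.examples:
            return
        total = sum(w for _, w in self.examples)
        dim = len(self.examples[0][0])
        # Weighted centroid: recent examples pull the boundary harder than old ones.
        self.center = tuple(
            sum(w * x[i] for x, w in self.examples) / total for i in range(dim)
        )
        # Radius covers every example that is still remembered.
        self.radius = max(math.dist(x, self.center) for x, _ in self.examples)

    def predict(self, x):
        # True means "belongs to the target class" under the current boundary.
        return math.dist(x, self.center) <= self.radius


# Usage: one chunk from an old concept, then chunks from a drifted concept.
old_chunk = [(0, 0), (1, 0), (0, 1), (1, 1)]
new_chunk = [(10, 10), (11, 10), (10, 11), (11, 11)]
model = ForgettingOneClassModel()
model.partial_fit(old_chunk)
for _ in range(3):
    model.partial_fit(new_chunk)
```

Because old examples lose weight with every chunk and are eventually discarded, the number of stored examples stays bounded, which mirrors the limited memory consumption claimed in the abstract. The actual forgetting strategies evaluated in the paper may differ from this geometric decay.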
Pages: 3387-3400
Page count: 14