A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data

被引:0
作者
Arabmakki, Elaheh [1 ]
Kantardzic, Mehmed [1 ]
Sethi, Tegjyot Singh [1 ]
机构
[1] Univ Louisville, Dept Comp Engn & Comp Sci, Louisville, KY 40203 USA
来源
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2017年
关键词
data stream; multi-class; concept drift; imbalance; partial labeling; EXTREME LEARNING-MACHINE; SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data streams are found in many real world applications such as spam email detection, and internet traffic data. The classification of such data is challenging, since data stream usually changes, and the model should be updated to maintain the performance. However, obtaining the true labels of the samples to build a new model is not easy, since labeling is expensive and time consuming. Additionally, existence of the multiple and imbalanced classes may cause to lose performance over one class while trying to gain on another. In this paper, we propose RLS-Multi (Reduced Labeled Samples-Multiple class) which is a classification framework for the multi-class and evolving imbalanced data stream. RLS-Multi handles the data with multiple classes, and it uses a small fraction of the data to update the model. RLS-Multi is compared with McELM, and VWOS-ELM which are two fully labeling approaches for classification of the imbalanced and multi-class data stream. The experimental results show that the performance of the RLS-Multi is not significantly different from the two other techniques, requiring only up to 25% of the samples to label for majority of the data sets, on average.
引用
收藏
页码:1018 / 1025
页数:8
相关论文
共 26 条
[1]   To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques [J].
Abdi, Lida ;
Hashemi, Sattar .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) :238-251
[2]  
Arabmakki E., 2016, REDUCED LABELED SAMP
[3]   Ensemble Classifier for Imbalanced Streaming Data Using Partial Labeling [J].
Arabmakki, Elaheh ;
Kantardzic, Mehmed ;
Sethi, Tegjyot Singh .
PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, :257-260
[4]  
Arabmakki E, 2014, 2014 IEEE 15TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), P779, DOI 10.1109/IRI.2014.7051968
[5]  
Asuncion A., 2015, UCI machine learning repository
[6]  
Bishop C.M., 2006, Machine Learning, V128
[7]   Classifying evolving data streams with partially labeled data [J].
Borchani, Hanen ;
Larranaga, Pedro ;
Bielza, Concha .
INTELLIGENT DATA ANALYSIS, 2011, 15 (05) :655-670
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[10]   A Survey on Concept Drift Adaptation [J].
Gama, Joao ;
Zliobaite, Indre ;
Bifet, Albert ;
Pechenizkiy, Mykola ;
Bouchachia, Abdelhamid .
ACM COMPUTING SURVEYS, 2014, 46 (04)