A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data

被引:0
|
作者
Arabmakki, Elaheh [1 ]
Kantardzic, Mehmed [1 ]
Sethi, Tegjyot Singh [1 ]
机构
[1] Univ Louisville, Dept Comp Engn & Comp Sci, Louisville, KY 40203 USA
关键词
data stream; multi-class; concept drift; imbalance; partial labeling; EXTREME LEARNING-MACHINE; SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data streams are found in many real world applications such as spam email detection, and internet traffic data. The classification of such data is challenging, since data stream usually changes, and the model should be updated to maintain the performance. However, obtaining the true labels of the samples to build a new model is not easy, since labeling is expensive and time consuming. Additionally, existence of the multiple and imbalanced classes may cause to lose performance over one class while trying to gain on another. In this paper, we propose RLS-Multi (Reduced Labeled Samples-Multiple class) which is a classification framework for the multi-class and evolving imbalanced data stream. RLS-Multi handles the data with multiple classes, and it uses a small fraction of the data to update the model. RLS-Multi is compared with McELM, and VWOS-ELM which are two fully labeling approaches for classification of the imbalanced and multi-class data stream. The experimental results show that the performance of the RLS-Multi is not significantly different from the two other techniques, requiring only up to 25% of the samples to label for majority of the data sets, on average.
引用
收藏
页码:1018 / 1025
页数:8
相关论文
共 50 条
  • [1] A Dynamic Sampling Framework for Multi-Class Imbalanced Data
    Debowski, B.
    Areibi, S.
    Grewal, G.
    Tempelman, J.
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 113 - 118
  • [2] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [3] Multi-class WHMBoost: An ensemble algorithm for multi-class imbalanced data
    Zhao, Jiakun
    Jin, Ju
    Zhang, Yibo
    Zhang, Ruifeng
    Chen, Si
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 599 - 614
  • [4] Evaluating Difficulty of Multi-class Imbalanced Data
    Lango, Mateusz
    Napierala, Krystyna
    Stefanowski, Jerzy
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 312 - 322
  • [5] Ensemble Classifier for Imbalanced Streaming Data Using Partial Labeling
    Arabmakki, Elaheh
    Kantardzic, Mehmed
    Sethi, Tegjyot Singh
    PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 257 - 260
  • [6] Survey on Highly Imbalanced Multi-class Data
    Hamid, Hakim Abdul
    Yusoff, Marina
    Mohamed, Azlinah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 211 - 229
  • [7] Fast Learning and Testing for Imbalanced Multi-Class Changes in Streaming Data by Dynamic Multi-Stratum Network
    Thakong, Mongkhon
    Phimoltares, Suphakant
    Jaiyen, Saichon
    Lursinsap, Chidchanok
    IEEE ACCESS, 2017, 5 : 10633 - 10648
  • [8] An Algorithm for Selective Preprocessing of Multi-class Imbalanced Data
    Wojciechowski, Szymon
    Wilk, Szymon
    Stefanowski, Jerzy
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 238 - 247
  • [9] A survey of multi-class imbalanced data classification methods
    Han, Meng
    Li, Ang
    Gao, Zhihui
    Mu, Dongliang
    Liu, Shujuan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2471 - 2501
  • [10] Multi-class imbalanced big data classification on Spark
    Sleeman, William C.
    Krawczyk, Bartosz
    KNOWLEDGE-BASED SYSTEMS, 2021, 212