A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data

被引:0
|
作者
Arabmakki, Elaheh [1 ]
Kantardzic, Mehmed [1 ]
Sethi, Tegjyot Singh [1 ]
机构
[1] Univ Louisville, Dept Comp Engn & Comp Sci, Louisville, KY 40203 USA
关键词
data stream; multi-class; concept drift; imbalance; partial labeling; EXTREME LEARNING-MACHINE; SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data streams are found in many real world applications such as spam email detection, and internet traffic data. The classification of such data is challenging, since data stream usually changes, and the model should be updated to maintain the performance. However, obtaining the true labels of the samples to build a new model is not easy, since labeling is expensive and time consuming. Additionally, existence of the multiple and imbalanced classes may cause to lose performance over one class while trying to gain on another. In this paper, we propose RLS-Multi (Reduced Labeled Samples-Multiple class) which is a classification framework for the multi-class and evolving imbalanced data stream. RLS-Multi handles the data with multiple classes, and it uses a small fraction of the data to update the model. RLS-Multi is compared with McELM, and VWOS-ELM which are two fully labeling approaches for classification of the imbalanced and multi-class data stream. The experimental results show that the performance of the RLS-Multi is not significantly different from the two other techniques, requiring only up to 25% of the samples to label for majority of the data sets, on average.
引用
收藏
页码:1018 / 1025
页数:8
相关论文
共 50 条
  • [21] SCALA: Scaling algorithm for multi-class imbalanced classification A novel algorithm specifically designed for multi-class multiple minority imbalanced data problems.
    Barzinji, Ala O.
    Ma, Jixin
    Ma, Chaoying
    PROCEEDINGS OF 2023 8TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2023, 2023, : 68 - 73
  • [22] Boosting methods for multi-class imbalanced data classification: an experimental review
    Jafar Tanha
    Yousef Abdi
    Negin Samadi
    Nazila Razzaghi
    Mohammad Asadpour
    Journal of Big Data, 7
  • [23] Boosting methods for multi-class imbalanced data classification: an experimental review
    Tanha, Jafar
    Abdi, Yousef
    Samadi, Negin
    Razzaghi, Nazila
    Asadpour, Mohammad
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [24] Improved multi-class classification approach for imbalanced big data on spark
    Tinku Singh
    Riya Khanna
    Manish Satakshi
    The Journal of Supercomputing, 2023, 79 : 6583 - 6611
  • [25] Concept Drift Detection from Multi-Class Imbalanced Data Streams
    Korycki, Lukasz
    Krawczyk, Bartosz
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1068 - 1079
  • [26] Improved multi-class classification approach for imbalanced big data on spark
    Singh, Tinku
    Khanna, Riya
    Satakshi
    Kumar, Manish
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06): : 6583 - 6611
  • [27] OAHO: an effective algorithm for multi-class learning from imbalanced data
    Murphey, Yi L.
    Wang, Haoxing
    Ou, Guobin
    Feldkamp, Lee A.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 406 - +
  • [28] A new data complexity measure for multi-class imbalanced classification tasks
    Han, Mingming
    Guo, Husheng
    Wang, Wenjian
    PATTERN RECOGNITION, 2025, 157
  • [29] Parameter-free classification in multi-class imbalanced data sets
    Cerf, Loic
    Gay, Dominique
    Selmaoui-Folcher, Nazha
    Cremilleux, Bruno
    Boulicaut, Jean-Francois
    DATA & KNOWLEDGE ENGINEERING, 2013, 87 : 109 - 129
  • [30] Multi-class Imbalanced Data Oversampling for Vertebral Column Pathologies Classification
    Saez, Jose A.
    Quintian, Hector
    Krawczyk, Bartosz
    Wozniak, Michal
    Corchado, Emilio
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 131 - 142