A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data

被引:0
|
作者
Arabmakki, Elaheh [1 ]
Kantardzic, Mehmed [1 ]
Sethi, Tegjyot Singh [1 ]
机构
[1] Univ Louisville, Dept Comp Engn & Comp Sci, Louisville, KY 40203 USA
关键词
data stream; multi-class; concept drift; imbalance; partial labeling; EXTREME LEARNING-MACHINE; SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data streams are found in many real world applications such as spam email detection, and internet traffic data. The classification of such data is challenging, since data stream usually changes, and the model should be updated to maintain the performance. However, obtaining the true labels of the samples to build a new model is not easy, since labeling is expensive and time consuming. Additionally, existence of the multiple and imbalanced classes may cause to lose performance over one class while trying to gain on another. In this paper, we propose RLS-Multi (Reduced Labeled Samples-Multiple class) which is a classification framework for the multi-class and evolving imbalanced data stream. RLS-Multi handles the data with multiple classes, and it uses a small fraction of the data to update the model. RLS-Multi is compared with McELM, and VWOS-ELM which are two fully labeling approaches for classification of the imbalanced and multi-class data stream. The experimental results show that the performance of the RLS-Multi is not significantly different from the two other techniques, requiring only up to 25% of the samples to label for majority of the data sets, on average.
引用
收藏
页码:1018 / 1025
页数:8
相关论文
共 50 条
  • [31] An oversampling method for multi-class imbalanced data based on composite weights
    Deng, Mingyang
    Guo, Yingshi
    Wang, Chang
    Wu, Fuwei
    PLOS ONE, 2021, 16 (11):
  • [32] Performance Analysis of Binarization Strategies for Multi-class Imbalanced Data Classification
    Zak, Michal
    Wozniak, Michal
    COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 141 - 155
  • [33] Online active learning method for multi-class imbalanced data stream
    Ang Li
    Meng Han
    Dongliang Mu
    Zhihui Gao
    Shujuan Liu
    Knowledge and Information Systems, 2024, 66 : 2355 - 2391
  • [34] Online active learning method for multi-class imbalanced data stream
    Li, Ang
    Han, Meng
    Mu, Dongliang
    Gao, Zhihui
    Liu, Shujuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (04) : 2355 - 2391
  • [35] An online ensemble classification algorithm for multi-class imbalanced data stream
    Han, Meng
    Li, Chunpeng
    Meng, Fanxing
    He, Feifei
    Zhang, Ruihua
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (11) : 6845 - 6880
  • [36] An Effective Recursive Technique for Multi-Class Classification and Regression for Imbalanced Data
    Alam, Tahira
    Ahmed, Chowdhury Farhan
    Zahin, Sabit Anwar
    Khan, Muhammad Asif Hossain
    Islam, Maliha Tashfia
    IEEE ACCESS, 2019, 7 : 127615 - 127630
  • [37] Optimizing Multi-Class Text Classification Models for Imbalanced News Data
    Anitha, S.
    Kavi Varshini, E.
    Haritha Mahalakshmi, N.
    Jishnu, S.
    2024 15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024, 2024,
  • [38] An Effective Ensemble Method for Multi-class Classification and Regression for Imbalanced Data
    Alam, Tahira
    Ahmed, Chowdhury Farhan
    Zahin, Sabit Anwar
    Khan, Muhammad Asif Hossain
    Islam, Maliha Tashfia
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 59 - 74
  • [39] MULTI-CLASS DATA CLASSIFICATION FOR IMBALANCED DATA SET USING COMBINED SAMPLING APPROACHES
    Prachuabsupakij, Wanthanee
    Snonthornphisaj, Nuanwan
    KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 166 - 171
  • [40] Sequential Multi-Class Labeling in Crowdsourcing
    Kang, Qiyu
    Tay, Wee Peng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2190 - 2199