A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data

被引:0
|
作者
Arabmakki, Elaheh [1 ]
Kantardzic, Mehmed [1 ]
Sethi, Tegjyot Singh [1 ]
机构
[1] Univ Louisville, Dept Comp Engn & Comp Sci, Louisville, KY 40203 USA
关键词
data stream; multi-class; concept drift; imbalance; partial labeling; EXTREME LEARNING-MACHINE; SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data streams are found in many real world applications such as spam email detection, and internet traffic data. The classification of such data is challenging, since data stream usually changes, and the model should be updated to maintain the performance. However, obtaining the true labels of the samples to build a new model is not easy, since labeling is expensive and time consuming. Additionally, existence of the multiple and imbalanced classes may cause to lose performance over one class while trying to gain on another. In this paper, we propose RLS-Multi (Reduced Labeled Samples-Multiple class) which is a classification framework for the multi-class and evolving imbalanced data stream. RLS-Multi handles the data with multiple classes, and it uses a small fraction of the data to update the model. RLS-Multi is compared with McELM, and VWOS-ELM which are two fully labeling approaches for classification of the imbalanced and multi-class data stream. The experimental results show that the performance of the RLS-Multi is not significantly different from the two other techniques, requiring only up to 25% of the samples to label for majority of the data sets, on average.
引用
收藏
页码:1018 / 1025
页数:8
相关论文
共 50 条
  • [41] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Mateusz Lango
    Jerzy Stefanowski
    Journal of Intelligent Information Systems, 2018, 50 : 97 - 127
  • [42] A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
    Quan, Daying
    Feng, Wei
    Dauphin, Gabriel
    Wang, Xiaofeng
    Huang, Wenjiang
    Xing, Mengdao
    REMOTE SENSING, 2022, 14 (15)
  • [43] Multi-class random forest model to classify wastewater treatment imbalanced data
    Distefano, Veronica
    Palma, Monica
    De Iaco, Sandra
    SOCIO-ECONOMIC PLANNING SCIENCES, 2024, 95
  • [44] Global-local information based oversampling for multi-class imbalanced data
    Han, Mingming
    Guo, Husheng
    Li, Jinyan
    Wang, Wenjian
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (06) : 2071 - 2086
  • [45] Accurate and efficient sequential ensemble learning for highly imbalanced multi-class data
    Vong, Chi-Man
    Du, Jie
    NEURAL NETWORKS, 2020, 128 : 268 - 278
  • [46] Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm
    Du L.-M.
    Xu Y.
    Zhu H.
    Ann. Data Sci., 3 (293-300): : 293 - 300
  • [47] Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Lango, Mateusz
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (01) : 97 - 127
  • [48] GDHS: An efficient hybrid sampling method for multi-class imbalanced data classification
    Yan, Yuanting
    Lv, Yan
    Han, Shuangyue
    Yu, Chengjin
    Zhou, Peng
    Neurocomputing, 2025, 637
  • [49] Sentiment Classification from Multi-class Imbalanced Twitter Data Using Binarization
    Krawczyk, Bartosz
    McInnes, Bridget T.
    Cano, Alberto
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 26 - 37
  • [50] Global-local information based oversampling for multi-class imbalanced data
    Mingming Han
    Husheng Guo
    Jinyan Li
    Wenjian Wang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2071 - 2086