An Online Learning Algorithm for Non-stationary Imbalanced Data by Extra-Charging Minority Class

被引:3
|
作者
Siahroudi, Sajjad Kamali [1 ]
Kudenko, Daniel [1 ]
机构
[1] Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I | 2021年 / 12712卷
关键词
Online learning; Imbalanced data; Nonstationary data;
D O I
10.1007/978-3-030-75762-5_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online learning is one of the trending areas of machine learning in recent years. How to update the model based on new data is the core question in developing an online classifier. When new data arrives, the classifier should keep its model up-to-date by (1) learn new knowledge, (2) keep relevant learned knowledge, and (3) forget obsolete knowledge. This problem becomes more challenging in imbalanced non-stationary scenarios. Previous approaches save arriving instances, then utilize up/down sampling techniques to balance preserved samples and update their models. However, this strategy comes with two drawbacks: first, a delay in updating the models, and second, the up/down sampling causes information loss for the majority classes and introduces noise for the minority classes. To address these drawbacks, we propose the Hyper-Ellipses-Extra-Margin model (HEEM), which properly addresses the class imbalance challenge in online learning by reacting to every new instance as it arrives. HEEM keeps an ensemble of hyper-extended-ellipses for the minority class. Misclassified instances of the majority class are then used to shrink the ellipse, and correctly predicted instances of the minority class are used to enlarge the ellipse. Experimental results show that HEEM mitigates the class imbalance problem and outperforms the state-of-the-art methods.
引用
收藏
页码:603 / 615
页数:13
相关论文
共 24 条
  • [1] Online neural network model for non-stationary and imbalanced data stream classification
    Ghazikhani, Adel
    Monsefi, Reza
    Yazdi, Hadi Sadoghi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2014, 5 (01) : 51 - 62
  • [2] Online neural network model for non-stationary and imbalanced data stream classification
    Adel Ghazikhani
    Reza Monsefi
    Hadi Sadoghi Yazdi
    International Journal of Machine Learning and Cybernetics, 2014, 5 : 51 - 62
  • [3] Ensemble of online neural networks for non-stationary and imbalanced data streams
    Ghazikhani, Adel
    Monsefi, Reza
    Yazdi, Hadi Sadoghi
    NEUROCOMPUTING, 2013, 122 : 535 - 544
  • [4] Online cost-sensitive neural network classifiers for non-stationary and imbalanced data streams
    Ghazikhani, Adel
    Monsefi, Reza
    Yazdi, Hadi Sadoghi
    NEURAL COMPUTING & APPLICATIONS, 2013, 23 (05) : 1283 - 1295
  • [5] Incremental kernel spectral clustering for online learning of non-stationary data
    Langone, Rocco
    Agudelo, Oscar Mauricio
    De Moor, Bart
    Suykens, Johan A. K.
    NEUROCOMPUTING, 2014, 139 : 246 - 260
  • [6] Online cost-sensitive neural network classifiers for non-stationary and imbalanced data streams
    Adel Ghazikhani
    Reza Monsefi
    Hadi Sadoghi Yazdi
    Neural Computing and Applications, 2013, 23 : 1283 - 1295
  • [7] An Online Algorithm for Computation Offloading in Non-Stationary Environments
    Rahman, Aniq Ur
    Ghatak, Gourab
    De Domenico, Antonio
    IEEE COMMUNICATIONS LETTERS, 2020, 24 (10) : 2167 - 2171
  • [8] A Non-Stationary Online Learning Approach to Mobility Management
    Zhou, Yiming
    Shen, Cong
    van der Schaar, Mihaela
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (02) : 1434 - 1446
  • [9] A Non-Stationary Online Learning Approach to Mobility Management
    Zhou, Yiming
    Shen, Cong
    Luo, Xiliang
    van der Schaar, Mihaela
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [10] Online Learning Bipartite Matching with Non-stationary Distributions
    Chen, Weirong
    Zheng, Jiaqi
    Yu, Haoyu
    Chen, Guihai
    Chen, Yixin
    Li, Dongsheng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)