Similar classes latent distribution modelling-based oversampling method for imbalanced image classification

被引:1
|
作者
Ye, Wei [1 ,2 ]
Dong, Minggang [1 ,2 ]
Wang, Yan [1 ,2 ]
Gan, Guojun [1 ,2 ]
Liu, Deao [1 ,2 ]
机构
[1] Guilin Univ Technol, Sch Informat Sci & Engn, Guilin 541004, Peoples R China
[2] Guangxi Key Lab Embedded Technol & Intelligent Sys, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced classification; Oversampling; Latent distribution; Similar classes; Boundary samples; SMOTE; ALGORITHMS; NETWORK;
D O I
10.1007/s11227-022-05037-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Learning an unbiased classifier from imbalanced image datasets is challenging since the classifier may be strongly biased toward the majority class. To address this issue, some generative model-based oversampling methods have been proposed. However, most of these methods pay little attention to boundary samples, which may contribute tiny to learning an unbiased classifier. In this paper, we focus on boundary samples and propose a similar classes latent distribution modelling-based oversampling method. Specifically, first, we model each class as different von Mises-Fisher distributions, thereby aligning feature learning with the class distributions. Furthermore, we develop a distance minimization loss function, which makes latent representations from similar classes close to each other. In this way, the generator can capture more shared features during training. In addition, we propose a boundary sampling strategy, which uses latent variables near the decision boundary to generate boundary samples. These samples expand the minority decision region and reshape the decision boundary. Experiments on four imbalanced image datasets show that the proposed method achieves promising performance in terms of Recall, Precision, F1-score, and G-mean.
引用
收藏
页码:9985 / 10019
页数:35
相关论文
共 50 条
  • [21] Local distribution-based adaptive minority oversampling for imbalanced data classification
    Wang, Xinyue
    Xu, Jian
    Zeng, Tieyong
    Jing, Liping
    NEUROCOMPUTING, 2021, 422 : 200 - 213
  • [22] Stochastic Sensitivity Measure-based Noise Filtering and Oversampling Method for Imbalanced Classification Problems
    Zhang, Jianjun
    Ng, Wing W. Y.
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 403 - 408
  • [23] Self-adaptive oversampling method based on the complexity of minority data in imbalanced datasets classification
    Tao, Xinmin
    Guo, Xinyue
    Zheng, Yujia
    Zhang, Xiaohan
    Chen, Zhiyu
    KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [24] Noise-Robust Gaussian Distribution Based Imbalanced Oversampling
    Shao, Xuetao
    Yan, Yuanting
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488 : 221 - 234
  • [25] A NOVEL RULE-BASED OVERSAMPLING APPROACH FOR IMBALANCED DATA CLASSIFICATION
    Zhang, Xiao
    Paz, Ivan
    Nebot, Angela
    37TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2023, ESM 2023, 2023, : 208 - 212
  • [26] Model-Based Oversampling for Imbalanced Sequence Classification
    Gong, Zhichen
    Chen, Huanhuan
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1009 - 1018
  • [27] A New Oversampling Method Based on the Classification Contribution Degree
    Jiang, Zhenhao
    Pan, Tingting
    Zhang, Chao
    Yang, Jie
    SYMMETRY-BASEL, 2021, 13 (02): : 1 - 13
  • [28] A non-parameter oversampling approach for imbalanced data classification based on hybrid natural neighbors
    Lin, Junyue
    Liang, Lu
    APPLIED INTELLIGENCE, 2025, 55 (05)
  • [29] A Synthetic Minority Based on Probabilistic Distribution (SyMProD) Oversampling for Imbalanced Datasets
    Kunakorntum, Intouch
    Hinthong, Woranich
    Phunchongharn, Phond
    IEEE ACCESS, 2020, 8 : 114692 - 114704
  • [30] An ensemble oversampling method for imbalanced classification with prior knowledge via generative adversarial network
    Zhang, Yulin
    Liu, Yuchen
    Wang, Yan
    Yang, Jie
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 235