Similar classes latent distribution modelling-based oversampling method for imbalanced image classification

Cited by: 1
Authors
Ye, Wei [1 ,2 ]
Dong, Minggang [1 ,2 ]
Wang, Yan [1 ,2 ]
Gan, Guojun [1 ,2 ]
Liu, Deao [1 ,2 ]
Affiliations
[1] Guilin Univ Technol, Sch Informat Sci & Engn, Guilin 541004, Peoples R China
[2] Guangxi Key Lab Embedded Technol & Intelligent Sys, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Imbalanced classification; Oversampling; Latent distribution; Similar classes; Boundary samples; SMOTE; ALGORITHMS; NETWORK;
DOI
10.1007/s11227-022-05037-7
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Learning an unbiased classifier from imbalanced image datasets is challenging because the classifier may be strongly biased toward the majority class. To address this issue, several generative model-based oversampling methods have been proposed. However, most of these methods pay little attention to boundary samples and may therefore contribute little to learning an unbiased classifier. In this paper, we focus on boundary samples and propose an oversampling method based on modelling the latent distributions of similar classes. Specifically, we first model each class with its own von Mises-Fisher distribution, thereby aligning feature learning with the class distributions. Furthermore, we develop a distance minimization loss function that pulls the latent representations of similar classes close to each other, so that the generator can capture more shared features during training. In addition, we propose a boundary sampling strategy that uses latent variables near the decision boundary to generate boundary samples. These samples expand the minority decision region and reshape the decision boundary. Experiments on four imbalanced image datasets show that the proposed method achieves promising performance in terms of Recall, Precision, F1-score, and G-mean.
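The four evaluation measures named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the measures are averaged across classes, so the code below assumes macro averaging for Recall, Precision, and F1-score, and takes G-mean as the geometric mean of the per-class recalls, which is one common multi-class definition. The function names `per_class_metrics` and `summarize` are hypothetical and are not taken from the paper.

```python
import numpy as np


def per_class_metrics(y_true, y_pred, n_classes):
    """Return per-class recall, precision and F1 from integer labels."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    # Confusion matrix: rows are true classes, columns are predicted classes.
    cm = np.zeros((n_classes, n_classes), dtype=np.int64)
    np.add.at(cm, (y_true, y_pred), 1)

    tp = np.diag(cm).astype(float)
    support = cm.sum(axis=1)    # number of true samples per class
    predicted = cm.sum(axis=0)  # number of predictions per class

    # Classes with no samples or no predictions get a score of 0.
    recall = np.divide(tp, support, out=np.zeros_like(tp), where=support > 0)
    precision = np.divide(tp, predicted, out=np.zeros_like(tp), where=predicted > 0)
    denom = precision + recall
    f1 = np.divide(2 * precision * recall, denom,
                   out=np.zeros_like(tp), where=denom > 0)
    return recall, precision, f1


def summarize(y_true, y_pred, n_classes):
    """Macro-averaged scores plus G-mean (an assumed multi-class definition)."""
    recall, precision, f1 = per_class_metrics(y_true, y_pred, n_classes)
    # Geometric mean of per-class recalls: zero if any class is never recovered.
    g_mean = float(np.prod(recall) ** (1.0 / n_classes))
    return {
        "recall": float(recall.mean()),
        "precision": float(precision.mean()),
        "f1": float(f1.mean()),
        "g_mean": g_mean,
    }


if __name__ == "__main__":
    # Toy imbalanced example: four majority samples, two minority samples.
    print(summarize([0, 0, 0, 0, 1, 1], [0, 0, 0, 1, 1, 0], n_classes=2))
```

Because G-mean multiplies the per-class recalls, a classifier that ignores a minority class scores zero regardless of its accuracy on the majority class, which is why the measure is commonly reported for imbalanced problems.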
Pages: 9985 - 10019
Page count: 35
Related papers
50 in total
  • [31] Evidence-based adaptive oversampling algorithm for imbalanced classification
    Chen-ju Lin
    Florence Leony
    Knowledge and Information Systems, 2024, 66 : 2209 - 2233
  • [32] A new boundary-degree-based oversampling method for imbalanced data
    Chen, Yueqi
    Pedrycz, Witold
    Yang, Jie
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26518 - 26541
  • [33] Perturbation-based oversampling technique for imbalanced classification problems
    Jianjun Zhang
    Ting Wang
    Wing W. Y. Ng
    Witold Pedrycz
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 773 - 787
  • [34] SMOTE-BD: An Exact and Scalable Oversampling Method for Imbalanced Classification in Big Data
    Basgall, Maria Jose
    Hasperue, Waldo
    Naiouf, Marcelo
    Fernandez, Alberto
    Herrera, Francisco
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2018, 18 (03) : 203 - 209
  • [35] An oversampling method for wafer map defect pattern classification considering small and imbalanced data
    Kim, Eun-Su
    Choi, Seung-Hyun
    Lee, Dong-Hee
    Kim, Kwang-Jae
    Bae, Young-Mok
    Oh, Young-Chan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 162
  • [36] A new instance density-based synthetic minority oversampling method for imbalanced classification problems
    Ma, Chung-Kang
    Park, You-Jin
    ENGINEERING OPTIMIZATION, 2022, 54 (10) : 1743 - 1757
  • [37] A New Segmented Oversampling Method for Imbalanced Data Classification Using Quasi-Linear SVM
    Zhou, Bo
    Li, Weite
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2017, 12 (06) : 891 - 898
  • [38] Imbalanced Classification via Feature Dictionary-Based Minority Oversampling
    Park, Minho
    Song, Hwa Jeon
    Kang, Dong-Oh
    IEEE ACCESS, 2022, 10 : 34236 - 34245
  • [39] Binary imbalanced data classification based on diversity oversampling by generative models
    Zhai, Junhai
    Qi, Jiaxing
    Shen, Chu
    INFORMATION SCIENCES, 2022, 585 : 313 - 343
  • [40] An oversampling method for imbalanced dataset based on sparsity and boundary degree
    Zhen Xue
    Yan Gao
    Liangliang Zhang
    Xu Yang
    Jianzhen Wu
    Multimedia Tools and Applications, 2025, 84 (17) : 17361 - 17387