Similar classes latent distribution modelling-based oversampling method for imbalanced image classification

被引:1
|
作者
Ye, Wei [1 ,2 ]
Dong, Minggang [1 ,2 ]
Wang, Yan [1 ,2 ]
Gan, Guojun [1 ,2 ]
Liu, Deao [1 ,2 ]
机构
[1] Guilin Univ Technol, Sch Informat Sci & Engn, Guilin 541004, Peoples R China
[2] Guangxi Key Lab Embedded Technol & Intelligent Sys, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced classification; Oversampling; Latent distribution; Similar classes; Boundary samples; SMOTE; ALGORITHMS; NETWORK;
D O I
10.1007/s11227-022-05037-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Learning an unbiased classifier from imbalanced image datasets is challenging since the classifier may be strongly biased toward the majority class. To address this issue, some generative model-based oversampling methods have been proposed. However, most of these methods pay little attention to boundary samples, which may contribute tiny to learning an unbiased classifier. In this paper, we focus on boundary samples and propose a similar classes latent distribution modelling-based oversampling method. Specifically, first, we model each class as different von Mises-Fisher distributions, thereby aligning feature learning with the class distributions. Furthermore, we develop a distance minimization loss function, which makes latent representations from similar classes close to each other. In this way, the generator can capture more shared features during training. In addition, we propose a boundary sampling strategy, which uses latent variables near the decision boundary to generate boundary samples. These samples expand the minority decision region and reshape the decision boundary. Experiments on four imbalanced image datasets show that the proposed method achieves promising performance in terms of Recall, Precision, F1-score, and G-mean.
引用
收藏
页码:9985 / 10019
页数:35
相关论文
共 50 条
  • [41] Clustering-based improved adaptive synthetic minority oversampling technique for imbalanced data classification
    Jin, Dian
    Xie, Dehong
    Liu, Di
    Gong, Murong
    INTELLIGENT DATA ANALYSIS, 2023, 27 (03) : 635 - 652
  • [42] A Synthetic Minority Oversampling Technique Based on Gaussian Mixture Model Filtering for Imbalanced Data Classification
    Xu, Zhaozhao
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3740 - 3753
  • [43] DDSC-SMOTE: an imbalanced data oversampling algorithm based on data distribution and spectral clustering
    Li, Xinqi
    Liu, Qicheng
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12) : 17760 - 17789
  • [44] OALDPC: oversampling approach based on local density peaks clustering for imbalanced classification
    Li, Junnan
    Zhu, Qingsheng
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30987 - 31017
  • [45] Classification method for imbalanced LiDAR point cloud based on stack autoencoder
    Ren, Peng
    Xia, Qunli
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (06): : 3453 - 3470
  • [46] Oversampling method based on GAN for tabular binary classification problems
    Yang, Jie
    Jiang, Zhenhao
    Pan, Tingting
    Chen, Yueqi
    Pedrycz, Witold
    INTELLIGENT DATA ANALYSIS, 2023, 27 (05) : 1287 - 1308
  • [47] A new boundary-degree-based oversampling method for imbalanced data
    Yueqi Chen
    Witold Pedrycz
    Jie Yang
    Applied Intelligence, 2023, 53 : 26518 - 26541
  • [48] A Novel Oversampling Method for Imbalanced Datasets Based on Density Peaks Clustering
    Cao, Jie
    Shi, Yong
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (06): : 1813 - 1819
  • [49] Natural local density-based adaptive oversampling algorithm for imbalanced classification
    Wang, Wentong
    Yang, Lijun
    Zhang, Jinghui
    Yang, Juntao
    Tang, Dongming
    Liu, Tao
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [50] An Improved Oversampling Method for imbalanced Data-SMOTE Based on Canopy and K-means
    Guo, Chaoyou
    Ma, Yankun
    Xu, Zhe
    Cao, Mengmeng
    Yao, Qian
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1467 - 1469