Improving long-tailed classification with PixDyMix: a localized pixel-level mixing method

Cited by: 4
Authors
Zeng, Wu [1]
Xiao, Zhengying [1]
Affiliations
[1] Putian Univ, Engn Training Ctr, Putian 351100, Peoples R China
Keywords
Long-tailed classification; Imbalanced learning; Image classification; Data augmentation; Adaptive weight adjustment; Pixel-level dynamic mixing; SMOTE; GAN;
DOI
10.1007/s11760-024-03382-z
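Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract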
As datasets continue to grow, long-tailed class distributions have become increasingly prominent. Traditional approaches often favor head categories and neglect tail categories. To address this limitation, this paper proposes PDMLT (pixel-level dynamic mixing for long-tailed classification), whose core is a pixel-level dynamic mixing image augmentation technique called PixDyMix. PixDyMix adjusts the mixing weights according to the area of the cropped image region, which prevents excessive loss of key pixel information when a large area is cropped and improves both the quality of the generated samples and their agreement with the assigned labels. Generating more high-quality tail-category images in this way enhances the overall generalization ability of the model. In addition, to overcome the limitations of existing resampling strategies in allocating category weights, we introduce an adaptive weight function that adjusts the sampling weight of each category according to the degree of imbalance in the dataset, which improves the classification accuracy and stability of the model. Experiments on three standard long-tailed datasets demonstrate the advantages and effectiveness of the method.
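The abstract names two mechanisms without giving formulas: a patch mix whose weight depends on the crop area, and class sampling weights that depend on the degree of imbalance. The sketch below illustrates the general idea only. Both rules are assumptions made for illustration, not the paper's definitions: the patch weight is taken as `1 - area_ratio`, and the sampling exponent `q` is taken as the ratio of the smallest to the largest class count. The function names `mix_patch` and `class_sampling_probs` are hypothetical, and images are plain nested lists so the example has no dependencies.

```python
def mix_patch(background, foreground, box):
    """Blend a rectangular patch of `foreground` into `background`.

    Both images are equal-sized 2D lists of floats.
    `box` is (top, left, height, width) of the pasted region.
    """
    top, left, height, width = box
    rows, cols = len(background), len(background[0])
    area_ratio = (height * width) / (rows * cols)

    # ASSUMPTION (not the paper's formula): the larger the crop, the less
    # it overwrites, so background pixels under a large crop partly survive.
    patch_weight = 1.0 - area_ratio

    mixed = [row[:] for row in background]
    for i in range(top, top + height):
        for j in range(left, left + width):
            mixed[i][j] = (patch_weight * foreground[i][j]
                           + (1.0 - patch_weight) * background[i][j])

    # Soft-label weight of the foreground class: the share of the mixed
    # image's pixel mass that actually comes from the foreground image.
    foreground_label_weight = area_ratio * patch_weight
    return mixed, foreground_label_weight


def class_sampling_probs(class_counts):
    """Return per-class sampling probabilities proportional to count ** q.

    ASSUMPTION (not the paper's formula): q = min_count / max_count, so a
    balanced dataset keeps instance-proportional sampling (q = 1) and a
    heavily imbalanced one moves toward equal class probabilities (q near 0).
    """
    q = min(class_counts) / max(class_counts)
    raw = [count ** q for count in class_counts]
    total = sum(raw)
    return [value / total for value in raw]
```

Under these assumed rules, pasting a 2x2 patch into a 4x4 image gives an area ratio of 0.25 and a patch weight of 0.75, and the foreground class receives a soft-label weight equal to the product of the two. For class counts of 100 and 10, the tail class is sampled far more often than its raw share of the data, but still less often than the head class.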
Pages: 7157-7170
Page count: 14