Semantic Transfer from Head to Tail: Enlarging Tail Margin for Long-Tailed Visual Recognition

被引:0
作者
Zhang, Shan [1 ]
Ni, Yao [1 ]
Du, Jinhao [2 ]
Liu, Yanxia [3 ]
Koniusz, Piotr [1 ,4 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] Peking Univ, Beijing, Peoples R China
[3] Beijing Union Univ, Beijing, Peoples R China
[4] Data61 CSIRO, Eveleigh, Australia
来源
2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024 | 2024年
关键词
D O I
10.1109/WACV57701.2024.00138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks excel in visual recognition tasks, but their success hinges on access to balanced datasets. Yet, real-world datasets often exhibit a long-tailed distribution, compromising network efficiency and hampering generalization on unseen data. To enhance the model's generalization in long-tailed scenarios, we present a novel feature augmentation approach termed SeMAntic tRansfer from head to Tail (SMART), which enriches the feature patterns for tail samples by transferring semantic covariance from the head classes to the tail classes along semantically correlating dimensions. This strategy boosts the model's generalization ability by implicitly and adaptively weighting the logits, thereby widening the classification margin of tail classes. Inspired by the success of this weighting, we further incorporate a semantic-aware weighting strategy for the loss tied to tail samples. This amplifies the effect of enlarging the margin for tail classes. We are the first to provide theoretical analysis that demonstrates a large semantic diversity in tail samples can increase class margins during the training stage, leading to improved generalization. Empirical observations support our theory. Notably, with no need for extra data or learnable parameters, SMART achieves state-of-the-art results on five long-tailed benchmark datasets: CIFAR-10/100-LT, Places-LT, ImageNet-LT, and iNaturalist 2018.
引用
收藏
页码:1339 / 1349
页数:11
相关论文
共 50 条
[21]   Disentangling Label Distribution for Long-tailed Visual Recognition [J].
Hong, Youngkyu ;
Han, Seungju ;
Choi, Kwanghee ;
Seo, Seokjun ;
Kim, Beomsu ;
Chang, Buru .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6622-6632
[22]   Attentive Feature Augmentation for Long-Tailed Visual Recognition [J].
Wang, Weiqiu ;
Zhao, Zhicheng ;
Wang, Pingyu ;
Su, Fei ;
Meng, Hongying .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) :5803-5816
[23]   A dual progressive strategy for long-tailed visual recognition [J].
Liang, Hong ;
Cao, Guoqing ;
Shao, Mingwen ;
Zhang, Qian .
MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
[24]   Nested Collaborative Learning for Long-Tailed Visual Recognition [J].
Li, Jun ;
Tan, Zichang ;
Wan, Jun ;
Lei, Zhen ;
Guo, Guodong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :6939-6948
[25]   Probabilistic Contrastive Learning for Long-Tailed Visual Recognition [J].
Du, Chaoqun ;
Wang, Yulin ;
Song, Shiji ;
Huang, Gao .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) :5890-5904
[26]   Exploring the auxiliary learning for long-tailed visual recognition [J].
Zhang, Junjie ;
Liu, Lingqiao ;
Wang, Peng ;
Zhang, Jian .
NEUROCOMPUTING, 2021, 449 :303-314
[27]   Balanced Contrastive Learning for Long-Tailed Visual Recognition [J].
Zhu, Jianggang ;
Wang, Zheng ;
Chen, Jingjing ;
Chen, Yi-Ping Phoebe ;
Jiang, Yu-Gang .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :6898-6907
[28]   Self Supervision to Distillation for Long-Tailed Visual Recognition [J].
Li, Tianhao ;
Wang, Limin ;
Wu, Gangshan .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :610-619
[29]   Separating Noisy Samples From Tail Classes for Long-Tailed Image Classification With Label Noise [J].
Fang, Chaowei ;
Cheng, Lechao ;
Mao, Yining ;
Zhang, Dingwen ;
Fang, Yixiang ;
Li, Guanbin ;
Qi, Huiyan ;
Jiao, Licheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) :16036-16048
[30]   Difficulty-aware Balancing Margin Loss for Long-tailed Recognition [J].
Son, Minseok ;
Koo, Inyong ;
Park, Jinyoung ;
Kim, Changick .
THIRTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI-25, VOL 39 NO 19, 2025, :20522-20530