Disentangling Label Distribution for Long-tailed Visual Recognition

被引:138
|
作者
Hong, Youngkyu [1 ]
Han, Seungju [1 ]
Choi, Kwanghee [1 ]
Seo, Seokjun [1 ]
Kim, Beomsu [1 ]
Chang, Buru [1 ]
机构
[1] Hyperconnect, Seoul, South Korea
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
CLASS IMBALANCE;
D O I
10.1109/CVPR46437.2021.00656
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current evaluation protocol of long-tailed visual recognition trains the classification model on the long-tailed source label distribution and evaluates its performance on the uniform target label distribution. Such protocol has questionable practicality since the target may also be long-tailed. Therefore, we formulate long-tailed visual recognition as a label shift problem where the target and source label distributions are different. One of the significant hurdles in dealing with the label shift problem is the entanglement between the source label distribution and the model prediction. In this paper, we focus on disentangling the source label distribution from the model prediction. We first introduce a simple but overlooked baseline method that matches the target label distribution by post-processing the model prediction trained by the cross-entropy loss and the Softmax function. Although this method surpasses state-of-the-art methods on benchmark datasets, it can be further improved by directly disentangling the source label distribution from the model prediction in the training phase. Thus, we propose a novel method, LAbel distribution DisEntangling (LADE) loss based on the optimal bound of Donsker-Varadhan representation. LADE achieves state-of-the-art performance on benchmark datasets such as CIFAR-100-LL Places-LT ImageNet-LL and iNaturalist 2018. Moreover LADE outperforms existing methods on various shifted target label distributions, showing the general adaptability of our proposed method.
引用
收藏
页码:6622 / 6632
页数:11
相关论文
共 50 条
  • [1] Beyond the Label Distribution Prior for Long-Tailed Recognition
    Li, Ming
    Cao, Liujuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 792 - 803
  • [2] A Survey on Long-Tailed Visual Recognition
    Yang, Lu
    Jiang, He
    Song, Qing
    Guo, Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (07) : 1837 - 1872
  • [3] A Survey on Long-Tailed Visual Recognition
    Lu Yang
    He Jiang
    Qing Song
    Jun Guo
    International Journal of Computer Vision, 2022, 130 : 1837 - 1872
  • [4] Decoupled Optimisation for Long-Tailed Visual Recognition
    Cong, Cong
    Xuan, Shiyu
    Liu, Sidong
    Zhang, Shiliang
    Pagnucco, Maurice
    Song, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1380 - 1388
  • [5] Feature fusion network for long-tailed visual recognition
    Zhou, Xuesong
    Zhai, Junhai
    Cao, Yang
    PATTERN RECOGNITION, 2023, 144
  • [6] Attentive Feature Augmentation for Long-Tailed Visual Recognition
    Wang, Weiqiu
    Zhao, Zhicheng
    Wang, Pingyu
    Su, Fei
    Meng, Hongying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5803 - 5816
  • [7] A dual progressive strategy for long-tailed visual recognition
    Hong Liang
    Guoqing Cao
    Mingwen Shao
    Qian Zhang
    Machine Vision and Applications, 2024, 35
  • [8] A dual progressive strategy for long-tailed visual recognition
    Liang, Hong
    Cao, Guoqing
    Shao, Mingwen
    Zhang, Qian
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [9] Nested Collaborative Learning for Long-Tailed Visual Recognition
    Li, Jun
    Tan, Zichang
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6939 - 6948
  • [10] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
    Du, Chaoqun
    Wang, Yulin
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5890 - 5904