Generalized Parametric Contrastive Learning

被引：15

作者：

Cui, Jiequan ^{[1
]}

Zhong, Zhisheng ^{[1
]}

Tian, Zhuotao ^{[1
]}

Liu, Shu ^{[2
]}

Yu, Bei ^{[1
]}

Jia, Jiaya ^{[1
,2
]}

机构：

[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, ShaTin, Hong Kong, Peoples R China

[2] SmartMore, Shenzhen 154100, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 12期

关键词：

Representation learning; contrastive learning; OOD robustness; long-tailed recognition; semantic segmentation;

D O I：

10.1109/TPAMI.2023.3278694

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose the Generalized Parametric Contrastive Learning (GPaCo/PaCo) which works well on both imbalanced and balanced data. Based on theoretical analysis, we observe supervised contrastive loss tends to bias on high-frequency classes and thus increases the difficulty of imbalanced learning. We introduce a set of parametric class-wise learnable centers to rebalance from an optimization perspective. Further, we analyze our GPaCo/PaCo loss under a balanced setting. Our analysis demonstrates that GPaCo/PaCo can adaptively enhance the intensity of pushing samples of the same class close as more samples are pulled together with their corresponding centers and benefit hard example learning. Experiments on long-tailed benchmarks manifest the new state-of-the-art for long-tailed recognition. On full ImageNet, models from CNNs to vision transformers trained with GPaCo loss show better generalization performance and stronger robustness compared with MAE models. Moreover, GPaCo can be applied to semantic segmentation task and obvious improvements are observed on 4 most popular benchmarks.

引用

页码：7463 / 7474

页数：12

共 70 条

[21] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, 10.48550/arXiv.2010.11929]
[22] Duggal R, 2020, Arxiv, DOI arXiv:2006.11979
[23] Grill J.-B., 2020, Adv. Neural Inf. Process. Syst., V33, P21271
[24] Learning from Imbalanced Data[J]. He, Haibo;Garcia, Edwardo A. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009(09)
[25] Masked Autoencoders Are Scalable Vision Learners[J]. He, Kaiming;Chen, Xinlei;Xie, Saining;Li, Yanghao;Dollar, Piotr;Girshick, Ross. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022
[26] Momentum Contrast for Unsupervised Visual Representation Learning[J]. He, Kaiming;Fan, Haoqi;Wu, Yuxin;Xie, Saining;Girshick, Ross. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020
[27] Deep Residual Learning for Image Recognition[J]. He, Kaiming;Zhang, Xiangyu;Ren, Shaoqing;Sun, Jian. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016
[28] Hendrycks D, 2019, Arxiv, DOI arXiv:1807.01697
[29] The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization[J]. Hendrycks, Dan;Basart, Steven;Mu, Norman;Kadavath, Saurav;Wang, Frank;Dorundo, Evan;Desai, Rahul;Zhu, Tyler;Parajuli, Samyak;Guo, Mike;Song, Dawn;Steinhardt, Jacob;Gilmer, Justin. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021
[30] Deep Imbalanced Learning for Face Recognition and Attribute Prediction[J]. Huang, Chen;Li, Yining;Loy, Chen Change;Tang, Xiaoou. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020(11)

← 1 2 3 4 5 6 7 →