A Broad Study on the Transferability of Visual Representations with Contrastive Learning

Cited by: 33

Authors:

Islam, Ashraful [1]

Chen, Chun-Fu [2,3]

Panda, Rameswar [2,3]

Karlinsky, Leonid [3]

Radke, Richard [1]

Feris, Rogerio [2,3]

Affiliations:

[1] Rensselaer Polytech Inst, Troy, NY 12181 USA

[2] MIT IBM Watson AI Lab, Cambridge, MA USA

[3] IBM Res, Armonk, NY USA

Source:

2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021) | 2021

DOI:

10.1109/ICCV48922.2021.00872

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Tremendous progress has been made in visual representation learning, notably with the recent success of self-supervised contrastive learning methods. Supervised contrastive learning has also been shown to outperform its cross-entropy counterparts by leveraging labels for choosing where to contrast. However, there has been little work to explore the transfer capability of contrastive learning to a different domain. In this paper, we conduct a comprehensive study on the transferability of learned representations of different contrastive approaches for linear evaluation, full-network transfer, and few-shot recognition on 12 downstream datasets from different domains, and object detection tasks on MSCOCO and VOC0712. The results show that the contrastive approaches learn representations that are easily transferable to a different downstream task. We further observe that the joint objective of self-supervised contrastive loss with cross-entropy/supervised-contrastive loss leads to better transferability of these models over their supervised counterparts. Our analysis reveals that the representations learned from the contrastive approaches contain more low/mid-level semantics than cross-entropy models, which enables them to quickly adapt to a new task. Our codes and models will be publicly available to facilitate future research on transferability of visual representations.

Pages: 8825-8835

Page count: 11

Number of references: 53
