Dense Semantic Contrast for Self-Supervised Visual Representation Learning

被引：21

作者：

Li, Xiaoni ^{[1
,2
]}

Zhou, Yu ^{[1
,2
]}

Zhang, Yifei ^{[1
,2
]}

Zhang, Aoting ^{[1
]}

Wang, Wei ^{[1
,2
]}

Jiang, Ning ^{[3
]}

Wu, Haiying ^{[3
]}

Wang, Weiping ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China

[3] Mashang Consumer Finance Co Ltd, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

基金：

中国国家自然科学基金;

关键词：

Self-Supervised Learning; Representation Learning; Contrastive; Learning; Dense Representation; Semantics Discovery;

D O I：

10.1145/3474085.3475551

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-supervised representation learning for visual pre-training has achieved remarkable success with sample (instance or pixel) discrimination and semantics discovery of instance, whereas there still exists a non-negligible gap between pre-trained model and downstream dense prediction tasks. Concretely, these downstream tasks require more accurate representation, in other words, the pixels from the same object must belong to a shared semantic category, which is lacking in the previous methods. In this work, we present Dense Semantic Contrast (DSC) for modeling semantic category decision boundaries at a dense level to meet the requirement of these tasks. Furthermore, we propose a dense cross-image semantic contrastive learning framework for multi-granularity representation learning. Specially, we explicitly explore the semantic structure of the dataset by mining relations among pixels from different perspectives. For intra-image relation modeling, we discover pixel neighbors from multiple views. And for inter-image relations, we enforce pixel representation from the same semantic class to be more similar than the representation from different classes in one mini-batch. Experimental results show that our DSC model outperforms state-of-the-art methods when transferring to downstream dense prediction tasks, including object detection, semantic segmentation, and instance segmentation. Code will be made available.

引用

页码：1368 / 1376

页数：9

共 50 条

[41] Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning
Song, Kaiyou
Zhang, Shan
Luo, Zimeng
Wang, Tong
Xie, Jin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16053 - 16062
[42] solo-learn: A Library of Self-supervised Methods for Visual Representation Learning
Turrisi da Costa, Victor G.
Fini, Enrico
Nabi, Moin
Sebe, Nicu
Ricci, Elisa
JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23 : 1 - 6
[43] Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos
Feng, Zishun
Tu, Ming
Xia, Rui
Wang, Yuxuan
Krishnamurthy, Ashok
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5671 - 5672
[44] Feature selection and cascade dimensionality reduction for self-supervised visual representation learning
Qu, Peixin
Jin, Songlin
Tian, Yongqin
Zhou, Ling
Zheng, Ying
Zhang, Weidong
Xu, Yibo
Pan, Xipeng
Zhao, Wenyi
COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
[45] Self-Supervised Representation Learning using Visual Field Expansion on Digital Pathology
Boyd, Joseph
Liashuha, Mykola
Deutsch, Eric
Paragios, Nikos
Christodoulidis, Stergios
Vakalopoulou, Maria
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 639 - 647
[46] Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Ge, Chongjian
Liang, Youwei
Song, Yibing
Jiao, Jianbo
Wang, Jue
Luo, Ping
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[47] Self-supervised Representation Learning on Document Images
Cosma, Adrian
Ghidoveanu, Mihai
Panaitescu-Liess, Michael
Popescu, Marius
DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 103 - 117
[48] Self-Supervised Learning for Specified Latent Representation
Liu, Chicheng
Song, Libin
Zhang, Jiwen
Chen, Ken
Xu, Jing
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (01) : 47 - 59
[49] Self-Supervised Relational Reasoning for Representation Learning
Patacchiola, Massimiliano
Storkey, Amos
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[50] Adaptive Self-Supervised Graph Representation Learning
Gong, Yunchi
36TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2022), 2022, : 254 - 259

← 1 2 3 4 5 →