Delving into Inter-Image Invariance for Unsupervised Visual Representations

被引：20

作者：

Xie, Jiahao ^{[1
]}

Zhan, Xiaohang ^{[2
]}

Liu, Ziwei ^{[1
]}

Ong, Yew-Soon ^{[1
]}

Loy, Chen Change ^{[1
]}

机构：

[1] Nanyang Technol Univ, 50 Nanyang Ave, Singapore, Singapore

[2] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2022年 / 130卷 / 12期

关键词：

Unsupervised learning; Self-supervised learning; Representation learning; Contrastive learning; Inter-image invariance;

D O I：

10.1007/s11263-022-01681-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Contrastive learning has recently shown immense potential in unsupervised visual representation learning. Existing studies in this track mainly focus on intra-image invariance learning. The learning typically uses rich intra-image transformations to construct positive pairs and then maximizes agreement using a contrastive loss. The merits of inter-image invariance, conversely, remainmuch less explored. Onemajor obstacle to exploit inter-image invariance is that it is unclear how to reliably construct inter-image positive pairs, and further derive effective supervision from them since no pair annotations are available. In this work, we present a comprehensive empirical study to better understand the role of inter-image invariance learning from three main constituting components: pseudo-label maintenance, sampling strategy, and decision boundary design. To facilitate the study, we introduce a unified and generic framework that supports the integration of unsupervised intra- and inter-image invariance learning. Through carefully-designed comparisons and analysis, multiple valuable observations are revealed: 1) online labels converge faster and perform better than offline labels; 2) semi-hard negative samples are more reliable and unbiased than hard negative samples; 3) a less stringent decision boundary is more favorable for inter-image invariance learning. With all the obtained recipes, our final model, namely InterCLR, shows consistent improvements over state-of-the-art intra-image invariance learning methods on multiple standard benchmarks. We hope this work will provide useful experience for devising effective unsupervised inter-image invariance learning. Code: https://github.com/open-mmlab/ mmselfsup.

引用

页码：2994 / 3013

页数：20

共 50 条

[41] Parallel Registration of Multi-modal Medical Image Triples Having Unknown Inter-image Geometry
Papp, Laszlo
Zsoter, Norbert
Szabo, Gergely
Bejan, Csaba
Szimjanovszki, Emil
Zuhayra, Maaz
2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 5825 - +
[42] Image Cluster Compression using Partitioned Iterated Function Systems and efficient Inter-Image Similarity Features
Kramm, Matthias
SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 989 - 996
[43] SIGNAL-PROCESSING PROCEDURE FOR INTER-IMAGE IDENTIFICATION OF PARTICLE TRACKS IN BUBBLE CHAMBERS
LEVINSTONE, D
EDEN, M
NUCLEAR INSTRUMENTS & METHODS, 1978, 152 (2-3): : 357 - 366
[44] Relative Pose Estimation in Binocular Vision for a Planar Scene using Inter-Image Homographies
Ornhag, Marcus Valtonen
Heyden, Anders
PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 568 - 575
[45] Image representations for visual learning
Poggio, T
AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 143 - 143
[46] Image representations for visual learning
Beymer, D
Poggio, T
SCIENCE, 1996, 272 (5270) : 1905 - 1909
[47] Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations
Van Gansbeke, Wouter
Vandenhende, Simon
Georgoulis, Stamatios
Van Gool, Luc
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[48] Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles
Noroozi, Mehdi
Favaro, Paolo
COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 69 - 84
[49] Unsupervised classification of plethysmography signals with advanced visual representations
Germain, Thibaut
Truong, Charles
Oudre, Laurent
Krejci, Eric
FRONTIERS IN PHYSIOLOGY, 2023, 14
[50] Unsupervised learning of mid-level visual representations
Matteucci, Giulio
Piasini, Eugenio
Zoccolan, Davide
CURRENT OPINION IN NEUROBIOLOGY, 2024, 84

← 1 2 3 4 5 →