Delving into Inter-Image Invariance for Unsupervised Visual Representations

被引:20
|
作者
Xie, Jiahao [1 ]
Zhan, Xiaohang [2 ]
Liu, Ziwei [1 ]
Ong, Yew-Soon [1 ]
Loy, Chen Change [1 ]
机构
[1] Nanyang Technol Univ, 50 Nanyang Ave, Singapore, Singapore
[2] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China
关键词
Unsupervised learning; Self-supervised learning; Representation learning; Contrastive learning; Inter-image invariance;
D O I
10.1007/s11263-022-01681-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contrastive learning has recently shown immense potential in unsupervised visual representation learning. Existing studies in this track mainly focus on intra-image invariance learning. The learning typically uses rich intra-image transformations to construct positive pairs and then maximizes agreement using a contrastive loss. The merits of inter-image invariance, conversely, remainmuch less explored. Onemajor obstacle to exploit inter-image invariance is that it is unclear how to reliably construct inter-image positive pairs, and further derive effective supervision from them since no pair annotations are available. In this work, we present a comprehensive empirical study to better understand the role of inter-image invariance learning from three main constituting components: pseudo-label maintenance, sampling strategy, and decision boundary design. To facilitate the study, we introduce a unified and generic framework that supports the integration of unsupervised intra- and inter-image invariance learning. Through carefully-designed comparisons and analysis, multiple valuable observations are revealed: 1) online labels converge faster and perform better than offline labels; 2) semi-hard negative samples are more reliable and unbiased than hard negative samples; 3) a less stringent decision boundary is more favorable for inter-image invariance learning. With all the obtained recipes, our final model, namely InterCLR, shows consistent improvements over state-of-the-art intra-image invariance learning methods on multiple standard benchmarks. We hope this work will provide useful experience for devising effective unsupervised inter-image invariance learning. Code: https://github.com/open-mmlab/ mmselfsup.
引用
收藏
页码:2994 / 3013
页数:20
相关论文
共 50 条
  • [41] Parallel Registration of Multi-modal Medical Image Triples Having Unknown Inter-image Geometry
    Papp, Laszlo
    Zsoter, Norbert
    Szabo, Gergely
    Bejan, Csaba
    Szimjanovszki, Emil
    Zuhayra, Maaz
    2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 5825 - +
  • [42] Image Cluster Compression using Partitioned Iterated Function Systems and efficient Inter-Image Similarity Features
    Kramm, Matthias
    SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 989 - 996
  • [43] SIGNAL-PROCESSING PROCEDURE FOR INTER-IMAGE IDENTIFICATION OF PARTICLE TRACKS IN BUBBLE CHAMBERS
    LEVINSTONE, D
    EDEN, M
    NUCLEAR INSTRUMENTS & METHODS, 1978, 152 (2-3): : 357 - 366
  • [44] Relative Pose Estimation in Binocular Vision for a Planar Scene using Inter-Image Homographies
    Ornhag, Marcus Valtonen
    Heyden, Anders
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 568 - 575
  • [45] Image representations for visual learning
    Poggio, T
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 143 - 143
  • [46] Image representations for visual learning
    Beymer, D
    Poggio, T
    SCIENCE, 1996, 272 (5270) : 1905 - 1909
  • [47] Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations
    Van Gansbeke, Wouter
    Vandenhende, Simon
    Georgoulis, Stamatios
    Van Gool, Luc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [48] Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles
    Noroozi, Mehdi
    Favaro, Paolo
    COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 69 - 84
  • [49] Unsupervised classification of plethysmography signals with advanced visual representations
    Germain, Thibaut
    Truong, Charles
    Oudre, Laurent
    Krejci, Eric
    FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [50] Unsupervised learning of mid-level visual representations
    Matteucci, Giulio
    Piasini, Eugenio
    Zoccolan, Davide
    CURRENT OPINION IN NEUROBIOLOGY, 2024, 84