Dissecting Supervised Contrastive Learning

Cited by: 0
Authors
Graf, Florian [1]
Hofer, Christoph D. [1]
Niethammer, Marc [2]
Kwitt, Roland [1]
Affiliations
[1] Univ Salzburg, Dept Comp Sci, Salzburg, Austria
[2] Univ N Carolina, Chapel Hill, NC USA
Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021 / Vol. 139
Funding
Austrian Science Fund;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Minimizing cross-entropy over the softmax scores of a linear map composed with a high-capacity encoder is arguably the most popular choice for training neural networks on supervised learning tasks. However, recent works show that one can directly optimize the encoder instead, to obtain equally (or even more) discriminative representations via a supervised variant of a contrastive objective. In this work, we address the question of whether there are fundamental differences in the sought-for representation geometry in the output space of the encoder at minimal loss. Specifically, we prove, under mild assumptions, that both losses attain their minimum once the representations of each class collapse to the vertices of a regular simplex, inscribed in a hypersphere. We provide empirical evidence that this configuration is attained in practice and that reaching a close-to-optimal state typically indicates good generalization performance. Yet, the two losses show remarkably different optimization behavior. The number of iterations required to perfectly fit the data scales superlinearly with the amount of randomly flipped labels for the supervised contrastive loss. This is in contrast to the approximately linear scaling previously reported for networks trained with cross-entropy.
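The geometry named in the abstract, a regular simplex inscribed in a hypersphere, can be illustrated with a small sketch. This is not the authors' code: the function name `simplex_vertices` and the choice of embedding the K vertices in R^K are assumptions made only for illustration. It builds K unit vectors whose pairwise inner products are all equal to -1/(K-1), which is the defining property of such a configuration.

```python
import numpy as np


def simplex_vertices(k: int) -> np.ndarray:
    """Return k unit vectors in R^k forming an origin-centered regular simplex.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Subtract the centroid from the standard basis vectors so they sum to zero.
    centered = np.eye(k) - np.full((k, k), 1.0 / k)
    # Rescale every row to unit length so all vertices lie on the unit sphere.
    return centered / np.linalg.norm(centered, axis=1, keepdims=True)


if __name__ == "__main__":
    k = 4  # number of classes, chosen arbitrarily for the demo
    vertices = simplex_vertices(k)
    gram = vertices @ vertices.T  # matrix of pairwise inner products
    print(np.round(gram, 3))
    # Diagonal entries are the squared norms; off-diagonal entries are the
    # inner products between distinct vertices, each equal to -1/(k-1).
    print("expected off-diagonal value:", -1.0 / (k - 1))
```

In this picture, each row would play the role of the point to which all representations of one class collapse at minimal loss; the equal negative inner products mean that every pair of classes is separated by the same angle.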
Pages: 10
Related papers
50 in total
  • [1] Dissecting the Roles of Supervised and Unsupervised Learning in Perceptual Discrimination Judgments
    Loewenstein, Yonatan
    Raviv, Ofri
    Ahissar, Merav
    JOURNAL OF NEUROSCIENCE, 2021, 41 (04) : 757 - 765
  • [2] Dissecting self-supervised learning methods for surgical computer vision
    Ramesh, Sanat
    Srivastav, Vinkle
    Alapatt, Deepak
    Yu, Tong
    Murali, Aditya
    Sestini, Luca
    Nwoye, Chinedu Innocent
    Hamoud, Idris
    Sharma, Saurav
    Fleurentin, Antoine
    Exarchakis, Georgios
    Karargyris, Alexandros
    Padoy, Nicolas
    MEDICAL IMAGE ANALYSIS, 2023, 88
  • [3] Federated Constrastive Learning and Visual Transformers for Personal Recommendation
    Belhadi, Asma
    Djenouri, Youcef
    Andrade, Fabio Augusto de Alcantara
    Srivastava, Gautam
    COGNITIVE COMPUTATION, 2024, 16 (05) : 2551 - 2565
  • [4] A constrastive semi-supervised deep learning framework for land cover classification of satellite time series with limited labels
    Ienco, Dino
    Gaetano, Raffaele
    Interdonato, Roberto
    NEUROCOMPUTING, 2024, 567
  • [5] Social Recommendation Based on Multi-Auxiliary Information Constrastive Learning
    Jiang, Feng
    Cao, Yang
    Wu, Huan
    Wang, Xibin
    Song, Yuqi
    Gao, Min
    MATHEMATICS, 2022, 10 (21)
  • [6] Dissecting learning
    Katherine Whalley
    Nature Reviews Neuroscience, 2008, 9 : 161 - 161
  • [7] Miscell: An efficient self-supervised learning approach for dissecting single-cell transcriptome
    Shen, Hongru
    Li, Yang
    Feng, Mengyao
    Shen, Xilin
    Wu, Dan
    Zhang, Chao
    Yang, Yichen
    Yang, Meng
    Hu, Jiani
    Liu, Jilei
    Wang, Wei
    Zhang, Qiang
    Song, Fangfang
    Yang, Jilong
    Chen, Kexin
    Li, Xiangchun
    ISCIENCE, 2021, 24 (11)
  • [8] Self-supervised image clustering from multiple incomplete views via constrastive complementary generation
    Wang, Jiatai
    Xu, Zhiwei
    Yang, Xuewen
    Guo, Dongjin
    Liu, Limin
    IET COMPUTER VISION, 2023, 17 (02) : 189 - 202
  • [9] Supervised learning
    Valkenborg, Dirk
    Geubbelmans, Melvin
    Rousseau, Axel-Jan
    Burzykowski, Tomasz
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2023, 164 (01) : 146 - 149
  • [10] Gated Self-supervised Learning for Improving Supervised Learning
    Fuadi, Erland Hillman
    Ruslim, Aristo Renaldo
    Wardhana, Putu Wahyu Kusuma
    Yudistira, Novanto
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024 : 611 - 615