Dissecting Supervised Constrastive Learning

Cited by: 0

Authors:

Graf, Florian [1]

Hofer, Christoph D. [1]

Niethammer, Marc [2]

Kwitt, Roland [1]

Affiliations:

[1] Univ Salzburg, Dept Comp Sci, Salzburg, Austria

[2] Univ N Carolina, Chapel Hill, NC USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021 / Vol. 139

Funding:

Austrian Science Fund;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Minimizing cross-entropy over the softmax scores of a linear map composed with a high-capacity encoder is arguably the most popular choice for training neural networks on supervised learning tasks. However, recent works show that one can directly optimize the encoder instead, to obtain equally (or even more) discriminative representations via a supervised variant of a contrastive objective. In this work, we address the question whether there are fundamental differences in the sought-for representation geometry in the output space of the encoder at minimal loss. Specifically, we prove, under mild assumptions, that both losses attain their minimum once the representations of each class collapse to the vertices of a regular simplex, inscribed in a hypersphere. We provide empirical evidence that this configuration is attained in practice and that reaching a close-to-optimal state typically indicates good generalization performance. Yet, the two losses show remarkably different optimization behavior. The number of iterations required to perfectly fit to data scales superlinearly with the amount of randomly flipped labels for the supervised contrastive loss. This is in contrast to the approximately linear scaling previously reported for networks trained with cross-entropy.
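The simplex configuration named in the abstract can be checked numerically. The sketch below is illustrative only and is not taken from the paper: it assumes one unit-norm representation per class, and the helper names `simplex_vertices` and `dot` are hypothetical. It builds K points by centering the standard basis vectors and rescaling them to the unit hypersphere, then confirms the known property of a regular simplex that every pair of distinct vertices has inner product -1/(K-1).

```python
import math


def simplex_vertices(num_classes):
    """Return num_classes unit vectors forming a regular simplex centered at the origin.

    Illustrative construction (an assumption, not the paper's code): take the
    standard basis vectors of R^K, subtract their mean, and rescale to norm 1.
    """
    k = num_classes
    scale = math.sqrt(k / (k - 1))  # inverse of the norm of a centered basis vector
    return [
        [scale * ((1.0 if i == j else 0.0) - 1.0 / k) for j in range(k)]
        for i in range(k)
    ]


def dot(u, v):
    """Plain inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


num_classes = 4
vertices = simplex_vertices(num_classes)

# Every vertex lies on the unit hypersphere.
norms = [math.sqrt(dot(v, v)) for v in vertices]

# Every pair of distinct vertices has the same inner product, -1/(K-1).
pairwise = [
    dot(vertices[i], vertices[j])
    for i in range(num_classes)
    for j in range(i + 1, num_classes)
]
expected = -1.0 / (num_classes - 1)
```

All pairs sharing the same inner product is what makes the configuration maximally and uniformly separated: no class representation can move away from one class without moving closer to another.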


Pages: 10
