A Deep Cut Into Split Federated Self-Supervised Learning

被引：0

作者：

Przewiezlikowski, Marcin ^{[1
,2
,3
]}

Osial, Marcin ^{[1
,2
,3
]}

Zielinski, Bartosz ^{[1
,3
]}

Smieja, Marek ^{[1
]}

机构：

[1] Jagiellonian Univ, Fac Math & Comp Sci, Krakow, Poland

[2] Jagiellonian Univ, Doctoral Sch Exact & Nat Sci, Krakow, Poland

[3] IDEAS NCBR, Warsaw, Poland

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024 | 2024年 / 14942卷

关键词：

Federated learning; Self-supervised learning; Contrastive learning;

D O I：

10.1007/978-3-031-70344-7_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Collaborative self-supervised learning has recently become feasible in highly distributed environments by dividing the network layers between client devices and a central server. However, state-of-the-art methods, such as MocoSFL, are optimized for network division at the initial layers, which decreases the protection of the client data and increases communication overhead. In this paper, we demonstrate that splitting depth is crucial for maintaining privacy and communication efficiency in distributed training. We also show that MocoSFL suffers from a catastrophic quality deterioration for the minimal communication overhead. As a remedy, we introduce Momentum-Aligned contrastive Split Federated Learning (MonAcoSFL), which aligns online and momentum client models during training procedure. Consequently, we achieve state-of-the-art accuracy while significantly reducing the communication overhead, making MonAcoSFL more practical in real-world scenarios. Our codebase is available at https://github.com/gmum/MonAcoSFL.

引用

页码：444 / 459

页数：16

共 38 条

[1] Survey on Self-Supervised Learning: Auxiliary Pretext Tasks and Contrastive Learning Methods in Imaging [J].

Albelwi, Saleh .

ENTROPY, 2022, 24 (04)

[2] Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture [J].

Assran, Mahmoud ;

Duval, Quentin ;

Misra, Ishan ;

Bojanowski, Piotr ;

Vincent, Pascal ;

Rabbat, Michael ;

Lecun, Yann ;

Ballas, Nicolas .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :15619-15629

[3]

Bachman P, 2019, ADV NEUR IN, V32

[4]

Balestriero R, 2023, Arxiv, DOI [arXiv:2304.12210, DOI 10.48550/ARXIV.2304.12210]

[5]

Bordes F., 2023, Trans. Mach. Learn. Res.

[6] IMPROVING MEMORY BANKS FOR UNSUPERVISED LEARNING WITH LARGE MINI-BATCH, CONSISTENCY AND HARD NEGATIVE MINING [J].

Bulat, Adrian ;

Sanchez-Lozano, Enrique ;

Tzimiropoulos, Georgios .

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1695-1699

[7]

Caron M, 2020, ADV NEUR IN, V33

[8] Emerging Properties in Self-Supervised Vision Transformers [J].

Caron, Mathilde ;

Touvron, Hugo ;

Misra, Ishan ;

Jegou, Herve ;

Mairal, Julien ;

Bojanowski, Piotr ;

Joulin, Armand .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640

[9]

Chen CC, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P1959

[10]

Chen T, 2020, PR MACH LEARN RES, V119

← 1 2 3 4 →