MS-DINO: Masked Self-Supervised Distributed Learning Using Vision Transformer

被引:1
|
作者
Park, Sangjoon [1 ,2 ,3 ]
Lee, Ik Jae [4 ]
Kim, Jun Won [4 ]
Ye, Jong Chul [5 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon 34141, South Korea
[2] Yonsei Univ, Coll Med, Dept Radiat Oncol, Seoul 03722, South Korea
[3] Yonsei Univ, Inst Innovat Digital Healthcare, Seoul 03722, South Korea
[4] Gangnam Severance Hosp, Dept Radiat Oncol, Seoul 06273, South Korea
[5] Korea Adv Inst Sci & Technol, Kim Jaechul Grad Sch AI, Daejeon 34141, South Korea
基金
新加坡国家研究基金会;
关键词
Feature extraction; Task analysis; Biomedical imaging; Privacy; Transformers; Servers; Distance learning; Distributed learning; self-supervised learning; random permutation; vision transformer; privacy protection;
D O I
10.1109/JBHI.2024.3423797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite promising advancements in deep learning in medical domains, challenges still remain owing to data scarcity, compounded by privacy concerns and data ownership disputes. Recent explorations of distributed-learning paradigms, particularly federated learning, have aimed to mitigate these challenges. However, these approaches are often encumbered by substantial communication and computational overhead, and potential vulnerabilities in privacy safeguards. Therefore, we propose a self-supervised masked sampling distillation technique called MS-DINO, tailored to the vision transformer architecture. This approach removes the need for incessant communication and strengthens privacy using a modified encryption mechanism inherent to the vision transformer while minimizing the computational burden on client-side devices. Rigorous evaluations across various tasks confirmed that our method outperforms existing self-supervised distributed learning strategies and fine-tuned baselines.
引用
收藏
页码:6180 / 6192
页数:13
相关论文
共 50 条
  • [21] Self-supervised representation learning using multimodal Transformer for emotion recognition
    Goetz, Theresa
    Arora, Pulkit
    Erick, F. X.
    Holzer, Nina
    Sawant, Shrutika
    PROCEEDINGS OF THE 8TH INTERNATIONAL WORKSHOP ON SENSOR-BASED ACTIVITY RECOGNITION AND ARTIFICIAL INTELLIGENCE, IWOAR 2023, 2023,
  • [22] Learnable Masked Tokens for Improved Transferability of Self-supervised Vision Transformers
    Hu, Hao
    Baldassarre, Federico
    Azizpour, Hossein
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 409 - 426
  • [23] Self-Supervised Pretraining via Multimodality Images With Transformer for Change Detection
    Zhang, Yuxiang
    Zhao, Yang
    Dong, Yanni
    Du, Bo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [24] MaeFE: Masked Autoencoders Family of Electrocardiogram for Self-Supervised Pretraining and Transfer Learning
    Zhang, Huaicheng
    Liu, Wenhan
    Shi, Jiguang
    Chang, Sheng
    Wang, Hao
    He, Jin
    Huang, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [25] HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
    Hsu, Wei-Ning
    Bolte, Benjamin
    Tsai, Yao-Hung Hubert
    Lakhotia, Kushal
    Salakhutdinov, Ruslan
    Mohamed, Abdelrahman
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3451 - 3460
  • [26] Self-Supervised Representation Learning for Video Quality Assessment
    Jiang, Shaojie
    Sang, Qingbing
    Hu, Zongyao
    Liu, Lixiong
    IEEE TRANSACTIONS ON BROADCASTING, 2023, 69 (01) : 118 - 129
  • [27] Multi-scale vision transformer classification model with self-supervised learning and dilated convolution
    Xing, Liping
    Jin, Hongmei
    Li, Hong-an
    Li, Zhanli
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [28] Explainable Vision Transformer with Self-Supervised Learning to Predict Alzheimer's Disease Progression Using 18F-FDG PET
    Khatri, Uttam
    Kwon, Goo-Rak
    BIOENGINEERING-BASEL, 2023, 10 (10):
  • [29] Self-supervised Vision Transformer are Scalable Generative Models for Domain Generalization
    Doerrich, Sebastian
    Di Salvo, Francesco
    Ledig, Christian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 644 - 654
  • [30] Reduce the Difficulty of Incremental Learning With Self-Supervised Learning
    Guan, Linting
    Wu, Yan
    IEEE ACCESS, 2021, 9 : 128540 - 128549