FedATA: Adaptive attention aggregation for federated self-supervised medical image segmentation

被引:3
作者
Dai, Jian [1 ]
Wu, Hao [3 ]
Liu, Huan [2 ]
Yu, Liheng [1 ]
Hu, Xing [4 ]
Liu, Xiao [2 ]
Geng, Daoying [1 ,2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Huashan Hosp, Dept Radiol, Shanghai 200040, Peoples R China
[3] Fudan Univ, Huashan Hosp, Dept Dermatol, Shanghai 200040, Peoples R China
[4] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 20093, Peoples R China
关键词
Self-supervised learning; Masked image modeling; Federated learning;
D O I
10.1016/j.neucom.2024.128691
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained on large-scale datasets has profoundly promoted the development of deep learning models in medical image analysis. For medical image segmentation, collecting a large number of labeled volumetric medical images from multiple institutions is an enormous challenge due to privacy concerns. Self-supervised learning with mask image modeling (MIM) can learn general representation without annotations. Integrating MIM into FL enables collaborative learning of an efficient pre-trained model from unlabeled data, followed by fine-tuning with limited annotations. However, setting pixels as reconstruction targets in traditional MIM fails to facilitate robust representation learning due to the medical image's complexity and distinct characteristics. On the other hand, the generalization of the aggregated model in FL is also impaired under the heterogeneous data distributions among institutions. To address these issues, we proposed a novel self-supervised federated learning, which combines masked self-distillation with adaptive attention federated learning. Such incorporation enjoys two vital benefits. First, masked self-distillation sets high-quality latent representations of masked tokens as the target, improving the descriptive capability of the learned presentation rather than reconstructing low-level pixels. Second, adaptive attention aggregation with Personalized federate learning effectively captures specific-related representation from the aggregated model, thus facilitating local fine-tuning performance for target tasks. We conducted comprehensive experiments on two medical segmentation tasks using a large-scale dataset consisting of volumetric medical images from multiple institutions, demonstrating superior performance compared to existing federated self-supervised learning approaches.
引用
收藏
页数:13
相关论文
共 37 条
[1]   Big Self-Supervised Models Advance Medical Image Classification [J].
Azizi, Shekoofeh ;
Mustafa, Basil ;
Ryan, Fiona ;
Beaver, Zachary ;
Freyberg, Jan ;
Deaton, Jonathan ;
Loh, Aaron ;
Karthikesalingam, Alan ;
Kornblith, Simon ;
Chen, Ting ;
Natarajan, Vivek ;
Norouzi, Mohammad .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3458-3468
[2]  
Bao H., 2021, arXiv
[3]  
Beutel D.J., 2007, arXiv
[4]  
Chaitanya K., 2020, P ADV NEUR INF PROC, P12546
[5]  
Chen T, 2020, PR MACH LEARN RES, V119
[6]   Masked Image Modeling Advances 3D Medical Image Analysis [J].
Chen, Zekai ;
Agarwal, Devansh ;
Aggarwal, Kshitij ;
Safta, Wiem ;
Balan, Mariann Micsinai ;
Brown, Kevin .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :1969-1979
[7]   Multimodal Transformer for Accelerated MR Imaging [J].
Feng, Chun-Mei ;
Yan, Yunlu ;
Chen, Geng ;
Xu, Yong ;
Hu, Ying ;
Shao, Ling ;
Fu, Huazhu .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (10) :2804-2816
[8]  
Grill J., 2020, ADV NEURAL INFORM PR, V33, P21271
[9]   Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images [J].
Hatamizadeh, Ali ;
Nath, Vishwesh ;
Tang, Yucheng ;
Yang, Dong ;
Roth, Holger R. ;
Xu, Daguang .
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT I, 2022, 12962 :272-284
[10]   UNETR: Transformers for 3D Medical Image Segmentation [J].
Hatamizadeh, Ali ;
Tang, Yucheng ;
Nath, Vishwesh ;
Yang, Dong ;
Myronenko, Andriy ;
Landman, Bennett ;
Roth, Holger R. ;
Xu, Daguang .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :1748-1758