FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients

被引:4
作者
Li, Daixun [1 ]
Xie, Weiying [1 ]
Wang, Zixuan [1 ]
Lu, Yibing [1 ]
Li, Yunsong [1 ]
Fang, Leyuan [2 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Federated learning; Feature extraction; Transformers; Satellites; Data models; Laser radar; Deep learning; multi-modality; diffusion model; remote sensing; feature fusion; federated learning;
D O I
10.1109/TCSVT.2024.3407131
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the rapid development of imaging sensor technology in the field of remote sensing, multi-modal remote sensing data fusion has emerged as a crucial research direction for land cover classification tasks. While diffusion models have made great progress in generative models and image classification tasks, existing models primarily focus on single-modality and single-client control, that is, the diffusion process is driven by a single modal in a single computing node. To facilitate the secure fusion of heterogeneous data from clients, it is necessary to enable distributed multi-modal control, such as merging the hyperspectral data of organization A and the LiDAR data of organization B privately on each base station client. In this study, we propose a multi-modal collaborative diffusion federated learning framework called FedDiff. Our framework establishes a dual-branch diffusion model feature extraction setup, where the two modal data are inputted into separate branches of the encoder. Our key insight is that diffusion models driven by different modalities are inherently complementary in terms of potential denoising steps on which bilateral connections can be built. Considering the challenge of private and efficient communication between multiple clients, we embed the diffusion model into the federated learning communication structure, and introduce a lightweight communication module. Qualitative and quantitative experiments validate the superiority of our framework in terms of image quality and conditional consistency. To the best of our knowledge, this is the first instance of deploying a diffusion model into a federated learning framework, achieving optimal both privacy protection and performance for heterogeneous data. Our FedDiff surpasses existing methods in terms of performance on three multi-modal datasets, achieving a classification average accuracy of 96.77% while reducing the communication cost.
引用
收藏
页码:10353 / 10367
页数:15
相关论文
共 53 条
[1]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[2]  
Bandara WGC, 2024, Arxiv, DOI arXiv:2206.11892
[3]  
Baranchuk D., 2021, arXiv
[4]   Using Meta Labels for the Training of Weighting Models in a Sample-Specific Late Fusion Classification Architecture [J].
Bellmann, Peter ;
Thiam, Patrick ;
Schwenker, Friedhelm .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :2604-2611
[5]  
Bonawitz K, 2019, ARXIV190201046
[6]   Towards Optimal Multi-Modal Federated Learning on Non-IID Data with Hierarchical Gradient Blending [J].
Chen, Sijia ;
Li, Baochun .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022), 2022, :1469-1478
[7]   FL-Enhance: A federated learning framework for balancing non-IID data with augmented and shared compressed samples [J].
Chiaro, Diletta ;
Prezioso, Edoardo ;
Ianni, Michele ;
Giampaolo, Fabio .
INFORMATION FUSION, 2023, 98
[8]  
Cho KYHY, 2014, Arxiv, DOI arXiv:1409.1259
[9]  
Dai W, 2015, AAAI CONF ARTIF INTE, P79
[10]   Global-Local Transformer Network for HSI and LiDAR Data Joint Classification [J].
Ding, Kexing ;
Lu, Ting ;
Fu, Wei ;
Li, Shutao ;
Ma, Fuyan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60