Improving Global Generalization and Local Personalization for Federated Learning

被引:0
作者
Meng, Lei [1 ,2 ]
Qi, Zhuang [1 ]
Wu, Lei [1 ]
Du, Xiaoyu [3 ]
Li, Zhaochuan [4 ]
Cui, Lizhen [1 ]
Meng, Xiangxu [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Res Inst Ind Technol, Jinan 250098, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Inspur, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Data models; Adaptation models; Optimization; Servers; Federated learning; Collaboration; Prototypes; Data heterogeneity; federated learning (FL); generalization; personalization; prototypical calibration; HEALTH;
D O I
10.1109/TNNLS.2024.3417452
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning aims to facilitate collaborative training among multiple clients with data heterogeneity in a privacy-preserving manner, which either generates the generalized model or develops personalized models. However, existing methods typically struggle to balance both directions, as optimizing one often leads to failure in another. To address the problem, this article presents a method named personalized federated learning via cross silo prototypical calibration (pFedCSPC) to enhance the consistency of knowledge of clients by calibrating features from heterogeneous spaces, which contributes to enhancing the collaboration effectiveness between clients. Specifically, pFedCSPC employs an adaptive aggregation method to offer personalized initial models to each client, enabling rapid adaptation to personalized tasks. Subsequently, pFedCSPC learns class representation patterns on clients by clustering, averages the representations within each cluster to form local prototypes, and aggregates them on the server to generate global prototypes. Meanwhile, pFedCSPC leverages global prototypes as knowledge to guide the learning of local representation, which is beneficial for mitigating the data imbalanced problem and preventing overfitting. Moreover, pFedCSPC has designed a cross-silo prototypical calibration (CSPC) module, which utilizes contrastive learning techniques to map heterogeneous features from different sources into a unified space. This can enhance the generalization ability of the global model. Experiments were conducted on four datasets in terms of performance comparison, ablation study, in-depth analysis, and case study, and the results verified that pFedCSPC achieves improvements in both global generalization and local personalization performance via calibrating cross-source features and strengthening collaboration effectiveness, respectively.
引用
收藏
页数:12
相关论文
共 81 条
  • [41] Lyu L., 2020, Federated Learning: Privacy and Incentive, P189
  • [42] Lyu L, 2020, Arxiv, DOI [arXiv:2003.02133, DOI 10.48550/ARXIV.2003.02133]
  • [43] Triple Sequence Learning for Cross-domain Recommendation
    Ma, Haokai
    Xie, Ruobing
    Meng, Lei
    Chen, Xin
    Zhang, Xu
    Lin, Leyu
    Zhou, Jie
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (04)
  • [44] Mansour Y, 2020, Arxiv, DOI arXiv:2002.10619
  • [45] McMahan HB, 2017, PR MACH LEARN RES, V54, P1273
  • [46] Learning Using Privileged Information for Food Recognition
    Meng, Lei
    Chen, Long
    Yang, Xun
    Tao, Dacheng
    Zhang, Hanwang
    Miao, Chunyan
    Chua, Tat-Seng
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 557 - 565
  • [47] Salience-aware adaptive resonance theory for large-scale sparse data clustering
    Meng, Lei
    Tan, Ah-Hwee
    Miao, Chunyan
    [J]. NEURAL NETWORKS, 2019, 120 : 143 - 157
  • [48] Semi-Supervised Heterogeneous Fusion for Multimedia Data Co-Clustering
    Meng, Lei
    Tan, Ah-Hwee
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2293 - 2306
  • [49] FedProc: Prototypical contrastive federated learning on non-IID data
    Mu, Xutong
    Shen, Yulong
    Cheng, Ke
    Geng, Xueli
    Fu, Jiaxuan
    Zhang, Tao
    Zhang, Zhiwei
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 93 - 104
  • [50] Liang PP, 2020, Arxiv, DOI arXiv:2001.01523