Multi-center federated learning: clients clustering for better personalization

被引:136
作者
Long, Guodong [1 ]
Xie, Ming [1 ]
Shen, Tao [1 ]
Zhou, Tianyi [2 ,3 ]
Wang, Xianzhi [1 ]
Jiang, Jing [1 ]
机构
[1] Univ Technol Sydney, Australian AI Inst, Sydney, NSW, Australia
[2] Univ Washington, Seattle, WA 98195 USA
[3] Univ Maryland, College Pk, MD 20742 USA
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 01期
关键词
Federated learning; Personalized modeling; Clustering; DECISION-MAKING;
D O I
10.1007/s11280-022-01046-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Personalized decision-making can be implemented in a Federated learning (FL) framework that can collaboratively train a decision model by extracting knowledge across intelligent clients, e.g. smartphones or enterprises. FL can mitigate the data privacy risk of collaborative training since it merely collects local gradients from users without access to their data. However, FL is fragile in the presence of statistical heterogeneity that is commonly encountered in personalized decision making, e.g., non-IID data over different clients. Existing FL approaches usually update a single global model to capture the shared knowledge of all users by aggregating their gradients, regardless of the discrepancy between their data distributions. By comparison, a mixture of multiple global models could capture the heterogeneity across various clients if assigning the client to different global models (i.e., centers) in FL. To this end, we propose a novel multi-center aggregation mechanism to cluster clients using their models' parameters. It learns multiple global models from data as the cluster centers, and simultaneously derives the optimal matching between users and centers. We then formulate it as an optimization problem that can be efficiently solved by a stochastic expectation maximization (EM) algorithm. Experiments on multiple benchmark datasets of FL show that our method outperforms several popular baseline methods. The experimental source codes are publicly available on the Github repository (GitHub repository: https://github.com/mingxuts/multi-center-fed-learning).
引用
收藏
页码:481 / 500
页数:20
相关论文
共 65 条
[1]  
Arivazhagan Manoj Ghuhan, 2019, CoRR abs/1912.00818
[2]  
Bishop Christopher M., 2006, Pattern recognition and machine learning, DOI [10.1007/978-0-387-45528-0, DOI 10.1007/978-0-387-45528-0]
[3]  
Bonawitz K., 2019, P MACH LEARN SYST, DOI 10.48550/arXiv.1902.01046
[4]   Target-Aware Holistic Influence Maximization in Spatial Social Networks [J].
Cai, Taotao ;
Li, Jianxin ;
Mian, Ajmal S. ;
li, Ronghua ;
Sellis, Timos ;
Yu, Jeffrey Xu .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (04) :1993-2007
[5]  
Caldas S., 2018, LEAF BENCHMARK FEDER
[6]  
Cao T., 2020, ARXIV200109782, P1
[7]   On-line expectation-maximization algorithm for latent data models [J].
Cappe, Olivier ;
Moulines, Eric .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 :593-613
[8]  
Chandra, 2018, arXiv preprint arXiv:1806.00582
[9]  
Chen F., 2022, ARXIV220300829
[10]  
Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217