FedNP: Towards Non-IID Federated Learning via Federated Neural Propagation

被引:0
作者
Wu, Xueyang [1 ]
Huang, Hengguan [2 ]
Ding, Youlong [3 ]
Wang, Hao [4 ]
Wang, Ye [2 ]
Xu, Qian [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] Shenzhen Univ, Shenzhen, Peoples R China
[4] Rutgers State Univ, Piscataway, NJ USA
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9 | 2023年
关键词
EXPECTATION PROPAGATION; FRAMEWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional federated learning (FL) algorithms, such as FedAvg, fail to handle non-i.i.d data because they learn a global model by simply averaging biased local models that are trained on non-i.i.d data, therefore failing to model the global data distribution. In this paper, we present a novel Bayesian FL algorithm that successfully handles such a non-i.i.d FL setting by enhancing the local training task with an auxiliary task that explicitly estimates the global data distribution. One key challenge in estimating the global data distribution is that the data are partitioned in FL, and therefore the groundtruth global data distribution is inaccessible. To address this challenge, we propose an expectation-propagation-inspired probabilistic neural network, dubbed federated neural propagation (FedNP), which efficiently estimates the global data distribution given non-i.i.d data partitions. Our algorithm is sampling-free and end-to-end differentiable, can be applied with any conventional FL frameworks and learns richer global data representation. Experiments on both image classification tasks with synthetic non-i.i.d image data partitions and real-world non-i.i.d speech recognition tasks demonstrate that our framework effectively alleviates the performance deterioration caused by non-i.i.d data.
引用
收藏
页码:10399 / 10407
页数:9
相关论文
共 38 条
[1]  
Acar DAE, 2021, Arxiv, DOI arXiv:2111.04263
[2]  
Al-Shedivat M, 2021, Arxiv, DOI arXiv:2010.05273
[3]  
Amari S.-i., 2000, Methods of Information Geometry
[4]  
Barker J, 2018, Arxiv, DOI arXiv:1803.10609
[5]   Expectation Propagation for Likelihood-Free Inference [J].
Barthelme, Simon ;
Chopin, Nicolas .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (505) :315-333
[6]  
McMahan HB, 2018, Arxiv, DOI arXiv:1710.06963
[7]  
Chen H.-Y., 2020, arXiv
[8]   FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction [J].
Gao, Liang ;
Fu, Huazhu ;
Li, Li ;
Chen, Yingwen ;
Xu, Ming ;
Xu, Cheng-Zhong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :10102-10111
[9]  
Heess N, 2013, Advances in Neural Information Processing Systems, P3219
[10]  
Huang H., 2020, PR MACH LEARN RES, P4531