FedEL: Federated ensemble learning for non-iid data

被引:6
作者
Wu, Xing [1 ,2 ,3 ]
Pei, Jie [1 ]
Han, Xian-Hua [4 ]
Chen, Yen-Wei [5 ]
Yao, Junfeng [6 ]
Liu, Yang [1 ]
Qian, Quan [1 ,2 ,3 ]
Guo, Yike [7 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Zhejiang Lab, Hangzhou 311100, Zhejiang, Peoples R China
[3] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
[4] Rikkyo Univ, Grad Sch Artificial Intelligence & Sci, Tokyo 1718501, Japan
[5] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu 5250058, Japan
[6] Cssc Seago Syst Technol Co Ltd, Shanghai 200010, Peoples R China
[7] Hong Kong Univ Sci & Technol, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
Federated learning; Statistical heterogeneity; Ensemble learning;
D O I
10.1016/j.eswa.2023.121390
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) is a joint training pattern that fully utilizes data information whereas protecting data privacy. A key challenge in FL is statistical heterogeneity, which arises on account of the heterogeneity of local data distributions among clients, leading to inconsistency in local optimization goals and ultimately reducing the performance of globally aggregated models. We propose the Federated Ensemble Learning (FedEL), which makes full use of the heterogeneity of data distribution among clients to train a group of weak learners with diversity to construct a global model, which is a novel solution to the non-independent identical distribution (non-IID) problem. Experiments demonstrate that the proposed FedEL can improve performance in non-IID data scenarios. Even under extreme statistical heterogeneity, the average accuracy of FedEL is 3.54% higher than the state-of-the-art FL method. Moreover, the proposed FedEL reduces model storage and reasoning costs compared with traditional ensemble learning. The proposed FedEL demonstrates good generalization ability in experiments across different datasets, including natural scene image datasets and medical image datasets.
引用
收藏
页数:9
相关论文
共 32 条
[1]  
[Anonymous], Technical report
[2]  
Benbrahim H., 2021, 2021 INT C ART INT C, P1
[3]   Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting [J].
Borger, Thomas ;
Mosteiro, Pablo ;
Kaya, Heysem ;
Rijcken, Emil ;
Salah, Albert Ali ;
Scheepers, Floortje ;
Spruit, Marco .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
[4]   HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy [J].
Borgli, Hanna ;
Thambawita, Vajira ;
Smedsrud, Pia H. ;
Hicks, Steven ;
Jha, Debesh ;
Eskeland, Sigrun L. ;
Randel, Kristin Ranheim ;
Pogorelov, Konstantin ;
Lux, Mathias ;
Nguyen, Duc Tien Dang ;
Johansen, Dag ;
Griwodz, Carsten ;
Stensland, Hakon K. ;
Garcia-Ceja, Enrique ;
Schmidt, Peter T. ;
Hammer, Hugo L. ;
Riegler, Michael A. ;
Halvorsen, Pal ;
de Lange, Thomas .
SCIENTIFIC DATA, 2020, 7 (01)
[5]   Federated learning for privacy-preserving AI [J].
Cheng, Yong ;
Liu, Yang ;
Chen, Tianjian ;
Yang, Qiang .
COMMUNICATIONS OF THE ACM, 2020, 63 (12) :33-36
[6]  
Durmus Alp Emre, 2021, INT C LEARN REPR
[7]   Asynchronous federated learning system for human-robot touch interaction [J].
Gamboa-Montero, Juan Jose ;
Alonso-Martin, Fernando ;
Marques-Villarroya, Sara ;
Sequeira, Joao ;
Salichs, Miguel A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
[8]   FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction [J].
Gao, Liang ;
Fu, Huazhu ;
Li, Li ;
Chen, Yingwen ;
Xu, Ming ;
Xu, Cheng-Zhong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :10102-10111
[9]   Privacy-enhanced multi-party deep learning [J].
Gong, Maoguo ;
Feng, Jialun ;
Xie, Yu .
NEURAL NETWORKS, 2020, 121 :484-496
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778