A Communication-Efficient Hierarchical Federated Learning Framework via Shaping Data Distribution at Edge

被引:17
作者
Deng, Yongheng [1 ]
Lyu, Feng [2 ]
Xia, Tengxi [1 ]
Zhou, Yuezhi [3 ]
Zhang, Yaoxue [1 ,3 ]
Ren, Ju [1 ,3 ]
Yang, Yuanyuan [4 ]
机构
[1] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRist, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Zhongguancun Lab, Beijing 100084, Peoples R China
[4] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
关键词
Costs; Data models; Servers; Computational modeling; Training data; Federated learning; Distributed databases; Hierarchical federated learning; communication efficiency; edge computing; distributed edge intelligence; RESOURCE-ALLOCATION;
D O I
10.1109/TNET.2024.3363916
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) enables collaborative model training over distributed computing nodes without sharing their privacy-sensitive raw data. However, in FL, iterative exchanges of model updates between distributed nodes and the cloud server can result in significant communication cost, especially when the data distributions at distributed nodes are imbalanced with requiring more rounds of iterations. In this paper, with our in-depth empirical studies, we disclose that extensive cloud aggregations can be avoided without compromising the learning accuracy if frequent aggregations can be enabled at edge network. To this end, we shed light on the hierarchical federated learning (HFL) framework, where a subset of distributed nodes can play as edge aggregators to support edge aggregations. Under the HFL framework, we formulate a communication cost minimization (CCM) problem to minimize the total communication cost required for model learning with a target accuracy by making decisions on edge aggragator selection and node-edge associations. Inspired by our data-driven insights that the potential of HFL lies in the data distribution at edge aggregators, we propose ShapeFL, i.e., SHaping dAta distRibution at Edge, to transform and solve the CCM problem. In ShapeFL, we divide the original problem into two sub-problems to minimize the per-round communication cost and maximize the data distribution diversity of edge aggregator data, respectively, and devise two light-weight algorithms to solve them accordingly. Extensive experiments are carried out based on several opened datasets and real-world network topologies, and the results demonstrate the efficacy of ShapeFL in terms of both learning accuracy and communication efficiency.
引用
收藏
页码:2600 / 2615
页数:16
相关论文
共 62 条
[51]   Adaptive Federated Learning in Resource Constrained Edge Computing Systems [J].
Wang, Shiqiang ;
Tuor, Tiffany ;
Salonidis, Theodoros ;
Leung, Kin K. ;
Makaya, Christian ;
He, Ting ;
Chan, Kevin .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) :1205-1221
[52]   Accelerating Federated Learning With Cluster Construction and Hierarchical Aggregation [J].
Wang, Zhiyuan ;
Xu, Hongli ;
Liu, Jianchun ;
Xu, Yang ;
Huang, He ;
Zhao, Yangming .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (07) :3805-3822
[53]   Resource-Efficient Federated Learning with Hierarchical Aggregation in Edge Computing [J].
Wang, Zhiyuan ;
Xu, Hongli ;
Liu, Jianchun ;
Huang, He ;
Qiao, Chunming ;
Zhao, Yangming .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
[54]  
Wangni JQ, 2018, ADV NEUR IN, V31
[55]   User-Level Privacy-Preserving Federated Learning: Analysis and Performance Optimization [J].
Wei, Kang ;
Li, Jun ;
Ding, Ming ;
Ma, Chuan ;
Su, Hang ;
Zhang, Bo ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (09) :3388-3401
[56]   FedHome: Cloud-Edge Based Personalized Federated Learning for In-Home Health Monitoring [J].
Wu, Qiong ;
Chen, Xu ;
Zhou, Zhi ;
Zhang, Junshan .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (08) :2818-2832
[57]  
Xiao H, 2017, Arxiv, DOI [arXiv:1708.07747, DOI 10.48550/ARXIV.1708.07747]
[58]  
Xiaomin Ouyang, 2021, MobiSys '21: Proceedings of the 19th Annual International Conference on Mobile Systems, Applications, and Services, P54, DOI 10.1145/3458864.3467681
[59]   CoEdge: Cooperative DNN Inference With Adaptive Workload Partitioning Over Heterogeneous Edge Devices [J].
Zeng, Liekang ;
Chen, Xu ;
Zhou, Zhi ;
Yang, Lei ;
Zhang, Junshan .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (02) :595-608
[60]   Efficient Computing Resource Sharing for Mobile Edge-Cloud Computing Networks [J].
Zhang, Yongmin ;
Lan, Xiaolong ;
Ren, Ju ;
Cai, Lin .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (03) :1227-1240