Faster Convergence on Heterogeneous Federated Edge Learning: An Adaptive Clustered Data Sharing Approach

Times Cited: 0
Authors
Hu, Gang [1 ]
Teng, Yinglei [1 ]
Wang, Nan [1 ]
Han, Zhu [2 ,3 ]
Affiliations
[1] Beijing Univ Posts & Telecommun BUPT, Beijing Key Lab Work Safety Intelligent Monitoring, Beijing 100876, Peoples R China
[2] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
[3] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea
Funding
Japan Science and Technology Agency (JST); National Natural Science Foundation of China; National Key R&D Program of China;
Keywords
Training; Data models; Accuracy; Optimization; Distributed databases; Convergence; Delays; Data privacy; Costs; Computational modeling; 6G; federated learning; non-IID data; multicasting; sidelink; data sharing;
DOI
10.1109/TMC.2025.3533566
Chinese Library Classification
TP [Automation and Computer Technology];
Discipline Code
0812;
Abstract
Federated Edge Learning (FEL) emerges as a pioneering distributed machine learning paradigm for 6G hyper-connectivity, harnessing data from IoT devices while upholding data privacy. However, current FEL algorithms struggle with non-independent and non-identically distributed (non-IID) data, leading to elevated communication costs and compromised model accuracy. To address these statistical imbalances, we introduce a clustered data sharing framework that mitigates data heterogeneity by selectively sharing partial data from cluster heads to trusted associates through sidelink-aided multicasting. The collective communication pattern is integral to FEL training, where both cluster formation and the efficiency of communication and computation simultaneously impact training latency and accuracy. To tackle the tightly coupled data sharing and resource optimization, we decompose the optimization problem into client clustering and effective data sharing subproblems. Specifically, a distribution-based adaptive clustering algorithm (DACA) is devised based on three deductive cluster-forming conditions, which ensures the maximum sharing yield. Meanwhile, we design a stochastic-optimization-based joint computing frequency and shared data volume optimization (JFVO) algorithm, determining the optimal resource allocation under an uncertain objective function. The experiments show that the proposed framework facilitates FEL on non-IID datasets with a faster convergence rate and higher model accuracy in a resource-limited environment.
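The abstract describes clustering clients by the similarity of their local data distributions so that cluster heads can share data with statistically similar associates. The record gives no algorithmic details of DACA, so the sketch below is only an illustrative stand-in: it groups clients by Jensen-Shannon divergence between their label distributions under a hypothetical threshold, with all function names and the greedy head-assignment rule being assumptions rather than the paper's method.

```python
import math
from collections import Counter

def label_distribution(labels, num_classes):
    """Empirical label distribution of one client's local dataset."""
    counts = Counter(labels)
    total = len(labels)
    return [counts.get(c, 0) / total for c in range(num_classes)]

def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2), a symmetric, bounded
    measure of how dissimilar two label distributions are."""
    def kl(a, b):
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)
    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def greedy_cluster(dists, threshold):
    """Greedily assign each client to the first cluster whose head
    is within `threshold` divergence; otherwise it starts a new
    cluster and becomes its head. Returns (head, members) pairs."""
    clusters = []
    for i, d in enumerate(dists):
        for head, members in clusters:
            if js_divergence(dists[head], d) <= threshold:
                members.append(i)
                break
        else:
            clusters.append((i, [i]))
    return clusters

# Example: clients 0 and 1 hold similar label mixes, client 2 is skewed.
dists = [label_distribution(l, 3)
         for l in ([0, 0, 0, 1], [0, 0, 1, 1], [2, 2, 2, 2])]
clusters = greedy_cluster(dists, threshold=0.1)
```

In this toy run the first two clients fall into one cluster and the label-skewed third client forms its own, mirroring the intuition that data sharing is most useful between statistically dissimilar groups routed through a cluster head.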
Pages: 5342-5356
Page count: 15