Enhancing Federated Learning Convergence With Dynamic Data Queue and Data-Entropy-Driven Participant Selection

被引：0

作者：

Herath, Charuka ^{[1
]}

Liu, Xiaolan ^{[1
]}

Lambotharan, Sangarapillai ^{[1
]}

Rahulamathavan, Yogachandran ^{[1
]}

机构：

[1] Loughborough Univ London, Inst Digital Technol, London E20 3BS, England

来源：

IEEE INTERNET OF THINGS JOURNAL | 2025年 / 12卷 / 06期

基金：

英国工程与自然科学研究理事会;

关键词：

Data models; Convergence; Internet of Things; Distributed databases; Accuracy; Training; Mathematical models; Servers; Adaptation models; Data entropy; fairness FL; federated learning (FL); not identically and independently distributed (non-IID);

D O I：

10.1109/JIOT.2024.3491034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Federated learning (FL) is a decentralized approach for collaborative model training on edge devices. This distributed method of model training offers advantages in privacy, security, regulatory compliance, and cost efficiency. Our emphasis in this research lies in addressing statistical complexity in FL, especially when the data stored locally across devices is not identically and independently distributed (non-IID). We have observed an accuracy reduction of up to approximately 10%-30%, particularly in skewed scenarios where each edge device trains with only 1 class of data. This reduction is attributed to weight divergence, quantified using the Euclidean distance between device-level class distributions and the population distribution, resulting in a bias term (delta(k)) . As a solution, we present a method to improve convergence in FL by creating a global subset of data on the server and dynamically distributing it across devices using a dynamic data queue-driven FL (DDFL). Next, we leverage Data Entropy metrics to observe the process during each training round and enable reasonable device selection for aggregation. Furthermore, we provide a convergence analysis of our proposed DDFL to justify their viability in practical FL scenarios, aiming for better device selection, a non-suboptimal global model, and faster convergence. We observe that our approach results in a substantial accuracy boost of approximately 5% for the MNIST dataset, around 18% for CIFAR-10, and 20% for CIFAR-100 with a 10% global subset of data, outperforming the state-of-the-art (SOTA) aggregation algorithms.

引用

页码：6646 / 6658

页数：13

共 41 条

[31] Dynamic Ensemble Selection and Data Preprocessing for Multi-Class Imbalance Learning
Cruz, Rafael M. O.
Souza, Mariana de Araujo
Sabourin, Robert
Cavalcanti, George D. C.
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
[32] AWE-DPFL: Adaptive weighting and dynamic privacy budget federated learning for heterogeneous data in IoT
Zheng, Guiping
Gong, Bei
Guo, Chong
Peng, Tianqi
Gong, Mowei
COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
[33] Joint Power Control and Data Size Selection for Over-the-Air Computation-Aided Federated Learning
An, Xuming
Fan, Rongfei
Zuo, Shiyuan
Hu, Han
Jiang, Hai
Zhang, Ning
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 14031 - 14046
[34] Federated dynamic weighted learning method based on non-independent and identically distributed industrial big data
Liu J.
Zhu J.
Yuan R.
Ji H.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (05): : 1602 - 1614
[35] Data-driven optimal terminal iterative learning control with initial value dynamic compensation
Chi, Ronghu
Huang, Biao
Wang, Danwei
Zhang, Ruikun
Feng, Yuanjing
IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12) : 1357 - 1364
[36] DIM-DS: Dynamic Incentive Model for Data Sharing in Federated Learning Based on Smart Contracts and Evolutionary Game Theory
Chen, Yanru
Zhang, Yuanyuan
Wang, Shengwei
Wang, Fan
Li, Yang
Jiang, Yuming
Chen, Liangyin
Guo, Bing
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23) : 24572 - 24584
[37] Data-Driven Dynamic Models of Active Distribution Networks Using Unsupervised Learning Techniques on Field Measurements
Mitrentsis, Georgios
Lens, Hendrik
IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 2952 - 2965
[38] Automated detection and classification of tumor histotypes on dynamic PET imaging data through machine-learning driven voxel classification
Bianchetti, G.
Taralli, S.
Vaccaro, M.
Indovina, L.
Mattoli, M., V
Capotosti, A.
Scolozzi, V
Calcagni, M. L.
Giordano, A.
De Spirito, M.
Maulucci, G.
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
[39] Improved data-driven high-order model-free adaptive iterative learning control with fast convergence for trajectory tracking systems
Huang, Liang
Huang, Hua
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024, 46 (15) : 2884 - 2896
[40] Data Driven Real-Time Dynamic Voltage Control Using Decentralized Execution Multi-Agent Deep Reinforcement Learning
Wang, Yuling
Vittal, Vijay
IEEE OPEN ACCESS JOURNAL OF POWER AND ENERGY, 2024, 11 : 508 - 519

← 1 2 3 4 5 →