Enhancing Federated Learning Convergence With Dynamic Data Queue and Data-Entropy-Driven Participant Selection

被引:0
作者
Herath, Charuka [1 ]
Liu, Xiaolan [1 ]
Lambotharan, Sangarapillai [1 ]
Rahulamathavan, Yogachandran [1 ]
机构
[1] Loughborough Univ London, Inst Digital Technol, London E20 3BS, England
来源
IEEE INTERNET OF THINGS JOURNAL | 2025年 / 12卷 / 06期
基金
英国工程与自然科学研究理事会;
关键词
Data models; Convergence; Internet of Things; Distributed databases; Accuracy; Training; Mathematical models; Servers; Adaptation models; Data entropy; fairness FL; federated learning (FL); not identically and independently distributed (non-IID);
D O I
10.1109/JIOT.2024.3491034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) is a decentralized approach for collaborative model training on edge devices. This distributed method of model training offers advantages in privacy, security, regulatory compliance, and cost efficiency. Our emphasis in this research lies in addressing statistical complexity in FL, especially when the data stored locally across devices is not identically and independently distributed (non-IID). We have observed an accuracy reduction of up to approximately 10%-30%, particularly in skewed scenarios where each edge device trains with only 1 class of data. This reduction is attributed to weight divergence, quantified using the Euclidean distance between device-level class distributions and the population distribution, resulting in a bias term (delta(k)) . As a solution, we present a method to improve convergence in FL by creating a global subset of data on the server and dynamically distributing it across devices using a dynamic data queue-driven FL (DDFL). Next, we leverage Data Entropy metrics to observe the process during each training round and enable reasonable device selection for aggregation. Furthermore, we provide a convergence analysis of our proposed DDFL to justify their viability in practical FL scenarios, aiming for better device selection, a non-suboptimal global model, and faster convergence. We observe that our approach results in a substantial accuracy boost of approximately 5% for the MNIST dataset, around 18% for CIFAR-10, and 20% for CIFAR-100 with a 10% global subset of data, outperforming the state-of-the-art (SOTA) aggregation algorithms.
引用
收藏
页码:6646 / 6658
页数:13
相关论文
共 41 条
  • [31] Dynamic Ensemble Selection and Data Preprocessing for Multi-Class Imbalance Learning
    Cruz, Rafael M. O.
    Souza, Mariana de Araujo
    Sabourin, Robert
    Cavalcanti, George D. C.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
  • [32] AWE-DPFL: Adaptive weighting and dynamic privacy budget federated learning for heterogeneous data in IoT
    Zheng, Guiping
    Gong, Bei
    Guo, Chong
    Peng, Tianqi
    Gong, Mowei
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
  • [33] Joint Power Control and Data Size Selection for Over-the-Air Computation-Aided Federated Learning
    An, Xuming
    Fan, Rongfei
    Zuo, Shiyuan
    Hu, Han
    Jiang, Hai
    Zhang, Ning
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 14031 - 14046
  • [34] Federated dynamic weighted learning method based on non-independent and identically distributed industrial big data
    Liu J.
    Zhu J.
    Yuan R.
    Ji H.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (05): : 1602 - 1614
  • [35] Data-driven optimal terminal iterative learning control with initial value dynamic compensation
    Chi, Ronghu
    Huang, Biao
    Wang, Danwei
    Zhang, Ruikun
    Feng, Yuanjing
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12) : 1357 - 1364
  • [36] DIM-DS: Dynamic Incentive Model for Data Sharing in Federated Learning Based on Smart Contracts and Evolutionary Game Theory
    Chen, Yanru
    Zhang, Yuanyuan
    Wang, Shengwei
    Wang, Fan
    Li, Yang
    Jiang, Yuming
    Chen, Liangyin
    Guo, Bing
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23) : 24572 - 24584
  • [37] Data-Driven Dynamic Models of Active Distribution Networks Using Unsupervised Learning Techniques on Field Measurements
    Mitrentsis, Georgios
    Lens, Hendrik
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 2952 - 2965
  • [38] Automated detection and classification of tumor histotypes on dynamic PET imaging data through machine-learning driven voxel classification
    Bianchetti, G.
    Taralli, S.
    Vaccaro, M.
    Indovina, L.
    Mattoli, M., V
    Capotosti, A.
    Scolozzi, V
    Calcagni, M. L.
    Giordano, A.
    De Spirito, M.
    Maulucci, G.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
  • [39] Improved data-driven high-order model-free adaptive iterative learning control with fast convergence for trajectory tracking systems
    Huang, Liang
    Huang, Hua
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024, 46 (15) : 2884 - 2896
  • [40] Data Driven Real-Time Dynamic Voltage Control Using Decentralized Execution Multi-Agent Deep Reinforcement Learning
    Wang, Yuling
    Vittal, Vijay
    IEEE OPEN ACCESS JOURNAL OF POWER AND ENERGY, 2024, 11 : 508 - 519