Enhancing Federated Learning Convergence With Dynamic Data Queue and Data-Entropy-Driven Participant Selection

Cited by: 0
Authors
Herath, Charuka [1]
Liu, Xiaolan [1]
Lambotharan, Sangarapillai [1]
Rahulamathavan, Yogachandran [1]
Affiliations
[1] Loughborough Univ London, Inst Digital Technol, London E20 3BS, England
Source
IEEE INTERNET OF THINGS JOURNAL | 2025 / Vol. 12 / No. 06
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Data models; Convergence; Internet of Things; Distributed databases; Accuracy; Training; Mathematical models; Servers; Adaptation models; Data entropy; fairness FL; federated learning (FL); not identically and independently distributed (non-IID);
DOI
10.1109/JIOT.2024.3491034
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Federated learning (FL) is a decentralized approach for collaborative model training on edge devices. This distributed method of model training offers advantages in privacy, security, regulatory compliance, and cost efficiency. Our emphasis in this research lies in addressing statistical complexity in FL, especially when the data stored locally across devices is not identically and independently distributed (non-IID). We have observed an accuracy reduction of up to approximately 10%-30%, particularly in skewed scenarios where each edge device trains with only one class of data. This reduction is attributed to weight divergence, quantified using the Euclidean distance between device-level class distributions and the population distribution, resulting in a bias term (δ_k). As a solution, we present a method to improve convergence in FL by creating a global subset of data on the server and dynamically distributing it across devices using a dynamic data queue-driven FL (DDFL). Next, we leverage data entropy metrics to observe the process during each training round and enable reasonable device selection for aggregation. Furthermore, we provide a convergence analysis of our proposed DDFL to justify its viability in practical FL scenarios, aiming for better device selection, a non-suboptimal global model, and faster convergence. We observe that our approach results in a substantial accuracy boost of approximately 5% for the MNIST dataset, around 18% for CIFAR-10, and 20% for CIFAR-100 with a 10% global subset of data, outperforming the state-of-the-art (SOTA) aggregation algorithms.
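The two quantities named in the abstract, the Euclidean distance between a device's class distribution and the population distribution, and a data-entropy score used to pick devices, can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the exact entropy metric or selection rule, so the sketch assumes Shannon entropy over each device's label distribution and a "highest entropy first" rule. All function names are hypothetical.

```python
import math
from collections import Counter


def class_distribution(labels, num_classes):
    """Return the empirical class distribution of a list of integer labels."""
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    total = len(labels)
    return [counts.get(c, 0) / total for c in range(num_classes)]


def distribution_distance(device_dist, population_dist):
    """Euclidean distance between a device-level and the population class distribution."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(device_dist, population_dist)))


def shannon_entropy(dist):
    """Shannon entropy in bits (assumed metric; the abstract only says 'data entropy')."""
    return -sum(p * math.log2(p) for p in dist if p > 0)


def select_devices(device_labels, num_classes, top_k):
    """Assumed rule: keep the top_k devices with the highest label entropy.

    Ties are broken by device id so the result is deterministic.
    """
    scored = [
        (shannon_entropy(class_distribution(labels, num_classes)), dev_id)
        for dev_id, labels in device_labels.items()
    ]
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [dev_id for _, dev_id in scored[:top_k]]


if __name__ == "__main__":
    # Hypothetical devices: "a" holds a single class (the skewed case in the abstract).
    devices = {"a": [0, 0, 0, 0], "b": [0, 1, 2, 3], "c": [0, 0, 1, 1]}
    population = class_distribution([l for ls in devices.values() for l in ls], 4)
    for dev_id, labels in devices.items():
        dist = class_distribution(labels, 4)
        print(dev_id, distribution_distance(dist, population), shannon_entropy(dist))
    print(select_devices(devices, num_classes=4, top_k=2))
```

In this toy setup a single-class device has zero entropy and the largest distance from the population distribution, which is the skewed scenario the abstract associates with weight divergence. How DDFL's server-side global subset changes these distributions is not modeled here.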
Pages: 6646-6658
Page count: 13