WSCC: A Weight-Similarity-Based Client Clustering Approach for Non-IID Federated Learning

被引:48
|
作者
Tian, Pu [1 ]
Liao, Weixian [1 ]
Yu, Wei [1 ]
Blasch, Erik [2 ]
机构
[1] Towson Univ, Dept Comp & Informat Sci, Towson, MD 21252 USA
[2] Air Force Res Lab, Rome, NY 13441 USA
关键词
Collaborative work; Training; Servers; Internet of Things; Computational modeling; Data models; Convergence; Affinity propagation (AP); client clustering aggregation; cosine distance; nonindependent and identical distribution (Non-IID) federated learning; INTERNET; THINGS; EFFICIENT;
D O I
10.1109/JIOT.2022.3175149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The fast development of the Internet of Things (IoT) and deep learning enables learning useful patterns from the massive amount of collected data with sporadic nodes in IoT systems. Federated learning has received increasing attention in distributed machine learning where only intermediate parameters are exchanged with training samples that resided at local nodes. Nonetheless, most of the existing federated learning schemes assume a homogeneous distribution of data. The assumption, however, does not apply to IoT systems because of the heterogeneity of the IoT architecture. The nonindependent and identical distribution (non-IID) property in data volume and statistical distribution of IoT nodes can impact the performance of an aggregated global model that fits all nodes. Existing federated learning solutions for non-IID data sets either have to train additional models or require extra data exchange to check the node distribution. However, due to resource constraints in IoT systems, these approaches will increase the burden on limited computation capacity and cause network overhead. To address the issue, in this article, a novel weight-similarity-based client clustering (WSCC) approach is proposed, in which clients are split into different groups based on their data set distributions. An affinity-propagation-based method with the cosine distance of the client's weight parameters is designed to iteratively and automatically determine dynamic clusters. The proposed approach is ideal for IoT systems since there are no auxiliary models and extra data transmissions are needed. Through the theoretical convergence analysis and empirical results, we show that our proposed WSCC scheme outperforms the representative federated learning schemes under different non-IID settings, achieving up to 20% improvements in accuracy.
引用
收藏
页码:20243 / 20256
页数:14
相关论文
共 50 条
  • [41] Resource-Efficient Federated Learning With Non-IID Data: An Auction Theoretic Approach
    Seo, Eunil
    Niyato, Dusit
    Elmroth, Erik
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25506 - 25524
  • [42] Privacy-Preserving Asynchronous Federated Learning Under Non-IID Settings
    Miao, Yinbin
    Kuang, Da
    Li, Xinghua
    Xu, Shujiang
    Li, Hongwei
    Choo, Kim-Kwang Raymond
    Deng, Robert H.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5828 - 5841
  • [43] FedSG: A Personalized Subgraph Federated Learning Framework on Multiple Non-IID Graphs
    Wang, Yingcheng
    Guo, Songtao
    Qiao, Dewen
    Liu, Guiyan
    Li, Mingyan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05): : 3678 - 3690
  • [44] Improving Accuracy and Convergence in Group-Based Federated Learning on Non-IID Data
    He, Ziqi
    Yang, Lei
    Lin, Wanyu
    Wu, Weigang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (03): : 1389 - 1404
  • [45] Energy-Efficient Active Federated Learning on Non-IID Data
    Shullary, M. Hashem
    Abdellatif, Alaa Awad
    Massoudn, Yehia
    2022 IEEE 65TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS 2022), 2022,
  • [46] CONVERGENCE ANALYSIS OF SEMI-FEDERATED LEARNING WITH NON-IID DATA
    Ni, Wanli
    Han, Jiachen
    Qin, Zhijin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 214 - 218
  • [47] FedGAN: A Federated Semi-supervised Learning from Non-IID Data
    Zhao, Chen
    Gao, Zhipeng
    Wang, Qian
    Mo, Zijia
    Yu, Xinlei
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 181 - 192
  • [48] Multi-Modal Federated Learning for Cancer Staging Over Non-IID Datasets With Unbalanced Modalities
    Borazjani, Kasra
    Khosravan, Naji
    Ying, Leslie
    Hosseinalipour, Seyyedali
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 556 - 573
  • [49] FedArtML: A Tool to Facilitate the Generation of Non-IID Datasets in a Controlled Way to Support Federated Learning Research
    Gutierrez, Daniel Mauricio Jimenez
    Anagnostopoulos, Aris
    Chatzigiannakis, Ioannis
    Vitaletti, Andrea
    IEEE ACCESS, 2024, 12 : 81004 - 81016
  • [50] BCE-FL: A Secure and Privacy-Preserving Federated Learning System for Device Fault Diagnosis Under Non-IID Condition in IIoT
    Xiao, Yiming
    Shao, Haidong
    Lin, Jian
    Huo, Zhiqiang
    Liu, Bin
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 14241 - 14252