WSCC: A Weight-Similarity-Based Client Clustering Approach for Non-IID Federated Learning

被引:48
|
作者
Tian, Pu [1 ]
Liao, Weixian [1 ]
Yu, Wei [1 ]
Blasch, Erik [2 ]
机构
[1] Towson Univ, Dept Comp & Informat Sci, Towson, MD 21252 USA
[2] Air Force Res Lab, Rome, NY 13441 USA
关键词
Collaborative work; Training; Servers; Internet of Things; Computational modeling; Data models; Convergence; Affinity propagation (AP); client clustering aggregation; cosine distance; nonindependent and identical distribution (Non-IID) federated learning; INTERNET; THINGS; EFFICIENT;
D O I
10.1109/JIOT.2022.3175149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The fast development of the Internet of Things (IoT) and deep learning enables learning useful patterns from the massive amount of collected data with sporadic nodes in IoT systems. Federated learning has received increasing attention in distributed machine learning where only intermediate parameters are exchanged with training samples that resided at local nodes. Nonetheless, most of the existing federated learning schemes assume a homogeneous distribution of data. The assumption, however, does not apply to IoT systems because of the heterogeneity of the IoT architecture. The nonindependent and identical distribution (non-IID) property in data volume and statistical distribution of IoT nodes can impact the performance of an aggregated global model that fits all nodes. Existing federated learning solutions for non-IID data sets either have to train additional models or require extra data exchange to check the node distribution. However, due to resource constraints in IoT systems, these approaches will increase the burden on limited computation capacity and cause network overhead. To address the issue, in this article, a novel weight-similarity-based client clustering (WSCC) approach is proposed, in which clients are split into different groups based on their data set distributions. An affinity-propagation-based method with the cosine distance of the client's weight parameters is designed to iteratively and automatically determine dynamic clusters. The proposed approach is ideal for IoT systems since there are no auxiliary models and extra data transmissions are needed. Through the theoretical convergence analysis and empirical results, we show that our proposed WSCC scheme outperforms the representative federated learning schemes under different non-IID settings, achieving up to 20% improvements in accuracy.
引用
收藏
页码:20243 / 20256
页数:14
相关论文
共 50 条
  • [31] FedLC: Optimizing Federated Learning in Non-IID Data via Label-Wise Clustering
    Lee, Hunmin
    Seo, Daehee
    IEEE ACCESS, 2023, 11 : 42082 - 42095
  • [32] Ensemble Federated Learning With Non-IID Data in Wireless Networks
    Zhao, Zhongyuan
    Wang, Jingyi
    Hong, Wei
    Quek, Tony Q. S.
    Ding, Zhiguo
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (04) : 3557 - 3571
  • [33] Logit Calibration and Feature Contrast for Robust Federated Learning on Non-IID Data
    Qiao, Yu
    Zhang, Chaoning
    Adhikary, Apurba
    Hong, Choong Seon
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (02): : 636 - 652
  • [34] Federated Analytics Informed Distributed Industrial IoT Learning With Non-IID Data
    Wang, Zibo
    Zhu, Yifei
    Wang, Dan
    Han, Zhu
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2924 - 2939
  • [35] Optimal Model Transfer and Dynamic Parameter Server Selection for Efficient Federated Learning in IoT-Edge Systems With Non-IID Data
    Mengistu, Tesfahunegn Minwuyelet
    Lin, Jenn-Wei
    Kuo, Po-Hsien
    Kim, Taewoon
    IEEE ACCESS, 2024, 12 : 157954 - 157974
  • [36] Fast converging Federated Learning with Non-IID Data
    Naas, Si -Ahmed
    Sigg, Stephan
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [37] Channel-Aware Joint AoI and Diversity Optimization for Client Scheduling in Federated Learning With Non-IID Datasets
    Ma, Manyou
    Wong, Vincent W. S.
    Schober, Robert
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 6295 - 6311
  • [38] AdaDpFed: A Differentially Private Federated Learning Algorithm With Adaptive Noise on Non-IID Data
    Zhao, Zirun
    Sun, Yi
    Bashir, Ali Kashif
    Lin, Zhaowen
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2536 - 2545
  • [39] Mitigating Update Conflict in Non-IID Federated Learning via Orthogonal Class Gradients
    Guo, Siyang
    Guo, Yaming
    Zhang, Hui
    Wang, Junbo
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (04) : 2967 - 2978
  • [40] Hypernetworks-Based Hierarchical Federated Learning on Hybrid Non-IID Datasets for Digital Twin in Industrial IoT
    Yang, Jihao
    Jiang, Wen
    Nie, Laisen
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1413 - 1423