WSCC: A Weight-Similarity-Based Client Clustering Approach for Non-IID Federated Learning

被引:48
|
作者
Tian, Pu [1 ]
Liao, Weixian [1 ]
Yu, Wei [1 ]
Blasch, Erik [2 ]
机构
[1] Towson Univ, Dept Comp & Informat Sci, Towson, MD 21252 USA
[2] Air Force Res Lab, Rome, NY 13441 USA
关键词
Collaborative work; Training; Servers; Internet of Things; Computational modeling; Data models; Convergence; Affinity propagation (AP); client clustering aggregation; cosine distance; nonindependent and identical distribution (Non-IID) federated learning; INTERNET; THINGS; EFFICIENT;
D O I
10.1109/JIOT.2022.3175149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The fast development of the Internet of Things (IoT) and deep learning enables learning useful patterns from the massive amount of collected data with sporadic nodes in IoT systems. Federated learning has received increasing attention in distributed machine learning where only intermediate parameters are exchanged with training samples that resided at local nodes. Nonetheless, most of the existing federated learning schemes assume a homogeneous distribution of data. The assumption, however, does not apply to IoT systems because of the heterogeneity of the IoT architecture. The nonindependent and identical distribution (non-IID) property in data volume and statistical distribution of IoT nodes can impact the performance of an aggregated global model that fits all nodes. Existing federated learning solutions for non-IID data sets either have to train additional models or require extra data exchange to check the node distribution. However, due to resource constraints in IoT systems, these approaches will increase the burden on limited computation capacity and cause network overhead. To address the issue, in this article, a novel weight-similarity-based client clustering (WSCC) approach is proposed, in which clients are split into different groups based on their data set distributions. An affinity-propagation-based method with the cosine distance of the client's weight parameters is designed to iteratively and automatically determine dynamic clusters. The proposed approach is ideal for IoT systems since there are no auxiliary models and extra data transmissions are needed. Through the theoretical convergence analysis and empirical results, we show that our proposed WSCC scheme outperforms the representative federated learning schemes under different non-IID settings, achieving up to 20% improvements in accuracy.
引用
收藏
页码:20243 / 20256
页数:14
相关论文
共 50 条
  • [1] Client Selection for Federated Learning With Non-IID Data in Mobile Edge Computing
    Zhang, Wenyu
    Wang, Xiumin
    Zhou, Pan
    Wu, Weiwei
    Zhang, Xinglin
    IEEE ACCESS, 2021, 9 : 24462 - 24474
  • [2] Federated Learning With Non-IID Data in Wireless Networks
    Zhao, Zhongyuan
    Feng, Chenyuan
    Hong, Wei
    Jiang, Jiamo
    Jia, Chao
    Quek, Tony Q. S.
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) : 1927 - 1942
  • [3] Federated Learning With Non-IID Data: A Survey
    Lu, Zili
    Pan, Heng
    Dai, Yueyue
    Si, Xueming
    Zhang, Yan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 19188 - 19209
  • [4] Federated Learning With Taskonomy for Non-IID Data
    Jamali-Rad, Hadi
    Abdizadeh, Mohammad
    Singh, Anuj
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8719 - 8730
  • [5] Reschedule Gradients: Temporal Non-IID Resilient Federated Learning
    You, Xianyao
    Liu, Ximeng
    Jiang, Nan
    Cai, Jianping
    Ying, Zuobin
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 747 - 762
  • [6] FedCiR: Client-Invariant Representation Learning for Federated Non-IID Features
    Li, Zijian
    Lin, Zehong
    Shao, Jiawei
    Mao, Yuyi
    Zhang, Jun
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (11) : 10509 - 10522
  • [7] Long-Term Client Selection for Federated Learning With Non-IID Data: A Truthful Auction Approach
    Tan, Jinghong
    Liu, Zhian
    Guo, Kun
    Zhao, Mingxiong
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 4953 - 4970
  • [8] FlGan: GAN-Based Unbiased Federated Learning Under Non-IID Settings
    Ma, Zhuoran
    Liu, Yang
    Miao, Yinbin
    Xu, Guowen
    Liu, Ximeng
    Ma, Jianfeng
    Deng, Robert H.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (04) : 1566 - 1581
  • [9] Feature Matching Data Synthesis for Non-IID Federated Learning
    Li, Zijian
    Sun, Yuchang
    Shao, Jiawei
    Mao, Yuyi
    Wang, Jessie Hui
    Zhang, Jun
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9352 - 9367
  • [10] IOFL: Intelligent-Optimization-Based Federated Learning for Non-IID Data
    Li, Xinyan
    Zhao, Huimin
    Deng, Wu
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (09): : 16693 - 16699