WSCC: A Weight-Similarity-Based Client Clustering Approach for Non-IID Federated Learning

被引:48
|
作者
Tian, Pu [1 ]
Liao, Weixian [1 ]
Yu, Wei [1 ]
Blasch, Erik [2 ]
机构
[1] Towson Univ, Dept Comp & Informat Sci, Towson, MD 21252 USA
[2] Air Force Res Lab, Rome, NY 13441 USA
关键词
Collaborative work; Training; Servers; Internet of Things; Computational modeling; Data models; Convergence; Affinity propagation (AP); client clustering aggregation; cosine distance; nonindependent and identical distribution (Non-IID) federated learning; INTERNET; THINGS; EFFICIENT;
D O I
10.1109/JIOT.2022.3175149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The fast development of the Internet of Things (IoT) and deep learning enables learning useful patterns from the massive amount of collected data with sporadic nodes in IoT systems. Federated learning has received increasing attention in distributed machine learning where only intermediate parameters are exchanged with training samples that resided at local nodes. Nonetheless, most of the existing federated learning schemes assume a homogeneous distribution of data. The assumption, however, does not apply to IoT systems because of the heterogeneity of the IoT architecture. The nonindependent and identical distribution (non-IID) property in data volume and statistical distribution of IoT nodes can impact the performance of an aggregated global model that fits all nodes. Existing federated learning solutions for non-IID data sets either have to train additional models or require extra data exchange to check the node distribution. However, due to resource constraints in IoT systems, these approaches will increase the burden on limited computation capacity and cause network overhead. To address the issue, in this article, a novel weight-similarity-based client clustering (WSCC) approach is proposed, in which clients are split into different groups based on their data set distributions. An affinity-propagation-based method with the cosine distance of the client's weight parameters is designed to iteratively and automatically determine dynamic clusters. The proposed approach is ideal for IoT systems since there are no auxiliary models and extra data transmissions are needed. Through the theoretical convergence analysis and empirical results, we show that our proposed WSCC scheme outperforms the representative federated learning schemes under different non-IID settings, achieving up to 20% improvements in accuracy.
引用
收藏
页码:20243 / 20256
页数:14
相关论文
共 50 条
  • [21] FedPKR: Federated Learning With Non-IID Data via Periodic Knowledge Review in Edge Computing
    Wang, Jinbo
    Wang, Ruijin
    Xu, Guangquan
    He, Donglin
    Pei, Xikai
    Zhang, Fengli
    Gan, Jie
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 9 (06): : 902 - 912
  • [22] Non-IID Federated Learning With Sharper Risk Bound
    Wei, Bojian
    Li, Jian
    Liu, Yong
    Wang, Weiping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 6906 - 6917
  • [23] FedRFC: Federated Learning with Recursive Fuzzy Clustering for improved non-IID data training
    Deng, Yuxiao
    Wang, Anqi
    Zhang, Lei
    Lei, Ying
    Li, Beibei
    Li, Yizhou
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 160 : 835 - 843
  • [24] Differentially Private Federated Learning on Non-iid Data: Convergence Analysis and Adaptive Optimization
    Chen, Lin
    Ding, Xiaofeng
    Bao, Zhifeng
    Zhou, Pan
    Jin, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (09) : 4567 - 4581
  • [25] Age-of-Information Minimization in Federated Learning Based Networks With Non-IID Dataset
    Wang, Kaidi
    Ding, Zhiguo
    So, Daniel K. C.
    Ding, Zhi
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 8939 - 8953
  • [26] FedSea: Federated Learning via Selective Feature Alignment for Non-IID Multimodal Data
    Tan, Min
    Feng, Yinfu
    Chu, Lingqiang
    Shi, Jingcheng
    Xiao, Rong
    Tang, Haihong
    Yu, Jun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5807 - 5822
  • [27] Digital Twin-Empowered Federated Incremental Learning for Non-IID Privacy Data
    Wang, Qian
    Chen, Siguang
    Wu, Meng
    Li, Xue
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 3860 - 3877
  • [28] Tackling the Non-IID Issue in Heterogeneous Federated Learning by Gradient Harmonization
    Zhang, Xinyu
    Sun, Weiyu
    Chen, Ying
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2595 - 2599
  • [29] Enhancing Federated Learning With Pattern-Based Client Clustering
    Gao, Yuan
    Lin, Ziyue
    Gong, Maoguo
    Zhang, Yuanqiao
    Zhang, Yihong
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (24): : 40365 - 40375
  • [30] Federated Learning-Based IoT Intrusion Detection on Non-IID Data
    Huang, Wenxuan
    Tiropanis, Thanassis
    Konstantinidis, George
    INTERNET OF THINGS, GIOTS 2022, 2022, 13533 : 326 - 337