A Clustered Federated Learning Method of User Behavior Analysis Based on Non-IID Data

被引:1
作者
Zhang, Jianfei [1 ]
Li, Zhongxin [1 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130000, Peoples R China
关键词
federated learning; Non-IID; user behavior; user modeling;
D O I
10.3390/electronics12071660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) is a novel distributed machine learning paradigm. It can protect data privacy in distributed machine learning. Hence, FL provides new ideas for user behavior analysis. User behavior analysis can be modeled using multiple data sources. However, differences between different data sources can lead to different data distributions, i.e., non-identically and non-independently distributed (Non-IID). Non-IID data usually introduce bias in the training process of FL models, which will affect the model accuracy and convergence speed. In this paper, a new federated learning algorithm is proposed to mitigate the impact of Non-IID data on the model, named federated learning with a two-tier caching mechanism (FedTCM). First, FedTCM clustered similar clients based on their data distribution. Clustering reduces the extent of Non-IID between clients in a cluster. Second, FedTCM uses asynchronous communication methods to alleviate the problem of inconsistent computation speed across different clients. Finally, FedTCM sets up a two-tier caching mechanism on the server for mitigating the Non-IID data between different clusters. In multiple simulated datasets, compared to the method without the federated framework, the FedTCM is maximum 15.8% higher than it and average 12.6% higher than it. Compared to the typical federated method FedAvg, the accuracy of FedTCM is maximum 2.3% higher than it and average 1.6% higher than it. Additionally, FedTCM achieves more excellent communication performance than FedAvg.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Adaptive Federated Learning With Non-IID Data
    Zeng, Yan
    Mu, Yuankai
    Yuan, Junfeng
    Teng, Siyuan
    Zhang, Jilin
    Wan, Jian
    Ren, Yongjian
    Zhang, Yunquan
    COMPUTER JOURNAL, 2023, 66 (11) : 2758 - 2772
  • [2] Clustered Data Sharing for Non-IID Federated Learning over Wireless Networks
    Hu, Gang
    Teng, Yinglei
    Wang, Nan
    Yu, F. Richard
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 1175 - 1180
  • [3] CCSF: Clustered Client Selection Framework for Federated Learning in non-IID Data
    Mohamed, Aissa H.
    de Souza, Allan M.
    da Costa, Joahannes B. D.
    Villas, Leandro A.
    Dos Reis, Julio C.
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,
  • [4] Federated learning on non-IID data: A survey
    Zhu, Hangyu
    Xu, Jinjin
    Liu, Shiqing
    Jin, Yaochu
    NEUROCOMPUTING, 2021, 465 : 371 - 390
  • [5] Data augmentation scheme for federated learning with non-IID data
    Tang L.
    Wang D.
    Liu S.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (01): : 164 - 176
  • [6] Federated Learning With Taskonomy for Non-IID Data
    Jamali-Rad, Hadi
    Abdizadeh, Mohammad
    Singh, Anuj
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8719 - 8730
  • [7] An Optimization Method for Non-IID Federated Learning Based on Deep Reinforcement Learning
    Meng, Xutao
    Li, Yong
    Lu, Jianchao
    Ren, Xianglin
    SENSORS, 2023, 23 (22)
  • [8] FLIS: Clustered Federated Learning Via Inference Similarity for Non-IID Data Distribution
    Morafah, Mahdi
    Vahidian, Saeed
    Wang, Weijia
    Lin, Bill
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2023, 4 : 109 - 120
  • [9] Data independent warmup scheme for non-IID federated learning
    Arafeh, Mohamad
    Ould-Slimane, Hakima
    Otrok, Hadi
    Mourad, Azzam
    Talhi, Chamseddine
    Damiani, Ernesto
    INFORMATION SCIENCES, 2023, 623 : 342 - 360
  • [10] A Comprehensive Study on Personalized Federated Learning with Non-IID Data
    Yu, Menghang
    Zheng, Zhenzhe
    Li, Qinya
    Wu, Fan
    Zheng, Jiaqi
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 40 - 49