Towards Unsupervised Sudden Data Drift Detection in Federated Learning with Fuzzy Clustering

Cited by: 0
Authors
Stallmann, Morris [1]
Wilbik, Anna [1]
Weiss, Gerhard [1]
Affiliations
[1] Maastricht Univ, Dept Adv Comp Sci, Maastricht, Netherlands
Source
2024 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ-IEEE 2024 | 2024
Keywords
federated learning; fuzzy clustering; unsupervised; drift; drift detection; federated drift detection; federated data drift detection; FCM
DOI
10.1109/FUZZ-IEEE60900.2024.10611883
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Federated learning (FL) is a machine learning (ML) discipline that allows ML models to be trained on distributed data without revealing raw data instances. It promises to enable ML in environments with data sharing constraints, e.g., due to data privacy concerns or other considerations. Data drift and concept drift commonly refer to unpredictable changes in data distributions over time. Such drift is known to impact an ML model's performance in many real-world scenarios. While drift detection and adaptation have been studied extensively in the non-federated setting, they remain less explored in the FL setting. The private and distributed nature of data in FL makes drift detection much harder, since no entity can oversee all data instances to estimate changes in the global data distribution. In this paper, we propose a novel unsupervised federated data drift detection method that is based on federated fuzzy c-means clustering and the federated fuzzy Davies-Bouldin index, a global cluster validation metric. First, using the federated fuzzy c-means clustering algorithm, an initial global data model is learned. Second, the federated fuzzy Davies-Bouldin index Δ is calculated, estimating how well the data fits the learned model. Third, whenever a new batch of data is available at time t, the fit between the initial data model and the new data is evaluated through the federated fuzzy Davies-Bouldin index Δ(t). Finally, Δ and Δ(t) are compared to detect drift. The method is unsupervised, as it does not require any labels, and it detects global data drift while keeping all data private. We evaluate our method carefully in a controlled environment by simulating multiple federated drift scenarios. We observe promising results, as the method rarely signals false-positive alarms and detects drift in multiple scenarios. We also observe shortcomings, such as sensitivity to parameter choices and a low detection rate when only a few data points in a new batch of data are affected by drift.
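The four steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formula of the federated fuzzy Davies-Bouldin index or the rule for comparing Δ and Δ(t), so the membership-weighted scatter, the relative `tolerance` threshold, all function names, and the synthetic two-client data below are assumptions made for illustration. Clients share only per-cluster sums with the server, never raw points; secure aggregation is omitted.

```python
import numpy as np

def client_stats(X, centers, m=2.0):
    """Local step: per-cluster sums computed on one client's private data."""
    diff = X[:, None, :] - centers[None, :, :]
    d2 = np.maximum((diff ** 2).sum(axis=2), 1e-24)    # squared distances (n, k)
    inv = d2 ** (-1.0 / (m - 1.0))
    u_m = (inv / inv.sum(axis=1, keepdims=True)) ** m  # fuzzy memberships ^ m
    # Weighted point sums, membership mass, weighted scatter (all per cluster).
    return u_m.T @ X, u_m.sum(axis=0), (u_m * d2).sum(axis=0)

def federated_fcm(clients, centers, m=2.0, rounds=50):
    """Server loop: update global centers from aggregated client sums."""
    for _ in range(rounds):
        stats = [client_stats(X, centers, m) for X in clients]
        num = sum(s[0] for s in stats)
        den = sum(s[1] for s in stats)
        centers = num / den[:, None]
    return centers

def federated_fuzzy_db(clients, centers, m=2.0):
    """Assumed fuzzy Davies-Bouldin index built from aggregated sums."""
    stats = [client_stats(X, centers, m) for X in clients]
    den = sum(s[1] for s in stats)
    scat = sum(s[2] for s in stats)
    S = np.sqrt(scat / den)                            # per-cluster scatter
    dist = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)                     # exclude j == l
    return ((S[:, None] + S[None, :]) / dist).max(axis=1).mean()

def drift_detected(delta_ref, delta_t, tolerance=1.5):
    """Hypothetical decision rule: flag drift if the index grows markedly."""
    return bool(delta_t > tolerance * delta_ref)

def make_clients(rng, shift=0.0, n=200):
    """Synthetic data: two clients, each with points around (0,0) and (5,5)."""
    means = np.array([[0.0, 0.0], [5.0, 5.0]]) + shift
    return [np.vstack([rng.normal(mu, 0.5, size=(n, 2)) for mu in means])
            for _ in range(2)]

rng = np.random.default_rng(0)
init = np.array([[1.0, 1.0], [4.0, 4.0]])

# Steps 1 and 2: learn the global model and the reference index Delta.
reference = make_clients(rng)
centers = federated_fcm(reference, init)
delta = federated_fuzzy_db(reference, centers)

# Steps 3 and 4: score new batches against the fixed model and compare.
delta_same = federated_fuzzy_db(make_clients(rng), centers)
delta_shift = federated_fuzzy_db(make_clients(rng, shift=2.5), centers)
print("no drift:", drift_detected(delta, delta_same))
print("sudden drift:", drift_detected(delta, delta_shift))
```

In this toy setup the index stays close to its reference value for a batch drawn from the same distribution and grows several-fold when every point is shifted, which is the comparison the abstract describes. As the abstract notes, a batch in which only a few points drift would change the index far less and could be missed.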
Pages: 8