Towards Unsupervised Sudden Data Drift Detection in Federated Learning with Fuzzy Clustering

被引:0
|
作者
Stallmann, Morris [1 ]
Wilbik, Anna [1 ]
Weiss, Gerhard [1 ]
机构
[1] Maastricht Univ, Dept Adv Comp Sci, Maastricht, Netherlands
来源
2024 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ-IEEE 2024 | 2024年
关键词
federated learning; fuzzy clustering; unsupervised; drift; drift detection; federated drift detection; federated data drift detection; FCM;
D O I
10.1109/FUZZ-IEEE60900.2024.10611883
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) is a machine learning (ML) discipline that allows to train ML models on distributed data without revealing raw data instances. It promises to enable ML in environments with data sharing constraints, e.g., due to data privacy concerns, or other considerations. Data and concept drift are commonly referred to as unpredictable changes in data distributions over time. It is known to impact a ML model's performances in many real-world scenarios. While drift detection and adaptation has been studied extensively in the non-federated setting, it is still less explored in the FL setting. The private and distributed nature of data in FL makes drift detection much harder in FL since no entity can oversee all data instances to estimate changes in the global data distribution. In this paper, we propose a novel unsupervised federated data drift detection method that is based on federated fuzzy c-means clustering and the federated fuzzy Davies-Bouldin index, a global cluster validation metric. First, using the federated fuzzy c-means clustering algorithm, an initial global data model is learned. Second, the federated fuzzy Davies-Bouldin index . is calculated estimating how well the data fits the learned model. Third, whenever a new batch of data is available at time t, the fit of initial data model and new data is evaluated through the federated fuzzy Davies-Bouldin index Delta(t). Finally Delta and Delta(t) are compared to detect drift. The method is unsupervised as it does not require any labels and detects global data drift while keeping all data private. We evaluate our method carefully in a controlled environment by simulating multiple federated drift scenarios. We observe promising results as it rarely signals false positive alarms and detects drift in multiple scenarios. We also observe short-comings such as sensitivity to parameter choices and low detection rate in case only few data points in a new batch of data are affected by drift.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] UDDT: An Unsupervised Drift Detection Method for Industrial Time Series Data
    Maduskar, Deepti
    Sharma, Divyasheel
    Chandrika, K. R.
    Borrison, Reuben
    Manca, Gianluca
    Dix, Marcel
    2023 IEEE 2ND INDUSTRIAL ELECTRONICS SOCIETY ANNUAL ON-LINE CONFERENCE, ONCON, 2023,
  • [32] Unsupervised concept drift detection for multi-label data streams
    Gulcan, Ege Berkay
    Can, Fazli
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 2401 - 2434
  • [33] Unsupervised concept drift detection for multi-label data streams
    Ege Berkay Gulcan
    Fazli Can
    Artificial Intelligence Review, 2023, 56 : 2401 - 2434
  • [34] Cellular Network Antenna Tilt Anomaly Detection Using Federated Unsupervised Learning
    Mulvey, David
    Foh, Chuan Heng
    Imran, Muhammad Ali
    Tafazolli, Rahim
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3048 - 3053
  • [35] FLARE: Detection and Mitigation of Concept Drift for Federated Learning based IoT Deployments
    Chow, Theo
    Raza, Usman
    Mavromatis, Ioannis
    Khan, Aftab
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 989 - 995
  • [36] First Steps Towards Federated Learning Network Traffic Detection
    Boiano, Antonio
    Detomas, Valeria
    Redondi, Alessandro E. C.
    Cesana, Matteo
    PROCEEDINGS OF THE 8TH NETWORK TRAFFIC MEASUREMENT AND ANALYSIS CONFERENCE, TMA 2024, 2024,
  • [37] Energy-efficient Clustering to Address Data Heterogeneity in Federated Learning
    Luo, Yibo
    Liu, Xuefeng
    Xiu, Jianwei
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [38] SoFL: Clustered Federated Learning Based on Dual Clustering for Heterogeneous Data
    Zhang, Jianfei
    Qiao, Zhiming
    ELECTRONICS, 2024, 13 (18)
  • [39] Towards Superior Pruning Performance in Federated Learning with Discriminative Data
    Yang, Yinan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2025, E108D (01) : 23 - 36
  • [40] Federated Learning for Anomaly Detection in Maritime Movement Data
    Graser, Anita
    Weissenfeld, Axel
    Heistracher, Clemens
    Dragaschnig, Melitta
    Widhalm, Peter
    PROCEEDINGS OF THE 2024 25TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT, MDM 2024, 2024, : 77 - 82