TabFedSL: A Self-Supervised Approach to Labeling Tabular Data in Federated Learning Environments

被引:1
|
作者
Wang, Ruixiao [1 ]
Hu, Yanxin [1 ]
Chen, Zhiyu [1 ,2 ]
Guo, Jianwei [1 ]
Liu, Gang [1 ,2 ]
机构
[1] Changchun Univ Technol, Sch Comp Sci & Engn, Changchun 130102, Peoples R China
[2] Jilin Prov Data Serv Ind Publ Technol Res Ctr, Changchun 130102, Peoples R China
关键词
Federated Learning; self-supervised learning; tabular data; deep learning; FRAMEWORK;
D O I
10.3390/math12081158
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Currently, self-supervised learning has shown effectiveness in solving data labeling issues. Its success mainly depends on having access to large, high-quality datasets with diverse features. It also relies on utilizing the spatial, temporal, and semantic structures present in the data. However, domains such as finance, healthcare, and insurance primarily utilize tabular data formats. This presents challenges for traditional data augmentation methods aimed at improving data quality. Furthermore, the privacy-sensitive nature of these domains complicates the acquisition of the extensive, high-quality datasets necessary for training effective self-supervised models. To tackle these challenges, our proposal introduces a novel framework that combines self-supervised learning with Federated Learning (FL). This approach aims to solve the problem of data-distributed training while ensuring training quality. Our framework improves upon the conventional self-supervised learning data augmentation paradigm by incorporating data labeling through the segmentation of data into subsets. Our framework adds noise by splitting subsets of data and can achieve the same level of centralized learning in a distributed environment. Moreover, we conduct experiments on various public tabular datasets to evaluate our approach. The experimental results showcase the effectiveness and generalizability of our proposed method in scenarios involving unlabeled data and distributed settings.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A survey on self-supervised learning for non-sequential tabular data
    Wang, Wei-Yao
    Du, Wei-Wei
    Xu, Derek
    Wang, Wei
    Peng, Wen-Chih
    MACHINE LEARNING, 2025, 114 (01)
  • [2] SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning
    Ucar, Talip
    Hajiramezanali, Ehsan
    Edwards, Lindsay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [3] Tabular-based self-supervised learning approach for encrypted traffic classification
    Zheng, Xuan
    Ma, Xiuli
    Jin, Yanliang
    Gu, Dongsheng
    Wang, Rui
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [4] Federated Self-supervised Learning for Video Understanding
    Rehman, Yasar Abbas Ur
    Gao, Yan
    Shen, Jiajun
    de Gusmao, Pedro Porto Buarque
    Lane, Nicholas
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 506 - 522
  • [5] FedLID: Self-Supervised Federated Learning for Leveraging Limited Image Data
    Psaltis, Athanasios
    Kastellos, Anestis
    Patrikakis, Charalampos Z.
    Daras, Petros
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1031 - 1040
  • [6] Understanding the limitations of self-supervised learning for tabular anomaly detection
    Mai, Kimberly T.
    Davies, Toby
    Griffin, Lewis D.
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [7] Federated Self-Supervised Learning in Heterogeneous Settings: Limits of a Baseline Approach on HAR
    Sannara, E. K.
    Rombourg, Romain
    Portet, Francois
    Lalanda, Philippe
    2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022,
  • [8] FEDERATED SELF-SUPERVISED LEARNING FOR ACOUSTIC EVENT CLASSIFICATION
    Feng, Meng
    Kao, Chieh-Chi
    Tang, Qingming
    Sun, Ming
    Rozgic, Viktor
    Matsoukas, Spyros
    Wang, Chao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 481 - 485
  • [9] BADFSS: Backdoor Attacks on Federated Self-Supervised Learning
    Zhang, Jiale
    Zhu, Chengcheng
    Di Wu
    Sun, Xiaobing
    Yong, Jianming
    Long, Guodong
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 548 - 558
  • [10] A Deep Cut Into Split Federated Self-Supervised Learning
    Przewiezlikowski, Marcin
    Osial, Marcin
    Zielinski, Bartosz
    Smieja, Marek
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 444 - 459