GAN-based data reconstruction attacks in split learning

被引:0
|
作者
Zeng, Bo [1 ]
Luo, Sida [1 ]
Yu, Fangchao [1 ]
Yang, Geying [1 ]
Zhao, Kai [1 ]
Wang, Lina [1 ]
机构
[1] Wuhan Univ, Sch Cyber Sci & Engn, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed privacy-preserving machine; learning; Split learning; Data reconstruction attacks; Model inversion; Generative adversarial networks;
D O I
10.1016/j.neunet.2025.107150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the distinctive distributed privacy-preserving architecture, split learning has found widespread application in scenarios where computational resources on the client side are limited. Unlike clients in federated learning retaining the whole model, split learning partitions the model into two segments situated separately on the server and client ends, thereby preventing direct access to the complete model structure by either party and fortifying its resilience against attacks. However, existing studies have demonstrated that even with access restricted to partial model outputs, split learning remains susceptible to data reconstruction attacks. This vulnerability persists despite prior research predominantly relying on stringent assumptions and the attacker being the server with the ability to access global information. Building upon this understanding, we devise GAN-based data reconstruction attacks within the U-shaped split learning framework, meticulously examining and confirming the feasibility of attacks initiated from both server and client sides, along with the underlying assumptions. Specifically, for attacks originating from the server, we propose the Model Approximation E stimation Reconstruction Attack (MAERA) to mitigate the requisite prior assumptions, and we also introduce the Distillation-based Client-side Reconstruction Attack (DCRA) to execute data reconstructions from the client for the first time. Experimental results illustrate the effectiveness and the robustness of the proposed frameworks in launching attacks across various datasets. In particular, MAERA necessitates merely 1% of the test set samples and 1% of the private data samples from the CIFAR100 dataset to unleash effective attacks, while DCRA adeptly expropriates models from clients and yields more pronounced reconstruction effects on target class samples during the process of inferring data distribution characteristics, in contrast to conventional Maximum A Posteriori (MAP) estimation algorithms.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Global Ionospheric Total Electron Content Completion with a GAN-Based Deep Learning Framework
    Yang, Kunlin
    Liu, Yang
    REMOTE SENSING, 2022, 14 (23)
  • [42] FlGan: GAN-Based Unbiased Federated Learning Under Non-IID Settings
    Ma, Zhuoran
    Liu, Yang
    Miao, Yinbin
    Xu, Guowen
    Liu, Ximeng
    Ma, Jianfeng
    Deng, Robert H.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (04) : 1566 - 1581
  • [43] AEC_GAN: Unbalanced Data Processing Decision-Making in Network Attacks Based on ACGAN and Machine Learning
    Zhu, Naibo
    Zhao, Guangyu
    Yang, Yang
    Yang, Han
    Liu, Zhi
    IEEE ACCESS, 2023, 11 : 52452 - 52465
  • [44] Improving imbalanced medical image classification through GAN-based data augmentation methods
    Ding, Hongwei
    Huang, Nana
    Wu, Yaoxin
    Cui, Xiaohui
    PATTERN RECOGNITION, 2025, 166
  • [45] Producing More with Less: A GAN-based Network Attack Detection Approach for Imbalanced Data
    Hao, Xingran
    Jiang, Zhengwei
    Xiao, Qingsai
    Wang, Qiuyun
    Yao, Yepeng
    Liu, Baoxu
    Liu, Jian
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 384 - 390
  • [46] Tabular GAN-Based Oversampling of Imbalanced Time-to-Event Data for Survival Prediction
    Tan, Huaning
    Chen, Renxing
    Qin, Meng
    Tang, Lining
    Wu, Zhibing
    Luo, Qianlin
    Quan, Yujuan
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 376 - 380
  • [47] GAN-based generation of realistic compressible-flow samples from incomplete data
    Abaidi, R.
    Adams, N. A.
    COMPUTERS & FLUIDS, 2024, 269
  • [48] GAN-Based Training of Semi-Interpretable Generators for Biological Data Interpolation and Augmentation
    Tsourtis, Anastasios
    Papoutsoglou, Georgios
    Pantazis, Yannis
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [49] GAN-BASED SYNTHETIC GASTROINTESTINAL IMAGE GENERATION
    Adjei, Prince E.
    Lonsek, Zenebe M.
    Rao, Nini
    2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 338 - 342
  • [50] GAN-based Matrix Factorization for Recommender Systems
    Dervishaj, Ervin
    Cremonesi, Paolo
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1373 - 1381