GAN-based data reconstruction attacks in split learning

被引：0

作者：

Zeng, Bo ^{[1
]}

Luo, Sida ^{[1
]}

Yu, Fangchao ^{[1
]}

Yang, Geying ^{[1
]}

Zhao, Kai ^{[1
]}

Wang, Lina ^{[1
]}

机构：

[1] Wuhan Univ, Sch Cyber Sci & Engn, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan 430072, Peoples R China

来源：

NEURAL NETWORKS | 2025年 / 185卷

基金：

中国国家自然科学基金;

关键词：

Distributed privacy-preserving machine; learning; Split learning; Data reconstruction attacks; Model inversion; Generative adversarial networks;

D O I：

10.1016/j.neunet.2025.107150

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the distinctive distributed privacy-preserving architecture, split learning has found widespread application in scenarios where computational resources on the client side are limited. Unlike clients in federated learning retaining the whole model, split learning partitions the model into two segments situated separately on the server and client ends, thereby preventing direct access to the complete model structure by either party and fortifying its resilience against attacks. However, existing studies have demonstrated that even with access restricted to partial model outputs, split learning remains susceptible to data reconstruction attacks. This vulnerability persists despite prior research predominantly relying on stringent assumptions and the attacker being the server with the ability to access global information. Building upon this understanding, we devise GAN-based data reconstruction attacks within the U-shaped split learning framework, meticulously examining and confirming the feasibility of attacks initiated from both server and client sides, along with the underlying assumptions. Specifically, for attacks originating from the server, we propose the Model Approximation E stimation Reconstruction Attack (MAERA) to mitigate the requisite prior assumptions, and we also introduce the Distillation-based Client-side Reconstruction Attack (DCRA) to execute data reconstructions from the client for the first time. Experimental results illustrate the effectiveness and the robustness of the proposed frameworks in launching attacks across various datasets. In particular, MAERA necessitates merely 1% of the test set samples and 1% of the private data samples from the CIFAR100 dataset to unleash effective attacks, while DCRA adeptly expropriates models from clients and yields more pronounced reconstruction effects on target class samples during the process of inferring data distribution characteristics, in contrast to conventional Maximum A Posteriori (MAP) estimation algorithms.

引用

页数：15

共 50 条

[41] Global Ionospheric Total Electron Content Completion with a GAN-Based Deep Learning Framework
Yang, Kunlin
Liu, Yang
REMOTE SENSING, 2022, 14 (23)
[42] FlGan: GAN-Based Unbiased Federated Learning Under Non-IID Settings
Ma, Zhuoran
Liu, Yang
Miao, Yinbin
Xu, Guowen
Liu, Ximeng
Ma, Jianfeng
Deng, Robert H.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (04) : 1566 - 1581
[43] AEC_GAN: Unbalanced Data Processing Decision-Making in Network Attacks Based on ACGAN and Machine Learning
Zhu, Naibo
Zhao, Guangyu
Yang, Yang
Yang, Han
Liu, Zhi
IEEE ACCESS, 2023, 11 : 52452 - 52465
[44] Improving imbalanced medical image classification through GAN-based data augmentation methods
Ding, Hongwei
Huang, Nana
Wu, Yaoxin
Cui, Xiaohui
PATTERN RECOGNITION, 2025, 166
[45] Producing More with Less: A GAN-based Network Attack Detection Approach for Imbalanced Data
Hao, Xingran
Jiang, Zhengwei
Xiao, Qingsai
Wang, Qiuyun
Yao, Yepeng
Liu, Baoxu
Liu, Jian
PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 384 - 390
[46] Tabular GAN-Based Oversampling of Imbalanced Time-to-Event Data for Survival Prediction
Tan, Huaning
Chen, Renxing
Qin, Meng
Tang, Lining
Wu, Zhibing
Luo, Qianlin
Quan, Yujuan
2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 376 - 380
[47] GAN-based generation of realistic compressible-flow samples from incomplete data
Abaidi, R.
Adams, N. A.
COMPUTERS & FLUIDS, 2024, 269
[48] GAN-Based Training of Semi-Interpretable Generators for Biological Data Interpolation and Augmentation
Tsourtis, Anastasios
Papoutsoglou, Georgios
Pantazis, Yannis
APPLIED SCIENCES-BASEL, 2022, 12 (11):
[49] GAN-BASED SYNTHETIC GASTROINTESTINAL IMAGE GENERATION
Adjei, Prince E.
Lonsek, Zenebe M.
Rao, Nini
2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 338 - 342
[50] GAN-based Matrix Factorization for Recommender Systems
Dervishaj, Ervin
Cremonesi, Paolo
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1373 - 1381

← 1 2 3 4 5 →