Label Privacy Source Coding in Vertical Federated Learning

被引：0

作者：

Gao, Dashan ^{[1
,2
,3
]}

Wan, Sheng ^{[2
,3
]}

Gu, Hanlin ^{[4
]}

Fan, Lixin ^{[4
]}

Yao, Xin ^{[5
]}

Yang, Qiang ^{[2
]}

机构：

[1] Guangdong Prov Key Lab, Guangzhou, Guangdong, Peoples R China

[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

[3] Southern Univ Sci & Technol, Shenzhen, Peoples R China

[4] WeBank AI Lab, Shenzhen, Peoples R China

[5] Lingnan Univ, Hong Kong, Peoples R China

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024 | 2024年 / 14941卷

基金：

中国国家自然科学基金;

关键词：

Vertical federated learning; Mutual information privacy; REGRESSION;

D O I：

10.1007/978-3-031-70341-6_19

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study label privacy protection in vertical federated learning (VFL). VFL enables an active party who possesses labeled data to improve model performance (utility) by collaborating with passive parties who have auxiliary features. Recently, there has been a growing concern for protecting label privacy against passive parties who may surreptitiously deduce private labels from the output of their bottom models. In contrast to existing defense methods that focus on training-phase perturbation, we propose a novel offline-phase cleansing approach to protect label privacy barely compromising utility. Specifically, we first formulate a Label Privacy Source Coding (LPSC) problem to remove the redundant label information in the active party's features from labels, by assigning each sample a new weight and label (i.e., residual) for federated training. We theoretically demonstrate that LPSC 1) satisfies epsilon-mutual information privacy (epsilon-MIP) and 2) can be reduced to gradient boosting's objective thereby efficiently optimized. Therefore, we propose a gradient boosting-based LPSC method to protect label privacy. Moreover, given that LPSC only provides bounded privacy enhancement, we further introduce the two-phase LPSC+ framework, which enables a flexible privacy-utility trade-off by incorporating training-phase perturbation methods, such as adversarial training. Experimental results on four realworld datasets substantiate the efficacy of LPSC and the superiority of our LPSC+ framework.

引用

页码：313 / 331

页数：19

共 50 条

[31] Vertical federated learning: a structured literature review
Khan, Afsana
ten Thij, Marijn
Wilbik, Anna
KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 3205 - 3243
[32] Adaptive Vertical Federated Learning on Unbalanced Features
Zhang, Jie
Guo, Song
Qu, Zhihao
Zeng, Deze
Wang, Haozhao
Liu, Qifeng
Zomaya, Albert Y.
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 4006 - 4018
[33] TREECSS: An Efficient Framework for Vertical Federated Learning
Zhang, Qinbo
Yang, Xiao
Ding, Yukai
Xu, Quanqing
Hu, Chuang
Zhou, Xiaokai
Jiang, Jiawei
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, DASFAA 2024, 2024, 14850 : 425 - 441
[34] Vertical Federated Learning: Concepts, Advances, and Challenges
Liu, Yang
Kang, Yan
Zou, Tianyuan
Pu, Yanhong
He, Yuanqin
Ye, Xiaozhou
Ouyang, Ye
Zhang, Ya-Qin
Yang, Qiang
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3615 - 3634
[35] SVFGNN: A privacy-preserving vertical federated graph neural network model training framework based on split learning
Liu, Yanjun
Li, Hongwei
Hao, Meng
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (01) : 261 - 283
[36] SVFGNN: A privacy-preserving vertical federated graph neural network model training framework based on split learning
Yanjun Liu
Hongwei Li
Meng Hao
Peer-to-Peer Networking and Applications, 2024, 17 : 246 - 260
[37] SGBoost: An Efficient and Privacy-Preserving Vertical Federated Tree Boosting Framework
Zhao, Jiaqi
Zhu, Hui
Xu, Wei
Wang, Fengwei
Lu, Rongxing
Li, Hui
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1022 - 1036
[38] VFLR: An Efficient and Privacy-Preserving Vertical Federated Framework for Logistic Regression
Zhao, Jiaqi
Zhu, Hui
Wang, Fengwei
Lu, Rongxing
Wang, Ermei
Li, Linfeng
Li, Hui
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (04) : 3326 - 3340
[39] SVFL: Secure Vertical Federated Learning on Linear Models
Luo, Kaifeng
Cao, Zhenfu
Shen, Jiachen
Dong, Xiaolei
SCIENCE OF CYBER SECURITY, SCISEC 2023, 2023, 14299 : 332 - 344
[40] Architecture-Based FedAvg for Vertical Federated Learning
Casella, Bruno
Fonio, Samuele
16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,

← 1 2 3 4 5 →