Label Privacy Source Coding in Vertical Federated Learning

被引:0
|
作者
Gao, Dashan [1 ,2 ,3 ]
Wan, Sheng [2 ,3 ]
Gu, Hanlin [4 ]
Fan, Lixin [4 ]
Yao, Xin [5 ]
Yang, Qiang [2 ]
机构
[1] Guangdong Prov Key Lab, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Peoples R China
[4] WeBank AI Lab, Shenzhen, Peoples R China
[5] Lingnan Univ, Hong Kong, Peoples R China
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024 | 2024年 / 14941卷
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Mutual information privacy; REGRESSION;
D O I
10.1007/978-3-031-70341-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study label privacy protection in vertical federated learning (VFL). VFL enables an active party who possesses labeled data to improve model performance (utility) by collaborating with passive parties who have auxiliary features. Recently, there has been a growing concern for protecting label privacy against passive parties who may surreptitiously deduce private labels from the output of their bottom models. In contrast to existing defense methods that focus on training-phase perturbation, we propose a novel offline-phase cleansing approach to protect label privacy barely compromising utility. Specifically, we first formulate a Label Privacy Source Coding (LPSC) problem to remove the redundant label information in the active party's features from labels, by assigning each sample a new weight and label (i.e., residual) for federated training. We theoretically demonstrate that LPSC 1) satisfies epsilon-mutual information privacy (epsilon-MIP) and 2) can be reduced to gradient boosting's objective thereby efficiently optimized. Therefore, we propose a gradient boosting-based LPSC method to protect label privacy. Moreover, given that LPSC only provides bounded privacy enhancement, we further introduce the two-phase LPSC+ framework, which enables a flexible privacy-utility trade-off by incorporating training-phase perturbation methods, such as adversarial training. Experimental results on four realworld datasets substantiate the efficacy of LPSC and the superiority of our LPSC+ framework.
引用
收藏
页码:313 / 331
页数:19
相关论文
共 50 条
  • [31] Vertical federated learning: a structured literature review
    Khan, Afsana
    ten Thij, Marijn
    Wilbik, Anna
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 3205 - 3243
  • [32] Adaptive Vertical Federated Learning on Unbalanced Features
    Zhang, Jie
    Guo, Song
    Qu, Zhihao
    Zeng, Deze
    Wang, Haozhao
    Liu, Qifeng
    Zomaya, Albert Y.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 4006 - 4018
  • [33] TREECSS: An Efficient Framework for Vertical Federated Learning
    Zhang, Qinbo
    Yang, Xiao
    Ding, Yukai
    Xu, Quanqing
    Hu, Chuang
    Zhou, Xiaokai
    Jiang, Jiawei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, DASFAA 2024, 2024, 14850 : 425 - 441
  • [34] Vertical Federated Learning: Concepts, Advances, and Challenges
    Liu, Yang
    Kang, Yan
    Zou, Tianyuan
    Pu, Yanhong
    He, Yuanqin
    Ye, Xiaozhou
    Ouyang, Ye
    Zhang, Ya-Qin
    Yang, Qiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3615 - 3634
  • [35] SVFGNN: A privacy-preserving vertical federated graph neural network model training framework based on split learning
    Liu, Yanjun
    Li, Hongwei
    Hao, Meng
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (01) : 261 - 283
  • [36] SVFGNN: A privacy-preserving vertical federated graph neural network model training framework based on split learning
    Yanjun Liu
    Hongwei Li
    Meng Hao
    Peer-to-Peer Networking and Applications, 2024, 17 : 246 - 260
  • [37] SGBoost: An Efficient and Privacy-Preserving Vertical Federated Tree Boosting Framework
    Zhao, Jiaqi
    Zhu, Hui
    Xu, Wei
    Wang, Fengwei
    Lu, Rongxing
    Li, Hui
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1022 - 1036
  • [38] VFLR: An Efficient and Privacy-Preserving Vertical Federated Framework for Logistic Regression
    Zhao, Jiaqi
    Zhu, Hui
    Wang, Fengwei
    Lu, Rongxing
    Wang, Ermei
    Li, Linfeng
    Li, Hui
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (04) : 3326 - 3340
  • [39] SVFL: Secure Vertical Federated Learning on Linear Models
    Luo, Kaifeng
    Cao, Zhenfu
    Shen, Jiachen
    Dong, Xiaolei
    SCIENCE OF CYBER SECURITY, SCISEC 2023, 2023, 14299 : 332 - 344
  • [40] Architecture-Based FedAvg for Vertical Federated Learning
    Casella, Bruno
    Fonio, Samuele
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,