Label Privacy Source Coding in Vertical Federated Learning

Cited by: 0
Authors
Gao, Dashan [1, 2, 3]
Wan, Sheng [2, 3]
Gu, Hanlin [4]
Fan, Lixin [4]
Yao, Xin [5]
Yang, Qiang [2]
Affiliations
[1] Guangdong Prov Key Lab, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Peoples R China
[4] WeBank AI Lab, Shenzhen, Peoples R China
[5] Lingnan Univ, Hong Kong, Peoples R China
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024 | 2024 / Vol. 14941
Funding
National Natural Science Foundation of China;
Keywords
Vertical federated learning; Mutual information privacy; REGRESSION;
DOI
10.1007/978-3-031-70341-6_19
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We study label privacy protection in vertical federated learning (VFL). VFL enables an active party that holds labeled data to improve model performance (utility) by collaborating with passive parties that hold auxiliary features. Recently, concern has grown over protecting label privacy against passive parties that may surreptitiously infer private labels from the outputs of their bottom models. In contrast to existing defense methods, which focus on training-phase perturbation, we propose a novel offline-phase cleansing approach that protects label privacy while barely compromising utility. Specifically, we first formulate a Label Privacy Source Coding (LPSC) problem that removes from the labels the label information that is redundant with the active party's features, by assigning each sample a new weight and label (i.e., residual) for federated training. We theoretically demonstrate that LPSC 1) satisfies epsilon-mutual information privacy (epsilon-MIP) and 2) can be reduced to the gradient boosting objective and can therefore be optimized efficiently. Accordingly, we propose a gradient boosting-based LPSC method to protect label privacy. Moreover, because LPSC provides only bounded privacy enhancement, we further introduce the two-phase LPSC+ framework, which enables a flexible privacy-utility trade-off by incorporating training-phase perturbation methods such as adversarial training. Experimental results on four real-world datasets substantiate the efficacy of LPSC and the superiority of our LPSC+ framework.
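The residual idea in the abstract can be illustrated with a minimal sketch. This is not the authors' LPSC algorithm: it assumes a single local feature, a hand-rolled logistic model, and the standard gradient boosting pseudo-residual for log-loss (label minus predicted probability). The function names `fit_local_logistic` and `residual_targets` and the toy data are hypothetical, and the per-sample weights mentioned in the abstract are omitted because the abstract does not specify how they are derived.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_local_logistic(xs, ys, lr=0.5, steps=500):
    """Fit p(y=1|x) = sigmoid(w*x + b) by gradient descent,
    using only the active party's own feature (assumed 1-D here)."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        errs = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        w -= lr * sum(e * x for e, x in zip(errs, xs)) / n
        b -= lr * sum(errs) / n
    return w, b


def residual_targets(xs, ys, w, b):
    """Pseudo-residuals of log-loss: y - p. These stand in for the
    'new labels' sent to federated training (illustrative assumption)."""
    return [y - sigmoid(w * x + b) for x, y in zip(xs, ys)]


# Toy data: the feature is informative but not perfectly predictive.
xs = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
ys = [0, 0, 1, 0, 1, 1]

w, b = fit_local_logistic(xs, ys)
residuals = residual_targets(xs, ys, w, b)
for x, y, r in zip(xs, ys, residuals):
    print(f"x={x:+.1f}  y={y}  residual={r:+.3f}")
```

In this toy run, samples that the local model already predicts well end up with residuals close to zero, so little label information about them remains to be learned in the federated phase, while samples the local model gets wrong keep large residuals.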
Pages: 313 - 331
Number of pages: 19
Related papers
50 in total
  • [1] Cascade Vertical Federated Learning Towards Straggler Mitigation and Label Privacy Over Distributed Labels
    Xia, Wensheng
    Li, Ying
    Zhang, Lan
    Wu, Zhonghai
    Yuan, Xiaoyong
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 926 - 939
  • [2] Exploiting Internal Randomness for Privacy in Vertical Federated Learning
    Sun, Yulian
    Duan, Li
    Mendes, Ricardo
    Zhu, Derui
    Xia, Yue
    Li, Yong
    Fischer, Asja
    COMPUTER SECURITY-ESORICS 2024, PT II, 2024, 14983 : 390 - 409
  • [3] Toward Few-Label Vertical Federated Learning
    Zhang, Lei
    Fu, Lele
    Liu, Chen
    Yang, Zhao
    Yang, Jinghua
    Zheng, Zibin
    Chen, Chuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (07)
  • [4] Adaptive differential privacy in vertical federated learning for mobility forecasting
    Errounda, Fatima Zahra
    Liu, Yan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 149 : 531 - 546
  • [5] Interpretable vertical federated learning with privacy-preserving multi-source data integration for prognostic prediction
    Wang, Qingyong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [6] A Privacy-Preserving Method for Sequential Recommendation in Vertical Federated Learning
    Shi, Yutian
    Wang, Beilun
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2221 - 2226
  • [7] A Privacy-preserving Data Alignment Framework for Vertical Federated Learning
    Gao, Ying
    Xie, Yuxin
    Deng, Huanghao
    Zhu, Zukun
    Zhang, Yiyu
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (08) : 3419 - 3427
  • [8] ELXGB: An Efficient and Privacy-Preserving XGBoost for Vertical Federated Learning
    Xu, Wei
    Zhu, Hui
    Zheng, Yandong
    Wang, Fengwei
    Zhao, Jiaqi
    Liu, Zhe
    Li, Hui
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 878 - 892
  • [9] SensFL: Privacy-Preserving Vertical Federated Learning with Sensitive Regularization
    Zhang, Chongzhen
    Liu, Zhichen
    Xu, Xiangrui
    Hu, Fuqiang
    Dai, Jiao
    Cai, Baigen
    Wang, Wei
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, : 385 - 404
  • [10] A defense mechanism against label inference attacks in Vertical Federated Learning
    Arazzi, Marco
    Nicolazzo, Serena
    Nocera, Antonino
    NEUROCOMPUTING, 2025, 624