Label Privacy Source Coding in Vertical Federated Learning

被引:0
|
作者
Gao, Dashan [1 ,2 ,3 ]
Wan, Sheng [2 ,3 ]
Gu, Hanlin [4 ]
Fan, Lixin [4 ]
Yao, Xin [5 ]
Yang, Qiang [2 ]
机构
[1] Guangdong Prov Key Lab, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Peoples R China
[4] WeBank AI Lab, Shenzhen, Peoples R China
[5] Lingnan Univ, Hong Kong, Peoples R China
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024 | 2024年 / 14941卷
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Mutual information privacy; REGRESSION;
D O I
10.1007/978-3-031-70341-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study label privacy protection in vertical federated learning (VFL). VFL enables an active party who possesses labeled data to improve model performance (utility) by collaborating with passive parties who have auxiliary features. Recently, there has been a growing concern for protecting label privacy against passive parties who may surreptitiously deduce private labels from the output of their bottom models. In contrast to existing defense methods that focus on training-phase perturbation, we propose a novel offline-phase cleansing approach to protect label privacy barely compromising utility. Specifically, we first formulate a Label Privacy Source Coding (LPSC) problem to remove the redundant label information in the active party's features from labels, by assigning each sample a new weight and label (i.e., residual) for federated training. We theoretically demonstrate that LPSC 1) satisfies epsilon-mutual information privacy (epsilon-MIP) and 2) can be reduced to gradient boosting's objective thereby efficiently optimized. Therefore, we propose a gradient boosting-based LPSC method to protect label privacy. Moreover, given that LPSC only provides bounded privacy enhancement, we further introduce the two-phase LPSC+ framework, which enables a flexible privacy-utility trade-off by incorporating training-phase perturbation methods, such as adversarial training. Experimental results on four realworld datasets substantiate the efficacy of LPSC and the superiority of our LPSC+ framework.
引用
收藏
页码:313 / 331
页数:19
相关论文
共 50 条
  • [41] Adaptive and Efficient Participant Selection in Vertical Federated Learning
    Huang, Jiahui
    Zhang, Lan
    Li, Anran
    Cheng, Haoran
    Xu, Jiexin
    Song, Hongmei
    2023 19TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN 2023, 2023, : 455 - 462
  • [42] Cluster knowledge-driven vertical federated learning
    Yin, Zilong
    Zhao, Xiaoli
    Wang, Haoyu
    Zhang, Xin
    Guo, Xin
    Fang, Zhijun
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (14) : 20229 - 20252
  • [43] PraVFed: Practical Heterogeneous Vertical Federated Learning via Representation Learning
    Wang, Shuo
    Gai, Keke
    Yu, Jing
    Zhang, Zijian
    Zhu, Liehuang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 2693 - 2705
  • [44] Practical and General Backdoor Attacks Against Vertical Federated Learning
    Xuan, Yuexin
    Chen, Xiaojun
    Zhao, Zhendong
    Tang, Bisheng
    Dong, Ye
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 402 - 417
  • [45] Cross-Modal Vertical Federated Learning for MRI Reconstruction
    Yan, Yunlu
    Wang, Hong
    Huang, Yawen
    He, Nanjun
    Zhu, Lei
    Xu, Yong
    Li, Yuexiang
    Zheng, Yefeng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6384 - 6394
  • [46] Universal adversarial backdoor attacks to fool vertical federated learning
    Chen, Peng
    Du, Xin
    Lu, Zhihui
    Chai, Hongfeng
    COMPUTERS & SECURITY, 2024, 137
  • [47] BDVFL: Blockchain-based Decentralized Vertical Federated Learning
    Wang, Shuo
    Gai, Keke
    Yu, Jing
    Zhu, Liehuang
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 628 - 637
  • [48] DFedXGB: An XGB Vertical Federated Learning Framework with Data Desensitization
    Yang, Qing
    Tian, Youliang
    Xiong, Jinbo
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 157 - 164
  • [49] Feature Inference Attack on Model Predictions in Vertical Federated Learning
    Luo, Xinjian
    Wu, Yuncheng
    Xiao, Xiaokui
    Ooi, Beng Chin
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 181 - 192
  • [50] Stalactite: toolbox for fast prototyping of vertical federated learning systems
    Zakharova, Anastasiia
    Alexandrov, Dmitriy
    Khodorchenko, Maria
    Butakov, Nikolay
    Vasilev, Alexey
    Savchenko, Maxim
    Grigorievskiy, Alexander
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 1187 - 1190