Label Privacy Source Coding in Vertical Federated Learning

被引:0
|
作者
Gao, Dashan [1 ,2 ,3 ]
Wan, Sheng [2 ,3 ]
Gu, Hanlin [4 ]
Fan, Lixin [4 ]
Yao, Xin [5 ]
Yang, Qiang [2 ]
机构
[1] Guangdong Prov Key Lab, Guangzhou, Guangdong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Southern Univ Sci & Technol, Shenzhen, Peoples R China
[4] WeBank AI Lab, Shenzhen, Peoples R China
[5] Lingnan Univ, Hong Kong, Peoples R China
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024 | 2024年 / 14941卷
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Mutual information privacy; REGRESSION;
D O I
10.1007/978-3-031-70341-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study label privacy protection in vertical federated learning (VFL). VFL enables an active party who possesses labeled data to improve model performance (utility) by collaborating with passive parties who have auxiliary features. Recently, there has been a growing concern for protecting label privacy against passive parties who may surreptitiously deduce private labels from the output of their bottom models. In contrast to existing defense methods that focus on training-phase perturbation, we propose a novel offline-phase cleansing approach to protect label privacy barely compromising utility. Specifically, we first formulate a Label Privacy Source Coding (LPSC) problem to remove the redundant label information in the active party's features from labels, by assigning each sample a new weight and label (i.e., residual) for federated training. We theoretically demonstrate that LPSC 1) satisfies epsilon-mutual information privacy (epsilon-MIP) and 2) can be reduced to gradient boosting's objective thereby efficiently optimized. Therefore, we propose a gradient boosting-based LPSC method to protect label privacy. Moreover, given that LPSC only provides bounded privacy enhancement, we further introduce the two-phase LPSC+ framework, which enables a flexible privacy-utility trade-off by incorporating training-phase perturbation methods, such as adversarial training. Experimental results on four realworld datasets substantiate the efficacy of LPSC and the superiority of our LPSC+ framework.
引用
收藏
页码:313 / 331
页数:19
相关论文
共 50 条
  • [21] FedLED: Label-Free Equipment Fault Diagnosis With Vertical Federated Transfer Learning
    Shen, Jie
    Yang, Shusen
    Zhao, Cong
    Ren, Xuebin
    Zhao, Peng
    Yang, Yuqian
    Han, Qing
    Wu, Shuaijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10
  • [22] Privacy-Preserving Realization of Fuzzy Clustering and Fuzzy Modeling Through Vertical Federated Learning
    Zhu, Xiubin
    Wang, Dan
    Pedrycz, Witold
    Li, Zhiwu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 915 - 924
  • [23] ReVFed: Representation-Based Privacy-Preserving Vertical Federated Learning with Heterogeneous Models
    Wang, Shuo
    Yu, Jing
    Gai, Keke
    Zhu, Liehuang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024, 2024, 14886 : 386 - 397
  • [24] Beyond model splitting: Preventing label inference attacks in vertical federated learning with dispersed training
    Wang, Yilei
    Lv, Qingzhe
    Zhang, Huang
    Zhao, Minghao
    Sun, Yuhong
    Ran, Lingkai
    Li, Tao
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 2691 - 2707
  • [25] Beyond model splitting: Preventing label inference attacks in vertical federated learning with dispersed training
    Yilei Wang
    Qingzhe Lv
    Huang Zhang
    Minghao Zhao
    Yuhong Sun
    Lingkai Ran
    Tao Li
    World Wide Web, 2023, 26 : 2691 - 2707
  • [26] Frameworks for Privacy-Preserving Federated Learning
    Phong, Le Trieu
    Phuong, Tran Thi
    Wang, Lihua
    Ozawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 2 - 12
  • [27] IHVFL: a privacy-enhanced intention-hiding vertical federated learning framework for medical data
    Fei Tang
    Shikai Liang
    Guowei Ling
    Jinyong Shan
    Cybersecurity, 6
  • [28] IHVFL: a privacy-enhanced intention-hiding vertical federated learning framework for medical data
    Tang, Fei
    Liang, Shikai
    Ling, Guowei
    Shan, Jinyong
    CYBERSECURITY, 2023, 6 (01)
  • [29] Privacy Matters: Vertical Federated Linear Contextual Bandits for Privacy-Protected Recommendation
    Cao, Zeyu
    Liang, Zhipeng
    Wu, Bingzhe
    Zhang, Shu
    Li, Hangyu
    Wen, Ouyang
    Rong, Yu
    Zhao, Peilin
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 154 - 166
  • [30] Practical Vertical Federated Learning With Unsupervised Representation Learning
    Wu, Zhaomin
    Li, Qinbin
    He, Bingsheng
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 864 - 878