Task-Specific Adaptive Differential Privacy Method for Structured Data

Cited by: 3
Authors
Utaliyeva, Assem [1]
Shin, Jinmyeong [1]
Choi, Yoon-Ho [1]
Affiliation
[1] Pusan Natl Univ, Sch Comp Sci & Engn, Busan 609735, South Korea
Funding
National Research Foundation, Singapore
Keywords
differential privacy; machine learning; privacy-preserving;
DOI
10.3390/s23041980
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Training machine learning (ML) algorithms requires data, which often include private datasets containing sensitive information. To preserve the privacy of the data used in training, computer scientists have widely deployed anonymization techniques, but these techniques are not foolproof. Many studies have shown that ML models trained with anonymization techniques remain vulnerable to various privacy attacks that aim to expose sensitive information. As a privacy-preserving machine learning (PPML) technique that protects sensitive private data in ML, we propose a new task-specific adaptive differential privacy (DP) technique for structured data. The main idea of the proposed DP method is to adaptively calibrate the amount and distribution of the random noise applied to each attribute according to its feature importance for the specific ML task and the type of data. We evaluate the effectiveness of the proposed task-specific adaptive DP method through experiments across various datasets, ML tasks, and DP mechanisms. The results show that the proposed technique is model-agnostic, so it can be applied to a wide range of ML tasks and data types, while resolving the privacy-utility trade-off problem.
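The abstract's core idea, calibrating per-attribute noise by feature importance, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the allocation rule, so the sketch assumes a total privacy budget split across numeric attributes in proportion to importance (more important attributes receive a larger share and therefore less noise), and it uses the standard Laplace mechanism on bounded values. The function names `allocate_epsilon`, `laplace_noise`, and `perturb_record` are hypothetical, and categorical attributes are not handled.

```python
import random


def allocate_epsilon(importance, total_epsilon):
    """Split total_epsilon across attributes in proportion to importance.

    Assumption for illustration: the proportional rule is not taken
    from the paper, whose abstract does not state the allocation.
    """
    if total_epsilon <= 0:
        raise ValueError("total_epsilon must be positive")
    if any(w <= 0 for w in importance):
        raise ValueError("importance scores must be strictly positive")
    total = sum(importance)
    return [total_epsilon * w / total for w in importance]


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise.

    The difference of two independent Exp(1) draws is Laplace(0, 1).
    """
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def perturb_record(record, bounds, importance, total_epsilon, rng=None):
    """Perturb one numeric record, attribute by attribute.

    bounds holds a (low, high) pair per attribute; values are clipped
    to that range so the per-attribute sensitivity is high - low.
    """
    rng = rng or random.Random()
    epsilons = allocate_epsilon(importance, total_epsilon)
    noisy = []
    for value, (low, high), eps in zip(record, bounds, epsilons):
        clipped = min(max(value, low), high)
        scale = (high - low) / eps  # Laplace scale = sensitivity / epsilon
        noisy.append(clipped + laplace_noise(scale, rng))
    return noisy


if __name__ == "__main__":
    # Hypothetical record with two attributes scaled to [0, 1];
    # the first attribute is assumed three times as important.
    bounds = [(0.0, 1.0), (0.0, 1.0)]
    print(allocate_epsilon([3.0, 1.0], 2.0))
    print(perturb_record([0.2, 0.9], bounds, [3.0, 1.0], 2.0, random.Random(0)))
```

Under sequential composition, the per-attribute budgets in this sketch sum to `total_epsilon` for each record. In practice the importance scores would come from the target ML task (for example, from a model fitted for that task), which is what makes the calibration task-specific.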
Pages: 18