Task-Specific Adaptive Differential Privacy Method for Structured Data

Cited by: 3

Authors:

Utaliyeva, Assem [1]

Shin, Jinmyeong [1]

Choi, Yoon-Ho [1]

Affiliations:

[1] Pusan Natl Univ, Sch Comp Sci & Engn, Busan 609735, South Korea

Source:

SENSORS | 2023 / Vol. 23 / Issue 04

Funding:

National Research Foundation of Singapore;

Keywords:

differential privacy; machine learning; privacy-preserving;

DOI:

10.3390/s23041980

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification code:

070302 ; 081704 ;

Abstract:

Training machine learning (ML) algorithms requires data, which in many cases include private datasets that contain sensitive information. To preserve the privacy of the data used to train ML algorithms, computer scientists have widely deployed anonymization techniques, but these techniques are not foolproof. Many studies have shown that ML models relying on anonymization techniques are vulnerable to various privacy attacks that aim to expose sensitive information. As a privacy-preserving machine learning (PPML) technique that protects private data with sensitive information in ML, we propose a new task-specific adaptive differential privacy (DP) technique for structured data. The main idea of the proposed DP method is to adaptively calibrate the amount and distribution of random noise applied to each attribute according to its feature importance for the specific ML task and the type of data. We evaluate the effectiveness of the proposed task-specific adaptive DP method through experiments across various datasets, ML tasks, and DP mechanisms. The results show that the proposed technique is model-agnostic, so it can be applied to a wide range of ML tasks and data types, while resolving the privacy-utility trade-off problem.
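
The idea of calibrating per-attribute noise by feature importance can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not state the allocation rule, so the sketch assumes that a total privacy budget is split across attributes in proportion to their importance (more important attributes receive a larger share and therefore less noise), that the Laplace mechanism is used, and that each attribute is clipped to a known range. The function names, importance scores, bounds, and sample record are all hypothetical.

```python
import random


def allocate_budget(importances, total_epsilon):
    """Split total_epsilon across attributes in proportion to importance.

    Assumed rule for illustration only; the shares sum to total_epsilon.
    """
    if total_epsilon <= 0 or any(w <= 0 for w in importances):
        raise ValueError("total_epsilon and all importances must be positive")
    total = sum(importances)
    return [total_epsilon * w / total for w in importances]


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponentials."""
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def perturb_record(record, bounds, epsilons, rng):
    """Clip each attribute to its range, then add Laplace noise.

    The sensitivity of an attribute is taken to be the width of its range,
    so the noise scale for that attribute is (high - low) / epsilon.
    """
    noisy = []
    for value, (low, high), eps in zip(record, bounds, epsilons):
        clipped = min(max(value, low), high)
        sensitivity = high - low
        noisy.append(clipped + laplace_noise(sensitivity / eps, rng))
    return noisy


# Hypothetical importance scores (e.g. from a model trained for the task)
# and hypothetical attribute ranges.
importances = [0.6, 0.3, 0.1]
bounds = [(18.0, 90.0), (0.0, 200000.0), (0.0, 10.0)]

epsilons = allocate_budget(importances, total_epsilon=1.0)
rng = random.Random(0)  # seeded only to make the sketch reproducible
noisy_record = perturb_record([42.0, 55000.0, 3.0], bounds, epsilons, rng)
```

Under these assumptions, the per-attribute budgets add up to the total budget, so the perturbed record as a whole stays within `total_epsilon` by sequential composition. A production implementation would use a cryptographically secure noise source rather than the `random` module.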


Pages: 18
