Pricing GAN-based data generators under R?nyi differential privacy

被引:11
作者
Jiang, Xikun [1 ,2 ]
Niu, Chaoyue [1 ]
Ying, Chenhao [1 ,2 ]
Wu, Fan [1 ]
Luo, Yuan [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, Key Lab Artificial Intelligence, MoE, Shanghai, Peoples R China
关键词
Generative adversarial networks (GAN); R?nyi differential privacy; Trading data generators;
D O I
10.1016/j.ins.2022.04.030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As smart devices are becoming increasingly common in people's daily lives, privacy and security concerns make data collection expensive and limited, which further hinder the development of data-driven tasks. This paper studies how to better conduct private data trading via a novel generator method rather than direct trading of raw data. This new method facilitates more convenient data transactions by generator, protects the privacy of data owners and is satisfactory in terms of privacy compensation and query pricing. In detail, we propose RARIEA, a market framework for tRading privAte data geneRators based on GAN under renyI diffErential privAcy, which involves data owners, a data broker, and data consumers. To start, the broker employs the GAN training generator to augment the data to relieve the data shortage, introducing noise into its training process to preserve the owners' privacy. After that, the broker uses renyi differential privacy to quantify the privacy loss at the data item level during the GAN training process and compensates each owner according to their respective privacy policies. Finally, the data broker charges each of the data consumers for their queries, where the price is lower bounded by the total privacy compensation. We then evaluate the performance of RARIEA on classic data sets: MNIST, FashionMNIST, and CelebA. The analysis and simulation results reveal that the generator provided by RARIEA can not only meet the data consumers' demand for quantity and quality but also protect the owners' privacy. In addition, RARIEA not only allows finer control over data owner compensation, but also excels at controlling the data broker's revenue to improve market efficiency while ensuring fairness, balance, and monotonicity of pricing. (c) 2022 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:57 / 74
页数:18
相关论文
共 48 条
[1]   Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]  
[Anonymous], ICLR 2016
[3]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[4]   Revenue Maximization for Query Pricing [J].
Chawla, Shuchi ;
Deep, Shaleen ;
Koutris, Paraschos ;
Teng, Yifeng .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 13 (01) :1-14
[5]   Towards Model-based Pricing for Machine Learning in a Data Marketplace [J].
Chen, Lingjiao ;
Koutris, Paraschos ;
Kumar, Arun .
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, :1535-1552
[6]  
dawex, ABOUT US
[7]  
Deep Shaleen., 2017, International Conference on Database Theory, P1, DOI DOI 10.1109/IGEHT.2017.8094066
[8]   Differential privacy: A survey of results [J].
Dwork, Cynthia .
THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 :1-19
[9]   The Algorithmic Foundations of Differential Privacy [J].
Dwork, Cynthia ;
Roth, Aaron .
FOUNDATIONS AND TRENDS IN THEORETICAL COMPUTER SCIENCE, 2013, 9 (3-4) :211-406
[10]   Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration [J].
Fan, Ju ;
Chen, Junyou ;
Liu, Tongyu ;
Shen, Yuwei ;
Li, Guoliang ;
Du, Xiaoyong .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (11) :1962-1975