Towards Correlated Data Trading for High-Dimensional Private Data

被引:9
作者
Cai, Hui [1 ]
Yang, Yuanyuan [2 ]
Fan, Weibei [1 ]
Xiao, Fu [1 ]
Zhu, Yanmin [3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp, Nanjing 210028, Jiangsu, Peoples R China
[2] SUNY Stony Brook, Dept Elect & Comp Engn, Stony Brook, NY 11794 USA
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
基金
中国博士后科学基金;
关键词
Data privacy; Correlation; Privacy; Costs; Perturbation methods; Data models; History; Data correlation; data privacy; data trading;
D O I
10.1109/TPDS.2023.3237691
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The commoditization of private data has become an attractive research topic with the emergence of Big Data era. In this paper, we study the trading of high-dimensional private data with differential privacy guarantee. We propose Cheap, which is a novel Correlated data trading framework for High-dimEnsionAl Private data. Cheap first models data correlations among high-dimensional user attributes, and builds an initial attribute clustering scheme. Combined with this scheme, Cheap devises a novel data perturbation mechanism by solving optimal attribute clustering (OAC) problem, in order to improve data utility of traded data and further generate a privacy-preserving high-dimensional dataset with close joint distribution with the original one. It then quantifies privacy loss based on near-optimal attribute cluster scheme due to the NP-hardness of the OAC problem, and further compensates data owners by running auction in a cost-effective way. We evaluate the performance of Cheap on UserBehavior dataset and Obesity dataset, respectively. Our evaluation and analysis demonstrate that Cheap well balances data utility and privacy protection, and achieves all desired economic properties of budget balance, individual rationality and truthfulness.
引用
收藏
页码:1047 / 1059
页数:13
相关论文
共 47 条
[1]  
[Anonymous], 2012, INFOCHIMPS
[2]  
[Anonymous], 2017, BEHAV TARGETING
[3]  
[Anonymous], 1969, AXICOM
[4]  
[Anonymous], 2019, MICROSOFT AZURE MARK
[5]  
[Anonymous], 2019, USER BEHAV DATASET I
[6]  
[Anonymous], 2019, OPTI TOOLBOX
[7]  
[Anonymous], 2020, ESTIMATION OBESITY L
[8]  
[Anonymous], 2011, Differential Privacy, DOI [DOI 10.1007/978-1-4419-5906-5752, 10.1007/978-1-4419-5906-5752]
[9]   Protecting User Privacy: An Approach for Untraceable Web Browsing History and Unambiguous User Profiles [J].
Beigi, Ghazaleh ;
Guo, Ruocheng ;
Nou, Alexander ;
Zhang, Yanchao ;
Liu, Huan .
PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, :213-221
[10]   Online Pricing and Trading of Private Data in Correlated Queries [J].
Cai, Hui ;
Ye, Fan ;
Yang, Yuanyuan ;
Zhu, Yanmin ;
Li, Jie ;
Xiao, Fu .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (03) :569-585