Influence of data errors on differential privacy

被引:0
作者
Tao Wang
Zhengquan Xu
Dong Wang
Hao Wang
机构
[1] Wuhan University,Collaborative Innovation Center for Geospatial Technology, and State Key Laboratory for Information Engineering in Surveying, Mapping and Remote Sensing
[2] Wuhan University,State Key Laboratory for Information Engineering in Surveying, Mapping and Remote Sensing, and Collaborative Innovation Center for Geospatial Technology
来源
Cluster Computing | 2019年 / 22卷
关键词
Data errors; Differential privacy; Privacy budget; Laplace mechanism; Gaussian distribution;
D O I
暂无
中图分类号
学科分类号
摘要
The rapid development of data sharing applications brings a serious problem of privacy disclosure. As an effective privacy-preserving method, the differential privacy, which strictly defines the privacy-preserving degree and data utility mathematically, can balance the privacy and data utility. However, the differential privacy has a hypothesis premise that the raw data are accurate without any error, so it could not limit the privacy security and the data utility to the expected range when processing data with errors. Hence, this paper focuses on the study on the influence of data errors on differential privacy. Taking the random error as an example, we analyze the influence mode and mechanism of data errors on differential privacy, especially on the privacy budget ε\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon $$\end{document}. The theoretical derivations and experimental simulations prove that the Laplace mechanism still preserves ε′\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon ^{\prime }$$\end{document} -indistinguishability for data with errors. Moreover, the random algorithm can realize the expected privacy preserving strength by adding less noise compared with the algorithm that do not consider data errors, and has a better data utility by reducing the unnecessary cost of utility. This paper defines the research directions on the differential privacy theory concerning of data errors, and provides the foundations of perfecting the theory system and promoting the practicality of the differential privacy.
引用
收藏
页码:2739 / 2746
页数:7
相关论文
共 43 条
  • [1] Montjoye YD(2013)Unique in the crowd: the privacy bounds of human mobility Sci. Rep. 3 1-5
  • [2] Hidalgo CA(2017)CTS-DP: publishing correlated time-series data via differential privacy Knowl. Based Syst. 122 167-179
  • [3] Verleysen M(2014)Differentially private filtering IEEE Trans. Autom. Control 59 341-354
  • [4] Blondel VD(2014)Location privacy preservation in big data era: a survey J. Softw. 25 693-712
  • [5] Wang H(2011)A survey of trajectory privacy-preserving techniques Chin. J. Comput. 34 1820-1830
  • [6] Xu ZQ(2002)K-anonymity: a model for protecting privacy Int. J. Uncertain. Fuzz. Knowl. Based Syst. 10 557-570
  • [7] Le Ny J(2012)Shortest path computation with no information leakage Proc. VLDB Endow. 5 692-703
  • [8] Pappas GJ(2014)A supermodularity-based differential privacy preserving algorithm for data anonymization IEEE Trans. Knowl. Data Eng. 26 1591-1601
  • [9] Wang L(2014)Pufferfish: a framework for mathematical privacy definitions ACM Trans. Database Syst. 39 3.1-3.36
  • [10] Meng XF(2016)Toward efficient multi-keyword fuzzy search over encrypted outsourced data with accuracy improvement IEEE Trans. Inf. Forensics Secur. 11 2706-2716