Differentially Private Real-Time Release of Sequential Data

被引:2
作者
Zhang, Xueru [1 ]
Khalili, Mohammad Mahdi [2 ]
Liu, Mingyan [3 ]
机构
[1] Ohio State Univ, Comp Sci & Engn, 2015 Neil Ave, Columbus, OH 43210 USA
[2] Univ Delaware, Comp & Informat Sci, 101 Smith Hall,18 Amstel Ave, Newark, DE 19716 USA
[3] Univ Michigan, Elect Engn & Comp Sci, 1301 Beal Ave, Ann Arbor, MI 48109 USA
关键词
Differential privacy; sequential data;
D O I
10.1145/3544837
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many data analytics applications rely on temporal data, generated (and possibly acquired) sequentially for online analysis. How to release this type of data in a privacy-preserving manner is of great interest and more challenging than releasing one-time, static data. Because of the (potentially strong) temporal correlation within the data sequence, the overall privacy loss can accumulate significantly over time; an attacker with statistical knowledge of the correlation can be particularly hard to defend against. An idea that has been explored in the literature to mitigate this problem is to factor this correlation into the perturbation/noise mechanism. Existing work, however, either focuses on the offline setting (where perturbation is designed and introduced after the entire sequence has become available), or requires a priori information on the correlation in generating perturbation. In this study we propose an approach where the correlation is learned as the sequence is generated, and is used for estimating future data in the sequence. This estimate then drives the generation of the noisy released data. This method allows us to design better perturbation and is suitable for real-time operations. Using the notion of differential privacy, we show this approach achieves high accuracy with lower privacy loss compared to existing methods.
引用
收藏
页数:29
相关论文
共 49 条
[11]   The Algorithmic Foundations of Differential Privacy [J].
Dwork, Cynthia ;
Roth, Aaron .
FOUNDATIONS AND TRENDS IN THEORETICAL COMPUTER SCIENCE, 2013, 9 (3-4) :211-406
[12]  
Dwork C, 2010, ACM S THEORY COMPUT, P715
[13]  
EPA, 2008, EPA FED TEST PROC FT
[14]   An Adaptive Approach to Real-Time Aggregate Monitoring with Differential Privacy [J].
Fan, Liyue ;
Xiong, Li .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) :2094-2106
[15]   Event labeling combining ensemble detectors and background knowledge [J].
Fanaee-T H. ;
Gama J. .
Progress in Artificial Intelligence, 2014, 2 (2-3) :113-127
[16]   OptStream: Releasing Time Series Privately [J].
Fioretto, Ferdinando ;
Van Hentenryck, Pascal .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 65 :423-456
[17]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[18]  
Friedman A., 2010, P 16 ACM SIGKDD INT, P493, DOI [DOI 10.1145/1835804.1835868, 10.1145/1835804.1835868]
[19]   TGM: A Generative Mechanism for Publishing Trajectories With Differential Privacy [J].
Ghane, Soheila ;
Kulik, Lars ;
Ramamohanarao, Kotagiri .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (04) :2611-2621
[20]   Differentially Private and Utility Preserving Publication of Trajectory Data [J].
Gursoy, Mehmet Emre ;
Liu, Ling ;
Truex, Stacey ;
Yu, Lei .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (10) :2315-2329