Monitoring Web Browsing Behavior with Differential Privacy

被引:37
作者
Fan, Liyue [1 ]
Bonomi, Luca [1 ]
Xiong, Li [1 ]
Sunderam, Vaidy [1 ]
机构
[1] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
来源
WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB | 2014年
关键词
Web Monitoring; Web Mining; Differential Privacy;
D O I
10.1145/2566486.2568038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monitoring web browsing behavior has benefited many data mining applications, such as top-K discovery and anomaly detection. However, releasing private user data to the greater public would concern web users about their privacy, especially after the incident of AOL search log release where anonymization was not correctly done. In this paper, we adopt differential privacy, a strong, provable privacy definition, and show that differentially private aggregates of web browsing activities can be released in real-time while preserving the utility of shared data. Our proposed algorithms utilize the rich correlation of the time series of aggregated data and adopt a state-space approach to estimate the underlying, true aggregates from the perturbed values by the differential privacy mechanism. We evaluate our algorithms with real-world web browsing data. Utility evaluations with three metrics demonstrate that the quality of the private, released data by our solutions closely resembles that of the original, unperturbed aggregates.
引用
收藏
页码:177 / 187
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 2010, Advances in Neural Information Processing Systems, DOI DOI 10.5555/2997046.2997169
[2]  
[Anonymous], 2012, Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
[3]  
Barbaro MichaelTom Zeller Jr., 2006, A Face Is Exposed for AOL Searcher No. 4417749
[4]  
Blum A, 2008, ACM S THEORY COMPUT, P609
[5]  
Bonomi Luca., 2013, Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York, NY, USA, June 22-27, 2013, P1029
[6]  
Cadez I., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P280, DOI 10.1145/347090.347151
[7]  
Canali D., 2013, NDSS 2013 20 ANN NET
[8]  
Chan T.-H Hubert, 2012, Privacy Enhancing Technologies. Proceedings 12th International Symposium, PETS 2012, P140, DOI 10.1007/978-3-642-31680-7_8
[9]  
Chan THH, 2010, LECT NOTES COMPUT SC, V6199, P405, DOI 10.1007/978-3-642-14162-1_34
[10]   Nonstationary Poisson modeling of web browsing session arrivals [J].
Chlebus, Edward ;
Brazier, Jordy .
INFORMATION PROCESSING LETTERS, 2007, 102 (05) :187-190