Using Search Results to Microaggregate Query Logs Semantically

Cited by: 4
Authors
Erola, Arnau [1]
Castella-Roca, Jordi [1]
Affiliations
[1] Univ Rovira & Virgili, UNESCO Chair Data Privacy, Dept Engn Informat & Matemat, Av Paisos Catalans 26, E-43007 Tarragona, Spain
Source
DATA PRIVACY MANAGEMENT AND AUTONOMOUS SPONTANEOUS SECURITY, DPM 2013 | 2014 / Vol. 8247
Keywords
Privacy; Web search; Microaggregation; k-anonymity; Query logs; Semantics; Semantic microaggregation
DOI
10.1007/978-3-642-54568-9_10
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Query log anonymization has become an important challenge. A query log contains the users' search history, along with the results they selected and the position of those results in the ranking. These data are used to provide personalized re-ranking of results and to study trends. However, query logs can disclose sensitive information about users. Hence, query logs must undergo an anonymization process that guarantees that: (a) no sensitive information can be linked to an identity; and (b) analysis of the anonymized data produces results similar to those obtained from the original data, i.e., data distortion is minimized. Recent anonymization approaches use microaggregation, a statistical disclosure control technique that provides privacy comparable to k-anonymity while attempting to minimize data distortion. We propose a new method that uses search results to optimize microaggregation, providing more data reliability than existing methods.
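
The general idea of microaggregation mentioned in the abstract can be illustrated with a minimal sketch. This is not the method proposed in the paper, which groups users by the semantics of their queries and search results; it is a generic fixed-size variant on plain numbers, used here only as a stand-in. The function name `microaggregate` and the sample values are assumptions made for illustration.

```python
from statistics import mean


def microaggregate(values, k):
    """Replace each value with the mean of a group of at least k similar values.

    Illustrative univariate sketch only: real query-log microaggregation
    needs a semantic distance between users, which is not modeled here.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if len(values) < k:
        raise ValueError("need at least k records")

    # Sort record indices by value so that similar records are adjacent.
    order = sorted(range(len(values)), key=lambda i: values[i])

    # Cut the sorted order into consecutive groups of k records.
    groups = [order[i:i + k] for i in range(0, len(order), k)]

    # A short trailing group is merged into the previous one,
    # so that every group keeps at least k members.
    if len(groups) > 1 and len(groups[-1]) < k:
        tail = groups.pop()
        groups[-1].extend(tail)

    # Publish the group centroid instead of each original value.
    result = [0.0] * len(values)
    for group in groups:
        centroid = mean(values[i] for i in group)
        for i in group:
            result[i] = centroid
    return result


if __name__ == "__main__":
    print(microaggregate([1, 2, 3, 10, 11, 12, 13], k=3))
```

Each published value is shared by at least k records, which is where the comparison with k-anonymity comes from; the distortion is the gap between each original value and its group centroid, so grouping similar records keeps it small.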
Pages: 148-161
Number of pages: 14
  • [10] Hong Yuan., 2009, Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM '09, P1465, DOI DOI 10.1145/1645953.1646146