Privacy-preserving Anonymization of Set-valued Data

Cited by: 168
Authors
Terrovitis, Manolis [1]
Mamoulis, Nikos [1]
Kalnis, Panos [1]
Affiliations
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2008 / Vol. 1 / Issue 01
DOI
10.14778/1453856.1453874
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper we study the problem of protecting privacy in the publication of set-valued data. Consider a collection of transactional data that contains detailed information about items bought together by individuals. Even after removing all personal characteristics of the buyer that could serve as links to his identity, the publication of such data is still subject to privacy attacks from adversaries who have partial knowledge about the set. Unlike most previous works, we do not distinguish data as sensitive and non-sensitive; instead, we consider every item both a potential quasi-identifier and potentially sensitive data, depending on the point of view of the adversary. We define a new version of the k-anonymity guarantee, k^m-anonymity, to limit the effects of data dimensionality, and we propose efficient algorithms to transform the database. Our anonymization model relies on generalization rather than suppression, the latter being the most common practice in related work on such data. We develop an algorithm that finds the optimal solution, but at a high cost that makes it inapplicable to large, realistic problems. We then propose two greedy heuristics, which scale much better and in most cases find a solution close to the optimal. The proposed algorithms are experimentally evaluated using real datasets.
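The abstract does not spell out the k^m-anonymity guarantee, so the sketch below assumes the usual reading: an adversary who knows up to m items of a transaction should not be able to narrow it down to fewer than k transactions. Equivalently, every combination of at most m items that occurs in the data must occur in at least k transactions. The function names `is_km_anonymous` and `generalize`, the toy transactions, and the one-level item mapping are all hypothetical illustrations, not the paper's algorithms; the check is brute force, and the mapping is picked by hand rather than by an optimal or greedy search.

```python
from collections import Counter
from itertools import combinations


def is_km_anonymous(transactions, k, m):
    """Brute-force check (assumed definition): every itemset of size <= m
    that appears in some transaction appears in at least k transactions."""
    counts = Counter()
    for t in transactions:
        items = sorted(set(t))  # canonical order so itemset keys are comparable
        for size in range(1, min(m, len(items)) + 1):
            counts.update(combinations(items, size))
    return all(c >= k for c in counts.values())


def generalize(transactions, mapping):
    """Replace each item by its generalized value; unmapped items stay as-is."""
    return [{mapping.get(item, item) for item in t} for t in transactions]


# Hypothetical toy data: four market-basket transactions.
data = [
    {"beer", "milk"},
    {"wine", "milk"},
    {"beer", "bread"},
    {"wine", "bread"},
]

# Every single item occurs twice, but each pair of items is unique.
print(is_km_anonymous(data, k=2, m=1))
print(is_km_anonymous(data, k=2, m=2))

# Hand-picked generalization: beer and wine both become "alcohol".
generalized = generalize(data, {"beer": "alcohol", "wine": "alcohol"})
print(is_km_anonymous(generalized, k=2, m=2))
```

On this toy data the original transactions pass the check for m = 1 but fail for m = 2, while the generalized version passes for m = 2. No item is suppressed; detail is traded for anonymity, which is the trade-off the abstract attributes to generalization. The brute-force counting grows quickly with m and with transaction length, which is consistent with the abstract's remark that finding an optimal solution is costly.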
Pages: 115-125
Page count: 11