Anonymizing Collections of Tree-Structured Data

被引:14
作者
Gkountouna, Olga [1 ]
Terrovitis, Manolis [2 ]
机构
[1] Natl Tech Univ Athens, Dept Elect & Comp Engn, GR-10682 Athens, Greece
[2] Inst Management Informat Syst, Res & Innovat Ctr Informat Commun & Knowledge Tec, Athens, Greece
关键词
Privacy; tree data; anonymity; structural knowledge; generalization; disassociation; MICRODATA;
D O I
10.1109/TKDE.2015.2405563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collections of real-world data usually have implicit or explicit structural relations. For example, databases link records through foreign keys, and XML documents express associations between different values through syntax. Privacy preservation, until now, has focused either on data with a very simple structure, e.g. relational tables, or on data with very complex structure e.g. social network graphs, but has ignored intermediate cases, which are the most frequent in practice. In this work, we focus on tree structured data. Such data stem from various applications, even when the structure is not directly reflected in the syntax, e.g. XML documents. A characteristic case is a database where information about a single person is scattered amongst different tables that are associated through foreign keys. The paper defines k((m,n))-anonymity, which provides protection against identity disclosure and proposes a greedy anonymization heuristic that is able to sanitize large datasets. The algorithm and the quality of the anonymization are evaluated experimentally.
引用
收藏
页码:2034 / 2048
页数:15
相关论文
共 49 条
[11]  
Chen R, 2011, PROC VLDB ENDOW, V4, P1087
[12]  
Cheng J., 2010, P ACM SIGMOD INT C M, P459, DOI DOI 10.1145/1807167.1807218
[13]  
Cormode G., 2011, Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, P1253
[14]  
Cormode G, 2013, I C DATA ENGIN WORKS, P77, DOI 10.1109/ICDEW.2013.6547431
[15]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
[16]  
Ghinita G., 2007, Proceedings of the 33rd international conference on Very large data bases, P758
[17]   Anonymous Publication of Sensitive Transactional Data [J].
Ghinita, Gabriel ;
Kalnis, Panos ;
Tao, Yufei .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (02) :161-174
[18]   On the anonymization of sparse high-dimensional data [J].
Ghinita, Gabriel ;
Tao, Yufei ;
Kalnis, Panos .
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, :715-+
[19]  
Gkountouna O, 2014, LECT NOTES COMPUT SC, V8744, P156, DOI 10.1007/978-3-319-11257-2_13
[20]  
Han JW, 2000, SIGMOD RECORD, V29, P1