k-Degree anonymity and edge selection: improving data utility in large networks

被引:55
作者
Casas-Roma, Jordi [1 ]
Herrera-Joancomarti, Jordi [2 ]
Torra, Vicenc [3 ]
机构
[1] UOC, Internet Interdisciplinary Inst IN3, Fac Comp Sci Multimedia & Telecommun, Barcelona, Spain
[2] UAB, Dept Informat & Commun Engn, Bellaterra, Spain
[3] Univ Skovde, Sch Informat, Skovde, Sweden
关键词
Privacy; k-Anonymity; Social networks; Information loss; Data utility; Edge measures; COMMUNITY STRUCTURE;
D O I
10.1007/s10115-016-0947-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of anonymization in large networks and the utility of released data are considered in this paper. Although there are some anonymization methods for networks, most of them cannot be applied in large networks because of their complexity. In this paper, we devise a simple and efficient algorithm for k-degree anonymity in large networks. Our algorithm constructs a k-degree anonymous network by the minimum number of edge modifications. We compare our algorithm with other well-known k-degree anonymous algorithms and demonstrate that information loss in real networks is lowered. Moreover, we consider the edge relevance in order to improve the data utility on anonymized networks. By considering the neighbourhood centrality score of each edge, we preserve the most important edges of the network, reducing the information loss and increasing the data utility. An evaluation of clustering processes is performed on our algorithm, proving that edge neighbourhood centrality increases data utility. Lastly, we apply our algorithm to different large real datasets and demonstrate their efficiency and practical utility.
引用
收藏
页码:447 / 474
页数:28
相关论文
共 52 条
[1]  
Adamic LA, 2005, P 3 INT WORKSH LINK, P36
[2]  
[Anonymous], 2007, P 16 INT C WORLD WID
[3]  
[Anonymous], 2008, Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data
[4]  
[Anonymous], 2008, SIGKDD Explorations, DOI DOI 10.1145/1540276.1540279
[5]  
Bing-Jing Cai, 2010, 2010 International Conference on Machine Learning and Cybernetics (ICMLC 2010), P1849, DOI 10.1109/ICMLC.2010.5580953
[6]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[7]   Injecting Uncertainty in Graphs for Identity Obfuscation [J].
Boldi, Paolo ;
Bonchi, Francesco ;
Gionis, Aristides ;
Tassa, Tamir .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11) :1376-1387
[8]  
Bredereck R, 2014, LECT NOTES COMPUT SC, V8546, P44
[9]  
Campan A, 2015, TRANS DATA PRIV, V8, P55
[10]  
Campan A, 2009, LECT NOTES COMPUT SC, V5456, P33, DOI 10.1007/978-3-642-01718-6_4