A Novel Privacy Preserving Framework for Large Scale Graph Data Publishing

被引:30
作者
Ding, Xiaofeng [1 ]
Wang, Cui [1 ]
Choo, Kim-Kwang Raymond [2 ,3 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Wuhan 430074, Peoples R China
[2] Univ Texas San Antonio, Dept Informat Syst & Cyber Secur, San Antonio, TX 78249 USA
[3] Univ Texas San Antonio, Dept Elect & Comp Engn, San Antonio, TX 78249 USA
基金
中国国家自然科学基金;
关键词
Large scale graph publication; privacy preserving; graph decomposition; community detection;
D O I
10.1109/TKDE.2019.2931903
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The need to efficiently store and query large scale graph datasets is evident in the growing number of data-intensive applications, particularly to maximize the mining of intelligence from these data (e.g., to inform decision making). However, directly releasing graph dataset for analysis may leak sensitive information of an individual even if the graph is anonymized, as demonstrated by the re-identification attacks on the DBpedia datasets. A key challenge in the design of graph sanitization methods is scalability, as existing execution models generally have significant memory requirements. In this paper, we propose a novel k-decomposition algorithm and define a new information loss matrix designed for utility measurement in massively large graph datasets. We also propose a novel privacy preserving framework that can be seamlessly integrated with graph storage, anonymization, query processing, and analysis. Our experimental studies show that the proposed solution achieves privacy-preserving, utility, and efficiency.
引用
收藏
页码:331 / 343
页数:13
相关论文
共 45 条
  • [1] Privacy Preserving Social Network Data Publication
    Abawajy, Jemal H.
    Ninggal, Mohd Izuan Hafez
    Herawan, Tutut
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03): : 1974 - 1997
  • [2] [Anonymous], 2010, P 2010 ACM SIGMOD IN, DOI DOI 10.1145/1807167.1807218
  • [3] Backes M., 2016, P 2016 ACM SIGSAC C, P319, DOI [10.1145/, 10.1145/2976749.2978355]
  • [4] Bhagat S, 2009, PROC VLDB ENDOW, V2
  • [5] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [6] Bronson N., 2013, USENIX ANN TECHN C, P49
  • [7] A survey of graph-modification techniques for privacy-preserving on networks
    Casas-Roma, Jordi
    Herrera-Joancomarti, Jordi
    Torra, Vicenc
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (03) : 341 - 366
  • [8] Castelluccia C., 2012, P ACM C COMP COMM SE, P638
  • [9] Privacy Preserving Subgraph Matching on Large Graphs in Cloud
    Chang, Zhao
    Zou, Lei
    Li, Feifei
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 199 - 213
  • [10] Cordella L. P., 1999, Proceedings 10th International Conference on Image Analysis and Processing, P1172, DOI 10.1109/ICIAP.1999.797762