Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing

Cited by: 4
Authors
Mehta, Brijesh B. [1]
Rao, Udai Pratap [1]
Affiliation
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, India
Source
RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 2 | 2018 / Vol. 708
Keywords
Big data; Big data privacy; k-anonymity;
DOI
10.1007/978-981-10-8636-6_31
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Big data is collected from many sources and processed with many tools, which raises privacy issues. Privacy-preserving data publishing techniques such as k-anonymity, l-diversity, and t-closeness are used to de-identify data, but because the data is collected from multiple sources, a risk of re-identification remains. With a large amount of data, less generalization or suppression is needed to achieve the same level of privacy, which is known as the "large crowd effect"; however, anonymizing data at that scale is itself a challenging task. MapReduce can handle large amounts of data, but it splits the data into small chunks, so the advantage of a large dataset is lost. The scalability of privacy-preserving techniques has therefore become a challenging research area, which we explore by proposing a scalable k-anonymity algorithm for MapReduce. Compared with an existing algorithm, our approach shows a significant improvement in running time.
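
The chunking concern in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm; the records, the field names, and the helper `is_k_anonymous` are hypothetical and chosen only for illustration. It checks k-anonymity (every combination of quasi-identifier values appears at least k times) on a whole toy dataset and then on the same data split into two chunks, roughly as a MapReduce-style input split might divide it.

```python
from collections import Counter

def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every quasi-identifier combination occurs at least k times."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())

# Hypothetical toy records; "age" and "zip" are assumed to be already generalized.
records = [
    {"age": "20-29", "zip": "395*", "disease": "flu"},
    {"age": "30-39", "zip": "395*", "disease": "cold"},
    {"age": "20-29", "zip": "395*", "disease": "asthma"},
    {"age": "30-39", "zip": "395*", "disease": "flu"},
]
qi = ["age", "zip"]

# The whole dataset: each (age, zip) combination appears twice.
whole_ok = is_k_anonymous(records, qi, k=2)

# The same data split into two chunks of two records each.
chunks = [records[:2], records[2:]]
chunks_ok = [is_k_anonymous(chunk, qi, k=2) for chunk in chunks]

print(whole_ok, chunks_ok)
```

In this toy case the full dataset satisfies 2-anonymity at the given generalization level, while neither chunk does on its own, so a per-chunk anonymizer would have to generalize or suppress further. That is one way to read the abstract's remark that chunking forfeits the advantage of large data.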
Pages: 297-304
Number of pages: 8