Anonymization in the time of big data

被引:0
作者
Domingo-Ferrer J. [1 ]
Soria-Comas J. [1 ]
机构
[1] Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans 26, Tarragona, 43007, CA
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2016年 / 9867 LNCS卷
关键词
Big data; Curse of dimensionality; Data anonymization; K-anonymity; Multiple releases;
D O I
10.1007/978-3-319-45381-15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we explore how viable is anonymization to prevent disclosure in structured big data. For the sake of concreteness, we focus on k-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges to use k-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may lead to a large information loss to attain k-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent k-anonymous releases on overlapping populations of subjects; the k-anonymity guarantee does not longer hold if an observer pools such independent releases. We propose solutions to deal with the above two challenges. Our conclusion is that, with the proposed adjustments, k-anonymity is still useful in a context of big data. © Springer International Publishing Switzerland 2016.
引用
收藏
页码:57 / 68
页数:11
相关论文
共 50 条
  • [31] Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework
    Lawrance, Josephine Usha
    Jesudhasan, Jesu Vedha Nayahi
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) : 1587 - 1620
  • [32] A MapReduce Based Approach of Scalable Multidimensional Anonymization for Big Data Privacy Preservation on Cloud
    Zhang, Xuyun
    Yang, Chi
    Nepal, Surya
    Liu, Chang
    Dou, Wanchun
    Chen, Jinjun
    2013 IEEE THIRD INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING (CGC 2013), 2013, : 105 - 112
  • [33] Implications of Data Anonymization on the Statistical Evidence of Disparity
    Xu, Heng
    Zhang, Nan
    MANAGEMENT SCIENCE, 2022, 68 (04) : 2600 - 2618
  • [34] Big data for big questions: it is time for data analysts to act
    Moscato, Pablo
    FUTURE SCIENCE OA, 2015, 1 (03):
  • [35] A utility based approach for data stream anonymization
    Sopaoglu, Ugur
    Abul, Osman
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2020, 54 (03) : 605 - 631
  • [36] Data Anonymization Based on Natural Equivalent Class
    Guo, Naixuan
    Yang, Ming
    Gong, Qiyuan
    Chen, Zhouguo
    Luo, Junzhou
    PROCEEDINGS OF THE 2019 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2019, : 22 - 27
  • [37] Classification utility aware data stream anonymization
    Sopaoglu, Ugur
    Abul, Osman
    APPLIED SOFT COMPUTING, 2021, 110
  • [38] Automation of the Validation, Anonymization and Augmentation of Big Data from a Multi-year Driving Study
    Wallace, Bruce
    Goubran, Rafik
    Knoefel, Frank
    Marshall, Shawn
    Porter, Michelle
    Harlow, Madelaine
    Puli, Akshay
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 608 - 614
  • [39] On the Role of Data Anonymization in Machine Learning Privacy
    Senavirathne, Navoda
    Torra, Vicenc
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 664 - 675
  • [40] Adaptive Privacy Preservation Approach for Big Data Publishing in Cloud using k-anonymization
    Madan S.
    Goswami P.
    Recent Advances in Computer Science and Communications, 2021, 14 (08) : 2678 - 2688