Anonymization in the time of big data

Cited by: 0
Authors
Domingo-Ferrer J. [1 ]
Soria-Comas J. [1 ]
Affiliation
[1] Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans 26, Tarragona, 43007, CA
Source
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2016 / Vol. 9867 LNCS
Keywords
Big data; Curse of dimensionality; Data anonymization; K-anonymity; Multiple releases;
DOI
10.1007/978-3-319-45381-1_5
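Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract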
In this work we explore how viable anonymization is for preventing disclosure in structured big data. For the sake of concreteness, we focus on k-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges in using k-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may entail a large information loss in attaining k-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent k-anonymous releases on overlapping populations of subjects; the k-anonymity guarantee no longer holds if an observer pools such independent releases. We propose solutions to deal with the above two challenges. Our conclusion is that, with the proposed adjustments, k-anonymity is still useful in a context of big data. © Springer International Publishing Switzerland 2016.
Pages: 57 - 68
Number of pages: 11
Related papers
50 in total
  • [11] Anonylitics: From a Small Data to a Big Data Anonymization System for Analytical Projects
    Pomares-Quimbaya, Alexandra
    Sierra-Munera, Alejandro
    Mendoza-Mendoza, Jaime
    Malaver-Moreno, Julian
    Carvajal, Hernan
    Moncayo, Victor
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS), VOL 1, 2019, : 61 - 71
  • [12] A Review of Anonymization Algorithms and Methods in Big Data
    Shamsinejad E.
    Banirostam T.
    Pedram M.M.
    Rahmani A.M.
    Annals of Data Science, 2025, 12 (1) : 253 - 279
  • [13] A Clustering Based Anonymization Model for Big Data
    Canbay, Yavuz
    Kalyoncu, Aydincan
    Ercimen, Mucahid
    Dogan, Adem
    Sagiroglu, Seref
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 720 - 725
  • [14] Two-phase Entropy based approach to Big Data Anonymization
    Ranjan, Ashish
    Ranjan, Prabhat
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 76 - 81
  • [15] Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing
    Mehta, Brijesh B.
    Rao, Udai Pratap
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 2, 2018, 708 : 297 - 304
  • [16] Contextual Anonymization for Secondary Use of Big Data in Biomedical Research: Proposal for an Anonymization Matrix
    Rumbold, John
    Pierscionek, Barbara
    JMIR MEDICAL INFORMATICS, 2018, 6 (04) : 229 - 241
  • [17] PRIVACY PRESERVATION IN BIG DATA USING ANONYMIZATION TECHNIQUES
    Karle, Tanashri
    Vora, Deepali
    2017 1ST IEEE INTERNATIONAL CONFERENCE ON DATA MANAGEMENT, ANALYTICS AND INNOVATION (ICDMAI), 2017, : 340 - 343
  • [18] Fast Summarization and Anonymization of Multivariate Big Time Series
    Ruta, Dymitr
    Cen, Ling
    Damiani, Ernesto
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1901 - 1904
  • [19] Scalable Solution for the Anonymization of Big Data Spatio-Temporal Trajectories
    Eddine, Hajlaoui Jalel
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022, PT I, 2022, 13375 : 465 - 476
  • [20] Privacy Preserving Big data Using Combine Anonymization and Encryption Approach
    Desai, Vidhi
    Chauhan, Gargi K.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,