Anonymization in the time of big data

被引:0
|
作者
Domingo-Ferrer J. [1 ]
Soria-Comas J. [1 ]
机构
[1] Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans 26, Tarragona, 43007, CA
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2016年 / 9867 LNCS卷
关键词
Big data; Curse of dimensionality; Data anonymization; K-anonymity; Multiple releases;
D O I
10.1007/978-3-319-45381-15
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this work we explore how viable is anonymization to prevent disclosure in structured big data. For the sake of concreteness, we focus on k-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges to use k-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may lead to a large information loss to attain k-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent k-anonymous releases on overlapping populations of subjects; the k-anonymity guarantee does not longer hold if an observer pools such independent releases. We propose solutions to deal with the above two challenges. Our conclusion is that, with the proposed adjustments, k-anonymity is still useful in a context of big data. © Springer International Publishing Switzerland 2016.
引用
收藏
页码:57 / 68
页数:11
相关论文
共 50 条
  • [1] Anonymization in the Time of Big Data
    Domingo-Ferrer, Josep
    Soria-Comas, Jordi
    PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY, 2016, 9867 : 57 - 68
  • [2] Efficient multimedia big data anonymization
    Jang, Sung-Bong
    Ko, Young-Woong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (17) : 17855 - 17872
  • [3] Big Data Anonymization with Spark
    Canbay, Yavuz
    Sagiroglu, Seref
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 833 - 838
  • [4] A Study of Performance Enhancement in Big Data Anonymization
    Jang, Sung-Bong
    2017 4TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS AND INFORMATION PROCESSING TECHNOLOGY (CAIPT), 2017,
  • [5] Sensitivity-based Anonymization of Big Data
    Al-Zobbi, Mohammed
    Shahrestani, Seyed
    Ruan, Chun
    PROCEEDINGS OF THE 2016 IEEE 41ST CONFERENCE ON LOCAL COMPUTER NETWORKS - LCN WORKSHOPS 2016, 2016, : 58 - 64
  • [6] An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques
    Eyupoglu, Can
    Aydin, Muhammed Ali
    Zaim, Abdul Halim
    Sertbas, Ahmet
    ENTROPY, 2018, 20 (05)
  • [7] Efficient multimedia big data anonymization
    Sung-Bong Jang
    Young-Woong Ko
    Multimedia Tools and Applications, 2017, 76 : 17855 - 17872
  • [8] Personal Big Data, GDPR and Anonymization
    Domingo-Ferrer, Josep
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2019, 11529 : 7 - 10
  • [9] Data anonymization evaluation for big data and IoT environment
    Ni, Chunchun
    Cang, Li Shan
    Gope, Prosanta
    Min, Geyong
    INFORMATION SCIENCES, 2022, 605 : 381 - 392
  • [10] Towards Optimal Sensitivity-Based Anonymization for Big Data
    Al-Zobbi, Mohammed
    Shahrestani, Seyed
    Ruan, Chun
    2017 27TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2017, : 331 - 336