Rapid Re-Identification Risk Assessment for Anonymous Data Set in Mobile Multimedia Scene

Cited by: 5
Authors
Yang, Zhigang [1 ,2 ,3 ,4 ]
Wang, Ruyan [1 ,3 ,4 ]
Luo, Daizhong [2 ]
Xiong, Yu [2 ]
Affiliations
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Chongqing Univ Arts & Sci, Sch Artificial Intelligence, Chongqing 402160, Peoples R China
[3] Key Lab Opt Commun & Networks, Chongqing 400065, Peoples R China
[4] Key Lab Ubiquitous Sensing & Networking, Chongqing 400065, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data privacy; Data models; Trajectory; Risk management; Couplings; Privacy; Multimedia systems; Multimedia; privacy; overall re-identification risk; attribute dependency; de-anonymization
DOI
10.1109/ACCESS.2020.2977404
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Ubiquitous mobile multimedia applications bring great convenience to users. However, when enjoying mobile multimedia services, users provide personal data to service platforms. Although the service platforms always claim that the collected personal data are de-identified, the risk of re-identifying users through linkage attacks still exists and is incalculable. This paper proposes a rapid prediction model for the overall re-identification risk based on the statistics of data sets (i.e., the number of individuals, number of attributes, distribution of attribute values, and attribute dependency). Our proposed model reveals the impact of statistics on the overall re-identification risk and adopts random sampling and semi-random sampling methods to predict the overall re-identification risk of data sets with and without strong dependency ordered attribute pairs. Experimental results show that for the data sets without strong dependency ordered attribute pairs, the random sampling method has a high prediction accuracy (the prediction error is less than 0.05). For the data sets with strong dependency ordered attribute pairs, the semi-random sampling method has a high prediction accuracy (the prediction error is less than 0.09). Exploiting our model, governments and individuals can quickly assess the privacy leakage risk of their data sets, given only the statistics of the data sets. Besides, this model can also evaluate the privacy risk of data collection schemes in advance according to historical statistics, and identify suspected services.
Pages: 41557-41565
Page count: 9