Rapid Re-Identification Risk Assessment for Anonymous Data Set in Mobile Multimedia Scene

被引:5
|
作者
Yang, Zhigang [1 ,2 ,3 ,4 ]
Wang, Ruyan [1 ,3 ,4 ]
Luo, Daizhong [2 ]
Xiong, Yu [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Chongqing Univ Arts & Sci, Sch Artificial Intelligence, Chongqing 402160, Peoples R China
[3] Key Lab Opt Commun & Networks, Chongqing 400065, Peoples R China
[4] Key Lab Ubiquitous Sensing & Networking, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Data privacy; Data models; Trajectory; Risk management; Couplings; Privacy; Multimedia systems; Multimedia; privacy; overall re-identification risk; attribute dependency; DE-ANONYMIZATION;
D O I
10.1109/ACCESS.2020.2977404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ubiquitous mobile multimedia applications bring great convenience to users. However, when enjoying mobile multimedia services, users provide personal data to service platforms. Although the service platforms always claim that the collected personal data are de-identified, the risk of re-identifying users through linkage attacks still exists and is incalculable. This paper proposes a rapid prediction model for the overall re-identification risk based on the statistics of data sets (i.e., the number of individuals, number of attributes, distribution of attribute values, and attribute dependency). Our proposed model reveals the impact of statistics on the overall re-identification risk and adopts random sampling and semi-random sampling methods to predict the overall re-identification risk of data sets with and without strong dependency ordered attribute pairs. Experimental results show that for the data sets without strong dependency ordered attribute pairs, the random sampling method has a high prediction accuracy (the prediction error is less than 0.05). For the data sets with strong dependency ordered attribute pairs, the semi-random sampling method has a high prediction accuracy (the prediction error is less than 0.09). Exploiting our model, governments and individuals can quickly assess the privacy leakage risk of their data sets, given only the statistic of the data sets. Besides, this model can also evaluate the privacy risk of data collection schemes in advance according to historical statistics, and identify suspected services.
引用
收藏
页码:41557 / 41565
页数:9
相关论文
共 50 条
  • [31] The risk of re-identification when analyzing electronic health records: a critical appraisal and possible solutions
    Hauswaldt, Johannes
    Demmer, Iris
    Heinemann, Stephanie
    Himmel, Wolfgang
    Hummers, Eva
    Pung, Johannes
    Schlegelmilch, Falk
    Drepper, Johannes
    ZEITSCHRIFT FUR EVIDENZ FORTBILDUNG UND QUALITAET IM GESUNDHEITSWESEN, 2019, 149 : 22 - 31
  • [32] Evaluation of Re-identification Risk using Anonymization and Differential Privacy in Healthcare
    Ratra R.
    Gulia P.
    Gill N.S.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (02): : 563 - 570
  • [33] WristPrint: Characterizing User Re-identification Risks from Wrist-worn Accelerometry Data
    Saleheen, Nazir
    Ullah, Md Azim
    Chakraborty, Supriyo
    Ones, Deniz S.
    Srivastava, Mani
    Kumar, Santosh
    CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 2807 - 2823
  • [34] Multi-Camera Person Re-Identification Based on Trajectory Data
    Mendes, Diogo
    Correia, Simao
    Jorge, Pedro
    Brandao, Tomas
    Arriaga, Patricia
    Nunes, Luis
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [35] Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack
    Manzanares-Salor, Benet
    Sanchez, David
    Lison, Pierre
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 4040 - 4075
  • [36] On the Effectiveness of Re-Identification Attacks and Local Differential Privacy-Based Solutions for Smart Meter Data
    Kaya, Zeynep Sila
    Gursoy, M. Emre
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, SECRYPT 2023, 2023, : 111 - 122
  • [37] Re-identification and information fusion between anonymized CDR and social network data
    Cecaj, Alket
    Mamei, Marco
    Zambonelli, Franco
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2016, 7 (01) : 83 - 96
  • [38] Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification
    Wang, Qi
    Min, Weidong
    Han, Qing
    Liu, Qian
    Zha, Cheng
    Zhao, Haoyu
    Wei, Zitai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1031 - 1041
  • [39] Re-identification and information fusion between anonymized CDR and social network data
    Alket Cecaj
    Marco Mamei
    Franco Zambonelli
    Journal of Ambient Intelligence and Humanized Computing, 2016, 7 : 83 - 96
  • [40] Fast refacing of MR images with a generative neural network lowers re-identification risk and preserves volumetric consistency
    Molchanova, Nataliia
    Marechal, Benedicte
    Thiran, Jean-Philippe
    Kober, Tobias
    Huelnhagen, Till
    Richiardi, Jonas
    HUMAN BRAIN MAPPING, 2024, 45 (09)