Rapid Re-Identification Risk Assessment for Anonymous Data Set in Mobile Multimedia Scene

被引：5

作者：

Yang, Zhigang ^{[1
,2
,3
,4
]}

Wang, Ruyan ^{[1
,3
,4
]}

Luo, Daizhong ^{[2
]}

Xiong, Yu ^{[2
]}

机构：

[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China

[2] Chongqing Univ Arts & Sci, Sch Artificial Intelligence, Chongqing 402160, Peoples R China

[3] Key Lab Opt Commun & Networks, Chongqing 400065, Peoples R China

[4] Key Lab Ubiquitous Sensing & Networking, Chongqing 400065, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

中国国家自然科学基金;

关键词：

Data privacy; Data models; Trajectory; Risk management; Couplings; Privacy; Multimedia systems; Multimedia; privacy; overall re-identification risk; attribute dependency; DE-ANONYMIZATION;

D O I：

10.1109/ACCESS.2020.2977404

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Ubiquitous mobile multimedia applications bring great convenience to users. However, when enjoying mobile multimedia services, users provide personal data to service platforms. Although the service platforms always claim that the collected personal data are de-identified, the risk of re-identifying users through linkage attacks still exists and is incalculable. This paper proposes a rapid prediction model for the overall re-identification risk based on the statistics of data sets (i.e., the number of individuals, number of attributes, distribution of attribute values, and attribute dependency). Our proposed model reveals the impact of statistics on the overall re-identification risk and adopts random sampling and semi-random sampling methods to predict the overall re-identification risk of data sets with and without strong dependency ordered attribute pairs. Experimental results show that for the data sets without strong dependency ordered attribute pairs, the random sampling method has a high prediction accuracy (the prediction error is less than 0.05). For the data sets with strong dependency ordered attribute pairs, the semi-random sampling method has a high prediction accuracy (the prediction error is less than 0.09). Exploiting our model, governments and individuals can quickly assess the privacy leakage risk of their data sets, given only the statistic of the data sets. Besides, this model can also evaluate the privacy risk of data collection schemes in advance according to historical statistics, and identify suspected services.

引用

页码：41557 / 41565

页数：9

共 50 条

[21] The Impact of Data Suppression Rules on Data Access and Re-Identification Risk in Adoption and Foster Care Analysis and Reporting System Annual Files
Eiermann, Martin
CHILD MALTREATMENT, 2024,
[22] A method for managing re-identification risk from small geographic areas in Canada
El Emam, Khaled
Brown, Ann
AbdelMalik, Philip
Neisa, Angelica
Walker, Mark
Bottomley, Jim
Roffey, Tyson
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2010, 10
[23] A Re-Identification Strategy Using Machine Learning that Exploits Better Side Data
Hashimoto, Eina
Ichino, Masatsugu
Yoshiura, Hiroshi
2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 221 - 228
[24] Responsible Data Sharing: Identifying and Remedying Possible Re-Identification of Human Participants
Morehouse, Kirsten N.
Kurdi, Benedek
Nosek, Brian A.
AMERICAN PSYCHOLOGIST, 2024,
[25] Generate and Purify: Efficient Person Data Generation for Re-Identification
Lu, Jianjie
Zhang, Weidong
Yin, Haibing
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 558 - 566
[26] A computational model to protect patient data from location-based re-identification
Malin, Bradley
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 40 (03) : 223 - 239
[27] Re-identification of individuals in genomic data-sharing beacons via allele inference
von Thenen, Nora
Ayday, Erman
Cicek, A. Ercument
BIOINFORMATICS, 2019, 35 (03) : 365 - 371
[28] Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification
Staufer, Dimitri
Pallas, Frank
Berendt, Bettina
PROCEEDINGS OF THE 2024 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, ACM FACCT 2024, 2024, : 733 - 745
[29] Evaluation of Re-identification Risk using Anonymization and Differential Privacy in Healthcare
Ratra, Ritu
Gulia, Preeti
Gill, Nasib Singh
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (02) : 563 - 570
[30] Perceived Risk of Re-Identification in OMOP-CDM Database: A Cross-Sectional Survey
Tak, Yae Won
You, Seng Chan
Han, Jeong Hyun
Kim, Soon-Seok
Kim, Gi-Tae
Lee, Yura
JOURNAL OF KOREAN MEDICAL SCIENCE, 2022, 37 (26)

← 1 2 3 4 5 →