(r, k, ε)-Anonymization: Privacy-Preserving Data Publishing Algorithm Based on Multi-Dimensional Outlier Detection, k-Anonymity, and ε-Differential Privacy

被引:0
作者
Kara, Burak Cem [1 ]
Eyupoglu, Can [1 ]
Karakus, Oktay [2 ]
机构
[1] Natl Def Univ, Turkish Air Force Acad, Dept Comp Engn, TR-34149 Istanbul, Turkiye
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 4AG, Wales
关键词
Privacy; Differential privacy; Noise; Information integrity; Protection; Internet of Things; Publishing; Perturbation methods; Numerical models; Information filters; Privacy-preserving data publishing; data anonymity; epsilon-differential privacy; k-anonymity; PRESERVATION;
D O I
10.1109/ACCESS.2025.3559410
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, there has been a tremendous rise in both the volume and variety of big data, providing enormous potential benefits to businesses that seek to utilize consumer experiences for research or commercial purposes. The general data protection regulation (GDPR) implementation, on the other hand, has introduced extensive control over the use of individuals' personal information and placed many limits. Data anonymization technologies have become an important solution for businesses trying to generate value from data while adhering to GDPR limitations. To address these challenges, researchers have developed various methods, including k-anonymity and e-differential privacy, offering solutions for both industry and academia. However, protecting individuals' privacy against diverse attack attempts presents significant challenges for anonymization models that rely solely on a single technique, highlighting the need for more adaptable and hybrid approaches. In this study, a new hybrid anonymization algorithm called (r, k, e)-anonymization has been proposed, which combines k-anonymity and e-differential privacy models in a consistent framework and provides stronger privacy guarantees compared to existing privacy-preserving models. The proposed algorithm is capable of overcoming well-known shortcomings of the k-anonymity and e-differential privacy models, and it has been confirmed by extensive tests on real-world datasets. The proposed (r, k, e)-anonymization algorithm outperforms k-anonymity and e-differential privacy in terms of the average error rate measure, achieving data utility increases of 31.74% and 26.99%, respectively.
引用
收藏
页码:70422 / 70435
页数:14
相关论文
共 38 条
[1]   Federated learning and differential privacy for medical image analysis [J].
Adnan, Mohammed ;
Kalra, Shivam ;
Cresswell, Jesse C. ;
Taylor, Graham W. ;
Tizhoosh, Hamid R. .
SCIENTIFIC REPORTS, 2022, 12 (01)
[2]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[3]  
Bi MN, 2020, CHINA COMMUN, V17, P50, DOI 10.23919/JCC.2020.09.005
[4]   A Critical Review on the Use (and Misuse) of Differential Privacy in Machine Learning [J].
Blanco-Justicia, Alberto ;
Sanchez, David ;
Domingo-Ferrer, Josep ;
Muralidhar, Krishnamurty .
ACM COMPUTING SURVEYS, 2023, 55 (08)
[5]   A new utility-aware anonymization model for privacy preserving data publishing [J].
Canbay, Yavuz ;
Sagiroglu, Seref ;
Vural, Yilmaz .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (10)
[6]   Hiding Among Your Neighbors: Face Image Privacy Protection with Differential Private k-anonymity [J].
Cao, Jingyi ;
Liu, Bo ;
Wen, Yunqian ;
Zhu, Yunhui ;
Xie, Rong ;
Song, Li ;
Li, Lin ;
Yin, Yaoyao .
2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
[7]   Differential Privacy Preserving of Training Model in Wireless Big Data with Edge Computing [J].
Du, Miao ;
Wang, Kun ;
Xia, Zhuoqun ;
Zhang, Yan .
IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (02) :283-295
[8]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
[9]   An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques [J].
Eyupoglu, Can ;
Aydin, Muhammed Ali ;
Zaim, Abdul Halim ;
Sertbas, Ahmet .
ENTROPY, 2018, 20 (05)
[10]   Privacy preserving classification on local differential privacy in data centers [J].
Fan, Weibei ;
He, Jing ;
Guo, Mengjiao ;
Li, Peng ;
Han, Zhijie ;
Wang, Ruchuan .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 135 :70-82