A cluster-based approach for distributed anonymisation of vertically partitioned data

被引:0
作者
Xenakis, Antonios [1 ]
Chen, Zhiyuan [1 ]
Karabatis, George [1 ]
机构
[1] Department of Information Systems, University of Maryland, Baltimore County (UMBC), Baltimore, MD
关键词
cluster-based anonymisation; differential privacy; distributed anonymisation; K-anonymity; privacy;
D O I
10.1504/IJWET.2024.143360
中图分类号
学科分类号
摘要
In modern organisations, data is often spread across different sites, posing challenges for effective analysis. Transferring data to a centralised server may jeopardise privacy and leak sensitive/proprietary information. Therefore, organisations hesitate adopting this solution despite its potential to fully utilise, and analyse the data, for better decision making. Current approaches concentrate on distributed privacy-preserving techniques for data analysis, where data does not leave each site, but incurs substantial computational and communication overhead. This paper focuses on distributed data that is anonymised on site, then merged and sent to a centralised server for analysis. Two new approaches on cluster-based distributed anonymisation are introduced for vertically partitioned data, one based on distributed coordinated anonymisation, and the other based on top-down distributed anonymisation, resulting in low initial onsite anonymisation overhead. Experiments show these approaches preserve data privacy with very minor loss of utility of anonymised data and impose minimal computational overhead. © 2024 Inderscience Enterprises Ltd.
引用
收藏
页码:397 / 420
页数:23
相关论文
共 48 条
[1]  
Agrawal R., Srikant R., Privacy-preserving data mining, Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 439-450, (2000)
[2]  
Aleroud A., Yang F., Pallaprolu S.C., Chen Z., Karabatis G., Anonymization of network traces data through condensation-based differential privacy, Digital Threats: Research and Practice (DTRAP), 2, 4, pp. 1-23, (2021)
[3]  
Ammad-Ud-Din M., Ivannikova E., Khan S.A., Oyomno W., Fu Q., Tan K.E., Flanagan A., Federated Collaborative Filtering for Privacy-Preserving Personalized Recommendation System, (2019)
[4]  
Bi M., Wang Y., Cai Z., Tong X., A privacy-preserving mechanism based on local differential privacy in edge computing, China Communications, 17, 9, pp. 50-65, (2020)
[5]  
Bonawitz K., Eichner H., Grieskamp W., Huba D., Ingerman A., Ivanov V., Kiddon C., Kone_cn_y J., Mazzocchi S., McMahan B., Et al., Towards federated learning at scale: system design, Proceedings of Machine Learning and Systems, 1, pp. 374-388, (2019)
[6]  
Chuanxin Z., Yi S., Degang W., Federated learning with Gaussian differential privacy, Proceedings of the 2020 2nd International Conference on Robotics, Intelligent Control and Artificial Intelligence, pp. 296-301, (2020)
[7]  
Doganay M.C., Pedersen T.B., Saygin Y., Savas_ E., Levi A., Distributed privacy preserving k-means clustering with additive secret sharing, Proceedings of the 2008 International Workshop on Privacy and Anonymity in Information Society, pp. 3-11, (2008)
[8]  
Domingo-Ferrer J., Torra V., Ordinal, continuous and heterogeneous k-anonymity through microaggregation, Data Mining and Knowledge Discovery, 11, 2, pp. 195-212, (2005)
[9]  
Du W., Han Y.S., Chen S., Privacy-preserving multivariate statistical analysis: linear regression and classification, Proceedings of the 2004 SIAM International Conference on Data Mining, pp. 222-233, (2004)
[10]  
Dwork C., Differential privacy: a survey of results, International Conference on Theory and Applications of Models of Computation, pp. 1-19, (2008)