Score, Arrange, and Cluster: A Novel Clustering-Based Technique for Privacy-Preserving Data Publishing

被引:1
作者
Sowmyarani, C. N. [1 ]
Namya, L. G. [1 ]
Nidhi, G. K. [1 ]
Ramakanth Kumar, P. [1 ]
机构
[1] RV Coll Engn, Bengaluru 560059, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Data privacy; Publishing; Stakeholders; Clustering algorithms; Data models; Information integrity; Genetic algorithms; Decision making; Homomorphic encryption; Clustering; k-anonymity; data privacy; privacy-preserving data publishing; genetic algorithm; MODEL;
D O I
10.1109/ACCESS.2024.3403372
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data-driven decision-making has become critical to every organization. There is a growing emphasis on adopting robust data governance frameworks for data management. This encompasses data publishing to empower stakeholders with the ability to access and analyze the published data, playing a pivotal role in decision-making. However, data publishing poses a threat to entity-specific information. Privacy-Preserving Data Publishing (PPDP) refers to publishing data while protecting the privacy of entity-specific information. K-anonymity is a well-recognized method that achieves PPDP and serves as the foundation of our proposed clustering-based data transformation algorithm, "Score, Arrange, and Cluster (SAC)". For effective data management and decision-making in organizations, it is crucial to address the varying data requirements and role-based access levels of the involved stakeholders. SAC was designed to offer only a generic data transformation with minimal data quality degradation. Hence, this work presents an enhancement to SAC that takes into account stakeholder roles and requirements, as illustrated through different scenarios. The scoring mechanism in SAC is augmented to accommodate customization or use the concepts of Genetic Algorithms to enforce role-based access control. The "Cost of Degradation" (CoD) metric is used to quantify the data quality degradation. As per the experimental results, in the customization scenario, a higher attribute priority leads to lower data quality degradation, while, in the role-based access control scenario a higher access level results in a lower data quality degradation.
引用
收藏
页码:79861 / 79874
页数:14
相关论文
共 50 条
  • [31] Clustering-based privacy preserving anonymity approach for table data sharing
    Piao, Chunhui
    Liu, Liping
    Shi, Yajuan
    Jiang, Xuehong
    Song, Ning
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2020, 11 (04) : 768 - 773
  • [32] A New Anonymity Model for Privacy-Preserving Data Publishing
    Huang Xuezhen
    Liu Jiqiang
    Han Zhen
    Yang Jun
    CHINA COMMUNICATIONS, 2014, 11 (09) : 47 - 59
  • [33] STDP: Secure Privacy-Preserving Trajectory Data Publishing
    Eom, Chris Soo-Hyun
    Lee, Wookey
    Leung, Carson Kai-Sang
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 892 - 899
  • [34] Clustering-based Efficient Privacy-preserving Face Recognition Scheme without Compromising Accuracy
    Liu, Meng
    Hu, Hongsheng
    Xiang, Haolong
    Yang, Chi
    Lyu, Lingjuan
    Zhang, Xuyun
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2021, 17 (03)
  • [35] On Privacy-Preserving Publishing of Spontaneous ADE Reporting Data
    Lin, Wen-Yang
    Yang, Duan-Chun
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [36] Privacy-Preserving Data Publishing: A Survey of Recent Developments
    Fung, Benjamin C. M.
    Wang, Ke
    Chen, Rui
    Yu, Philip S.
    ACM COMPUTING SURVEYS, 2010, 42 (04)
  • [37] EDAMS: Efficient Data Anonymization Model Selector for Privacy-Preserving Data Publishing
    Qamar, Tehreem
    Bawany, Narmeen Zakaria
    Khan, Najeed Ahmed
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (02) : 5423 - 5427
  • [38] Restricted Sensitive Attributes-based Sequential Anonymization (RSA-SA) approach for privacy-preserving data stream publishing
    Abdelharneed, Saad A.
    Moussa, Sherin M.
    Khalifa, Mohamed E.
    KNOWLEDGE-BASED SYSTEMS, 2019, 164 : 1 - 20
  • [39] Privacy, space, and time: a survey on privacy-preserving continuous data publishing
    Katsomallos, Manos
    Tzompanaki, Katerina
    Kotzinos, Dimitris
    JOURNAL OF SPATIAL INFORMATION SCIENCE, 2019, (19): : 57 - 103
  • [40] FPDP: Flexible Privacy-Preserving Data Publishing Scheme for Smart Agriculture
    Song, Jingcheng
    Zhong, Qi
    Wang, Weizheng
    Su, Chunhua
    Tan, Zhiyuan
    Liu, Yining
    IEEE SENSORS JOURNAL, 2021, 21 (16) : 17430 - 17438