An Improved Density Peak Clustering Algorithm Based on Chebyshev Inequality and Differential Privacy

Times cited: 4
Authors
Chen, Hua [1]
Zhou, Yuan [1, 2]
Mei, Kehui [1]
Wang, Nan [1]
Tang, Mengdi [1]
Cai, Guangxing [1]
Affiliations
[1] Hubei Univ Technol, Sch Sci, Wuhan 430068, Peoples R China
[2] Wuhan Univ Bioengn, Sch Comp Sci & Technol, Wuhan 430060, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 15
Funding
National Natural Science Foundation of China
Keywords
DPC algorithm; differential privacy; cosine distance; dichotomy method; Chebyshev inequality; BIG DATA;
DOI
10.3390/app13158674
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Featured Application: Privacy protection and data mining.

This study aims to improve the quality of the clustering results of the density peak clustering (DPC) algorithm and to address the privacy protection problem in the clustering analysis process. To this end, a DPC algorithm based on Chebyshev inequality and differential privacy (DP-CDPC) is proposed. First, when dealing with high-dimensional datasets, the distance matrix is calculated using cosine distance instead of Euclidean distance, and the truncation distance is calculated automatically using the dichotomy method. Second, to ease the difficulty of selecting suitable clustering centers in the DPC algorithm, statistical constraints are constructed on the decision graph using Chebyshev inequality, and clustering centers are selected by adjusting the constraint parameters. Finally, to address privacy leakage in the cluster analysis, the Laplace mechanism is applied to add noise to the local density during clustering, which gives the algorithm its privacy protection. The experimental results demonstrate that the DP-CDPC algorithm can effectively select the clustering centers, improve the quality of clustering results, and provide good privacy protection performance.
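The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formulas, so the bisection criterion (`target_ratio`), the count-based density with sensitivity 1, and the "mean + k * std" selection rule in `chebyshev_outliers` and `select_centers` are all assumptions, and every function name here is hypothetical.

```python
import math
import random
import statistics


def cosine_distance(u, v):
    """Return 1 - cos(u, v); assumes neither vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def distance_matrix(points):
    """Pairwise cosine distances as a list of lists."""
    n = len(points)
    return [[0.0 if i == j else cosine_distance(points[i], points[j])
             for j in range(n)] for i in range(n)]


def neighbor_ratio(dist, d_c):
    """Average fraction of other points lying within d_c of each point."""
    n = len(dist)
    close = sum(1 for i in range(n) for j in range(n)
                if i != j and dist[i][j] < d_c)
    return close / (n * (n - 1))


def cutoff_by_bisection(dist, target_ratio=0.02, tol=1e-6):
    """Bisect on d_c until the neighbor ratio reaches target_ratio.

    The target-ratio criterion is an assumption, not taken from the paper.
    """
    lo, hi = 0.0, max(max(row) for row in dist)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if neighbor_ratio(dist, mid) < target_ratio:
            lo = mid
        else:
            hi = mid
    return hi


def local_density(dist, d_c):
    """Cutoff-kernel density: number of other points closer than d_c."""
    n = len(dist)
    return [sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
            for i in range(n)]


def add_laplace_noise(rho, epsilon, sensitivity=1.0, rng=random):
    """Perturb each density with Laplace(0, sensitivity / epsilon) noise."""
    rate = epsilon / sensitivity
    # The difference of two i.i.d. exponentials is Laplace-distributed.
    return [r + rng.expovariate(rate) - rng.expovariate(rate) for r in rho]


def relative_distance(dist, rho):
    """Distance to the nearest denser point (row maximum for the peak)."""
    n = len(dist)
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    return delta


def chebyshev_outliers(values, k):
    """Indices whose value exceeds mean + k * std (assumed constraint).

    By Chebyshev's inequality, at most a 1/k^2 fraction of values can lie
    k or more standard deviations from the mean, so larger k is stricter.
    """
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)
    return [i for i, v in enumerate(values) if v > mu + k * sigma]


def select_centers(rho, delta, k_rho=1.0, k_delta=2.0):
    """Candidate centers: points unusually large in both rho and delta."""
    high_rho = set(chebyshev_outliers(rho, k_rho))
    return [i for i in chebyshev_outliers(delta, k_delta) if i in high_rho]
```

In this sketch a smaller `epsilon` means a larger noise scale and therefore stronger privacy at the cost of noisier densities, while `k_rho` and `k_delta` stand in for the adjustable constraint parameters the abstract mentions.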
Pages: 20
Related papers
50 in total
  • [41] Global Combination and Clustering Based Differential Privacy Mixed Data Publishing
    Chen, Lanxiang
    Zeng, Lingfang
    Mu, Yi
    Chen, Leilei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11437 - 11448
  • [42] Improved Collaborative Filtering Algorithm Incorporating User Information and Using Differential Privacy
    Ren, Jiahui
    Xu, Xian
    Yu, Huiqun
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2019, 2019, 1042 : 458 - 471
  • [43] A Clustering Algorithm for Tumor Gene Data Based on Improved DPC Algorithm
    Wang W.
    Gao B.
    International Journal Bioautomation, 2022, 26 (02) : 175 - 192
  • [44] Mass-Based Density Peaks Clustering Algorithm
    Ling, Ding
    Xiao, Xu
    INTELLIGENT INFORMATION PROCESSING IX, 2018, 538 : 40 - 48
  • [45] A density-based clustering algorithm for earthquake zoning
    Scitovski, Sanja
    COMPUTERS & GEOSCIENCES, 2018, 110 : 90 - 95
  • [46] A Stochastic Gradient Descent Algorithm Based on Adaptive Differential Privacy
    Deng, Yupeng
    Li, Xiong
    He, Jiabei
    Liu, Yuzhen
    Liang, Wei
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461 : 133 - 152
  • [47] A Frequent Itemsets Data Mining Algorithm Based on Differential Privacy
    Li, Qingpeng
    Zhang, Longjun
    Li, Haoyu
    Sun, Wenjun
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 251 - 253
  • [48] A Blockchain-Based Continuous Query Differential Privacy Algorithm
    Ouyang, Heng
    Lyu, Hongqin
    Long, Shigong
    Liu, Hai
    Ding, Hongfa
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT 2021, 2022, 13148 : 604 - 615
  • [49] Research on Improved Privacy Publishing Algorithm Based on Set Cover
    Lv, Haoze
    Liu, Zhaobin
    Hu, Zhonglian
    Nie, Lihai
    Liu, Weijiang
    Ye, Xinfeng
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2019, 16 (03) : 705 - 731
  • [50] Improved DPC Clustering Algorithm with Neighbor Density Distribution Optimized Sample Assignment
    Ji X.
    Zhang T.
    Zhu J.
    Liu S.
    Li X.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2019, 47 (02) : 98 - 105