Research on differential privacy preserving clustering algorithm based on spark platform

被引:0
作者
Meng Q. [1 ]
Zhou L. [1 ]
机构
[1] Department of Information Engineering College, Capital Normal University, Beijing
关键词
Differential evolution; Differential privacy; K-means; Opposition-based learning; Spark;
D O I
10.3966/199115992018012901005
中图分类号
学科分类号
摘要
Differential privacy is a kind of privacy protection model based on data distortion proposed by Dwork. As the model does not need to assume the prior knowledge of the attacker, it has been a research hot spot in the field of privacy protection. Aimed at the problem that the traditional differential privacy K-means algorithm is more sensitive to the selection of the initial center points, which reduces the usability of clustering results, an improved differential privacy preserving clustering algorithm (DEDP K-means) is proposed by introducing adaptive opposition-based learning technique and differential evolution algorithm. At the same time, the improved algorithm is parallelized based on the Spark platform. It was also demonstrated that the improved algorithm can optimize the selection of the initial centers, improve the usability of clustering results and have a good speedup when dealing with massive data by parallel experiments. © 2018 Computer Society of the Republic of China. All rights reserved.
引用
收藏
页码:47 / 62
页数:15
相关论文
共 50 条
  • [21] Privacy Preserving Distributed Cell-based K-means Clustering Algorithm
    Su, Fang
    Zu, Yun-xiao
    Li, Wei-hai
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 377 - 383
  • [22] A reversible privacy-preserving clustering technique based on k-means algorithm
    Lin, Chen-Yi
    [J]. APPLIED SOFT COMPUTING, 2020, 87
  • [23] Research on improved K - nearest neighbor algorithm based on spark platform
    Geng, Yushui
    Yan, Xianzhao
    [J]. PROCEEDINGS OF THE 2017 2ND JOINT INTERNATIONAL INFORMATION TECHNOLOGY, MECHANICAL AND ELECTRONIC ENGINEERING CONFERENCE (JIMEC 2017), 2017, 62 : 553 - 557
  • [24] Research on Ant Colony Clustering Algorithm Based on HADOOP Platform
    Wang, Zhihao
    Huo, Yonghua
    Wang, Junfang
    Zhao, Kang
    Yang, Yang
    [J]. COLLABORATE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2016, 2017, 201 : 514 - 520
  • [25] An Improved Density Peak Clustering Algorithm Based on Chebyshev Inequality and Differential Privacy
    Chen, Hua
    Zhou, Yuan
    Mei, Kehui
    Wang, Nan
    Tang, Mengdi
    Cai, Guangxing
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [26] A clustering-based differential privacy protection algorithm for weighted social networks
    Zhang L.
    Ge L.
    [J]. Mathematical Biosciences and Engineering, 2024, 21 (03) : 3755 - 3773
  • [27] Differential Privacy-Preserving Density Peaks Clustering Based on Shared Near Neighbors Similarity
    Sun, Liping
    Bao, Shuting
    Ci, Shang
    Zheng, Xiaoyao
    Guo, Liangmin
    Luo, Yonglong
    [J]. IEEE ACCESS, 2019, 7 (89427-89440) : 89427 - 89440
  • [28] A Quadtree Density Clustering Algorithm Under Differential Privacy
    Chen, Hongyu
    Li, Shuyu
    [J]. PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 65 - 69
  • [29] A Density Peaking Clustering Algorithm for Differential Privacy Preservation
    Chen, Hua
    Mei, Kehui
    Zhou, Yuan
    Wang, Nan
    Tang, Mengdi
    Cai, Guangxing
    [J]. IEEE ACCESS, 2023, 11 : 54240 - 54253
  • [30] GAPBAS: Genetic algorithm-based privacy budget allocation strategy in differential privacy K-means clustering algorithm
    Li, Yong
    Song, Xiao
    Tu, Yuchun
    Liu, Ming
    [J]. COMPUTERS & SECURITY, 2024, 139