Research on differential privacy preserving clustering algorithm based on Spark platform

Cited by: 0
Authors
Meng Q. [1 ]
Zhou L. [1 ]
Affiliation
[1] Department of Information Engineering College, Capital Normal University, Beijing
Keywords
Differential evolution; Differential privacy; K-means; Opposition-based learning; Spark
DOI
10.3966/199115992018012901005
Abstract
Differential privacy is a privacy protection model based on data distortion, proposed by Dwork. Because the model makes no assumptions about the attacker's prior knowledge, it has become a research hotspot in the field of privacy protection. The traditional differential privacy K-means algorithm is sensitive to the selection of the initial center points, which reduces the usability of the clustering results. To address this problem, an improved differential privacy preserving clustering algorithm (DEDP K-means) is proposed that introduces an adaptive opposition-based learning technique and a differential evolution algorithm. The improved algorithm is also parallelized on the Spark platform. Parallel experiments demonstrate that the improved algorithm optimizes the selection of the initial centers, improves the usability of the clustering results, and achieves good speedup when processing massive data. © 2018 Computer Society of the Republic of China. All rights reserved.
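For orientation, the baseline that the abstract calls the traditional differential privacy K-means algorithm can be sketched as below. This is a generic textbook-style illustration, not the DEDP K-means method of the paper: it contains no opposition-based learning, no differential evolution, and no Spark parallelization. The function name `dp_kmeans`, the even split of the privacy budget across iterations, and the assumption that all data are scaled to [0, 1] in every dimension are choices made only for this sketch.

```python
import numpy as np

def dp_kmeans(points, k, epsilon, iters=5, seed=0):
    """Toy differentially private k-means for data scaled to [0, 1]^d.

    Hypothetical helper for illustration; not the paper's algorithm.
    """
    rng = np.random.default_rng(seed)
    n, d = points.shape
    # Assumption: the total budget epsilon is split evenly over iterations.
    eps_iter = epsilon / iters
    # Adding or removing one point changes one cluster's count by 1 and each
    # of its d coordinate sums by at most 1, so the L1 sensitivity is d + 1.
    scale = (d + 1) / eps_iter
    # Data-independent random initial centers. The abstract notes that the
    # baseline is sensitive to this choice.
    centers = rng.random((k, d))
    for _ in range(iters):
        # Assign every point to its nearest current center.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            # Perturb the per-cluster sum and count with Laplace noise.
            noisy_sum = members.sum(axis=0) + rng.laplace(0.0, scale, size=d)
            noisy_count = len(members) + rng.laplace(0.0, scale)
            # Skip the update when the noisy count is too small to divide by.
            if noisy_count >= 1.0:
                centers[j] = np.clip(noisy_sum / noisy_count, 0.0, 1.0)
    return centers
```

Calling `dp_kmeans(data, k=3, epsilon=1.0)` on an array of shape `(n, d)` returns a `(3, d)` array of noisy centers. With a small `epsilon` the noise dominates and the centers drift away from the true cluster means, which is the usability loss the abstract refers to.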
Pages: 47-62
Number of pages: 15
Related papers
50 in total
  • [1] DPHKMS: An Efficient Hybrid Clustering Preserving Differential Privacy in Spark
    Gao, Zhi-Qiang
    Zhang, Long-Jun
    ADVANCES IN INTERNETWORKING, DATA & WEB TECHNOLOGIES, EIDWT-2017, 2018, 6 : 367 - 377
  • [2] Density Peak Clustering Algorithm Based on Differential Privacy Preserving
    Chen, Yun
    Du, Yunlan
    Cao, Xiaomei
    SCIENCE OF CYBER SECURITY, SCISEC 2019, 2019, 11933 : 20 - 32
  • [3] A-PAM Clustering Algorithm Based on Differential Privacy Preserving
    Shao, Rong-min
    Zhang, Lin
    Liu, Yan
    Huang, Da-guang
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE, MULTIMEDIA AND COMMUNICATION ENGINEERING (SMCE 2015), 2015, : 183 - 190
  • [4] An Improved Apriori Preserving Differential Privacy in the Framework of Spark
    Gao, Zhiqiang
    Zhang, Longjun
    Hu, Renyuan
    Li, Qingpeng
    Yang, Jihua
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 245 - 247
  • [5] Research on Retailer Data Clustering Algorithm Based on Spark
    Huang, Qiuman
    Zhou, Feng
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS I, 2017, 1820
  • [6] Affinity Propagation Clustering Algorithm based on Spark Platform
    Zhang, Lijia
    Cheng, Lianglun
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 532 - 535
  • [7] Differential privacy preserving clustering based on Haar wavelet transform
    Dishabi, Mohammad Reza Ebrahimi
    Azgomi, Mohammad Abdollahi
    INTELLIGENT DATA ANALYSIS, 2014, 18 (04) : 583 - 608
  • [8] Novel trajectory privacy-preserving method based on clustering using differential privacy
    Zhao, Xiaodong
    Pi, Dechang
    Chen, Junfu
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 149
  • [9] The Research on Privacy Preserving in Social Networking Based on Clustering Method
    Wang, Peng
    Hu, Teng
    Lin, Meitong
    Li, Songjiang
    Yang, Huamin
    2015 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND INTELLIGENT CONTROL (ISIC 2015), 2015, : 531 - 536
  • [10] A Spectral Clustering Algorithm Based on Differential Privacy Preservation
    Cui, Yuyang
    Wu, Huaming
    Zhang, Yongting
    Gao, Yonggang
    Wu, Xiang
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT III, 2022, 13157 : 397 - 410