Privacy-preserving Density-based Clustering

被引:10
|
作者
Bozdemir, Beyza [1 ]
Canard, Sebastien [2 ]
Ermis, Orhan [1 ]
Moellering, Helen [3 ]
Onen, Melek [1 ]
Schneider, Thomas [3 ]
机构
[1] EURECOM, Sophia Antipolis, France
[2] Orange Labs, Appl Crypto Grp, Caen, France
[3] Tech Univ Darmstadt, Darmstadt, Germany
基金
欧洲研究理事会; 欧盟地平线“2020”;
关键词
Private Machine Learning; Clustering; Secure Computation;
D O I
10.1145/3433210.3453104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is an unsupervised machine learning technique that outputs clusters containing similar data items. In this work, we investigate privacy-preserving density-based clustering which is, for example, used in financial analytics and medical diagnosis. When (multiple) data owners collaborate or outsource the computation, privacy concerns arise. To address this problem, we design, implement, and evaluate the first practical and fully private density-based clustering scheme based on secure two-party computation. Our protocol privately executes the DBSCAN algorithm without disclosing any information (including the number and size of clusters). It can be used for private clustering between two parties as well as for private outsourcing of an arbitrary number of data owners to two non-colluding servers. Our implementation of the DBSCAN algorithm privately clusters data sets with 400 elements in 7 minutes on commodity hardware. Thereby, it flexibly determines the number of required clusters and is insensitive to outliers, while being only factor 19x slower than today's fastest private K-means protocol (Mohassel et al., PETS'20) which can only be used for specific data sets. We then show how to transfer our newly designed protocol to related clustering algorithms by introducing a private approximation of the TRACLUS algorithm for trajectory clustering which has interesting real-world applications like financial time series forecasts and the investigation of the spread of a disease like COVID-19.
引用
收藏
页码:658 / 671
页数:14
相关论文
共 50 条
  • [1] Practical and Privacy-Preserving Density-Based Clustering via Shuffling
    Wang, Yingzhe
    Li, Hongwei
    Chen, Hanxiao
    Zhang, Xilin
    Hao, Meng
    Proceedings - IEEE Global Communications Conference, GLOBECOM, 2023, : 50 - 55
  • [2] The density-based clustering method for privacy-preserving data mining
    Wu, Jimmy Ming-Tai
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Djenouri, Youcef
    Chen, Chun-Hao
    Li, Zhongcui
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (03) : 1718 - 1728
  • [3] Practical and Privacy-Preserving Density-Based Clustering via Shuffling
    Wang, Yingzhe
    Li, Hongwei
    Chen, Hanxiao
    Zhang, Xilin
    Hao, Meng
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 50 - 55
  • [4] PPA-DBSCAN: Privacy-Preserving ρ-Approximate Density-Based Clustering
    Fu, Jiaxuan
    Cheng, Ke
    Chang, Zhao
    Shen, Yulong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (06) : 5324 - 5340
  • [5] A Density-based Space Filling Curve for Location Privacy-preserving
    Tian, Feng
    Gui, Xiaolin
    An, Jian
    Yang, Pan
    Zhang, Xuejun
    2014 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2014), 2014, : 131 - 138
  • [6] Density-based clustering with differential privacy
    Wu, Fuyu
    Du, Mingjing
    Zhi, Qiang
    INFORMATION SCIENCES, 2024, 681
  • [7] A new scheme for distributed density estimation based privacy-preserving clustering
    Su, Chunhua
    Bao, Feng
    Zhou, Jianying
    Takagi, Tsuyoshi
    Sakurai, Kouichi
    ARES 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON AVAILABILITY, SECURITY AND RELIABILITY, 2008, : 112 - +
  • [8] A Privacy-Preserving k-Means Clustering Algorithm using Secure Comparison Protocol and Density-based Center Point Selection
    Kim, Hyeong-Jin
    Chang, Jae-Woo
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 928 - 931
  • [9] Privacy-preserving distributed clustering
    Erkin, Zekeriya
    Veugen, Thijs
    Toft, Tomas
    Lagendijk, Reginald L.
    EURASIP JOURNAL ON INFORMATION SECURITY, 2013, (01):
  • [10] A privacy-preserving density peak clustering algorithm in cloud computing
    Sun, Liping
    Ci, Shang
    Liu, Xiaoqing
    Zheng, Xiaoyao
    Yu, Qingying
    Luo, Yonglong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (11):