Privacy-preserving Density-based Clustering

被引:10
|
作者
Bozdemir, Beyza [1 ]
Canard, Sebastien [2 ]
Ermis, Orhan [1 ]
Moellering, Helen [3 ]
Onen, Melek [1 ]
Schneider, Thomas [3 ]
机构
[1] EURECOM, Sophia Antipolis, France
[2] Orange Labs, Appl Crypto Grp, Caen, France
[3] Tech Univ Darmstadt, Darmstadt, Germany
基金
欧洲研究理事会; 欧盟地平线“2020”;
关键词
Private Machine Learning; Clustering; Secure Computation;
D O I
10.1145/3433210.3453104
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is an unsupervised machine learning technique that outputs clusters containing similar data items. In this work, we investigate privacy-preserving density-based clustering which is, for example, used in financial analytics and medical diagnosis. When (multiple) data owners collaborate or outsource the computation, privacy concerns arise. To address this problem, we design, implement, and evaluate the first practical and fully private density-based clustering scheme based on secure two-party computation. Our protocol privately executes the DBSCAN algorithm without disclosing any information (including the number and size of clusters). It can be used for private clustering between two parties as well as for private outsourcing of an arbitrary number of data owners to two non-colluding servers. Our implementation of the DBSCAN algorithm privately clusters data sets with 400 elements in 7 minutes on commodity hardware. Thereby, it flexibly determines the number of required clusters and is insensitive to outliers, while being only factor 19x slower than today's fastest private K-means protocol (Mohassel et al., PETS'20) which can only be used for specific data sets. We then show how to transfer our newly designed protocol to related clustering algorithms by introducing a private approximation of the TRACLUS algorithm for trajectory clustering which has interesting real-world applications like financial time series forecasts and the investigation of the spread of a disease like COVID-19.
引用
收藏
页码:658 / 671
页数:14
相关论文
共 50 条
  • [21] Privacy-Preserving DBSCAN Clustering Algorithm Based on Negative Database
    Zhang, Mingkun
    Liao, Hucheng
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 209 - 213
  • [22] A privacy-preserving vehicle trajectory clustering framework
    Tian, Ran
    Gao, Pulun
    Liu, Yanxing
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (07) : 988 - 1002
  • [23] Privacy-preserving evaluation for support vector clustering
    Byun, J.
    Lee, J.
    Park, S.
    ELECTRONICS LETTERS, 2021, 57 (02) : 61 - 64
  • [24] PRIVACY-PRESERVING USER CLUSTERING IN A SOCIAL NETWORK
    Erkin, Zekeriya
    Veugen, Thijs
    Toft, Tomas
    Lagendijk, Reginald L.
    2009 FIRST IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2009, : 96 - +
  • [25] Communication-Efficient Privacy-Preserving Clustering
    Jagannathan, Geetha
    Pillaipakkamnatt, Krishnan
    Wright, Rebecca N.
    Umano, Daryl
    TRANSACTIONS ON DATA PRIVACY, 2010, 3 (01) : 2 - 26
  • [26] Privacy-preserving location-based traffic density monitoring
    Wu, Lei
    Wei, Xia
    Meng, Lingzhen
    Zhao, Shengnan
    Wang, Hao
    CONNECTION SCIENCE, 2022, 34 (01) : 874 - 894
  • [27] Differential Privacy-Preserving Recommendation Algorithm Based on Bhattacharyya Coefficient Clustering
    Wang Y.
    Yin E.-M.
    Ran X.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (02): : 81 - 88
  • [28] Density-based clustering
    Campello, Ricardo J. G. B.
    Kroeger, Peer
    Sander, Jorg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (02)
  • [29] Density-based clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Sander, Joerg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) : 231 - 240
  • [30] A clustering-based anonymization approach for privacy-preserving in the healthcare cloud
    Abbasi, Afsoon
    Mohammadi, Behnaz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01):