Differentially Private Analysis of Outliers

被引:8
|
作者
Okada, Rina [1 ]
Fukuchi, Kazuto [1 ]
Sakuma, Jun [1 ]
机构
[1] Univ Tsukuba, Tsukuba, Ibaraki 3058577, Japan
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II | 2015年 / 9285卷
关键词
Differential privacy; Outlier detection; Smooth sensitivity; NOISE;
D O I
10.1007/978-3-319-23525-7_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an investigation of differentially private analysis of distance-based outliers. Outlier detection aims to identify instances that are apparently distant from other instances. Meanwhile, the objective of differential privacy is to conceal the presence (or absence) of any particular instance. Outlier detection and privacy protection are therefore intrinsically conflicting tasks. In this paper, we present differentially private queries for counting outliers that appear in a given subspace, instead of reporting the outliers detected. Our analysis of the global sensitivity of outlier counts reveals that regular global sensitivity-based methods can make the outputs too noisy, particularly when the dimensionality of the given subspace is high. Noting that the counts of outliers are typically expected to be small compared to the number of data, we introduce a mechanism based on the smooth upper bound of the local sensitivity. This study is the first trial to ensure differential privacy for distance-based outlier analysis. The experimentally obtained results show that our method achieves better utility than global sensitivity-based methods do.
引用
收藏
页码:458 / 473
页数:16
相关论文
共 50 条
  • [31] Differentially Private Outlier Detection in Multivariate Gaussian Signals
    Degue, Kwassi H.
    Gopalakrishnan, Karthik
    Li, Max Z.
    Balakrishnan, Hamsa
    Le Ny, Jerome
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3626 - 3631
  • [32] Stochastic Adaptive Line Search for Differentially Private Optimization
    Chen, Chen
    Lee, Jaewoo
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1011 - 1020
  • [33] Differentially Private Data Release over Multiple Tables
    Ghazi, Badih
    Hu, Xiao
    Kumar, Ravi
    Manurangsi, Pasin
    PROCEEDINGS OF THE 42ND ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, PODS 2023, 2023, : 207 - 219
  • [34] Comparative Study of Differentially Private Data Synthesis Methods
    Bowen, Claire McKay
    Liu, Fang
    STATISTICAL SCIENCE, 2020, 35 (02) : 280 - 307
  • [35] Differentially private human activity recognition for smartphone users
    Avishek Garain
    Rudrajit Dawn
    Saswat Singh
    Chandreyee Chowdhury
    Multimedia Tools and Applications, 2022, 81 : 40827 - 40848
  • [36] A Differentially Private Kernel Two-Sample Test
    Raj, Anant
    Law, Ho Chung Leon
    Sejdinovic, Dino
    Park, Mijung
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 : 697 - 724
  • [37] Utility-preserving differentially private skyline query
    Lan, Qiujun
    Ma, Jiaqi
    Yan, Ziqi
    Li, Gang
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [38] Differentially private human activity recognition for smartphone users
    Garain, Avishek
    Dawn, Rudrajit
    Singh, Saswat
    Chowdhury, Chandreyee
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 40827 - 40848
  • [39] Cooperative Differentially Private LQG Control With Measurement Aggregation
    Degue, Kwassi H.
    Le Ny, Jerome
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 1093 - 1098
  • [40] Improving the Number of Queries Supported by Differentially Private Mechanisms
    Huang, Wen
    Zhou, Shijie
    Liao, Yongjian
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 65 - 71