Differentially Private Analysis of Outliers

Cited by: 8
Authors
Okada, Rina [1]
Fukuchi, Kazuto [1]
Sakuma, Jun [1]
Affiliation
[1] Univ Tsukuba, Tsukuba, Ibaraki 3058577, Japan
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II | 2015, Vol. 9285
Keywords
Differential privacy; Outlier detection; Smooth sensitivity; NOISE;
DOI
10.1007/978-3-319-23525-7_28
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper investigates differentially private analysis of distance-based outliers. Outlier detection aims to identify instances that are apparently distant from other instances. Meanwhile, the objective of differential privacy is to conceal the presence (or absence) of any particular instance. Outlier detection and privacy protection are therefore intrinsically conflicting tasks. In this paper, we present differentially private queries for counting outliers that appear in a given subspace, instead of reporting the outliers detected. Our analysis of the global sensitivity of outlier counts reveals that regular global sensitivity-based methods can make the outputs too noisy, particularly when the dimensionality of the given subspace is high. Noting that the count of outliers is typically expected to be small compared to the number of data points, we introduce a mechanism based on the smooth upper bound of the local sensitivity. This study is the first attempt to ensure differential privacy for distance-based outlier analysis. The experimental results show that our method achieves better utility than global sensitivity-based methods.
Pages: 458-473
Number of pages: 16
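
The abstract contrasts a global sensitivity-based baseline with the paper's smooth-sensitivity mechanism. As a rough illustration of the baseline only, the following Python sketch counts distance-based outliers and perturbs the count with Laplace noise. Everything in it is an assumption made for illustration: the outlier definition (a point with fewer than `min_neighbors` other points within `radius`), the function names, and the `sensitivity` argument, which the caller must supply. The record does not give the paper's sensitivity bound or its smooth-sensitivity mechanism, so neither is reproduced here.

```python
import math
import random


def count_distance_outliers(points, radius, min_neighbors):
    """Count points that have fewer than `min_neighbors` other points
    within Euclidean distance `radius` (an assumed outlier definition)."""
    count = 0
    for i, p in enumerate(points):
        neighbors = sum(
            1
            for j, q in enumerate(points)
            if i != j and math.dist(p, q) <= radius
        )
        if neighbors < min_neighbors:
            count += 1
    return count


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two
    independent exponential variates with mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_outlier_count(points, radius, min_neighbors, epsilon, sensitivity, rng=None):
    """Laplace mechanism: true count plus noise of scale sensitivity / epsilon.

    `sensitivity` is a caller-supplied assumption; the privacy guarantee
    holds only if it really bounds the global sensitivity of the count.
    """
    if epsilon <= 0 or sensitivity <= 0:
        raise ValueError("epsilon and sensitivity must be positive")
    rng = rng or random.Random()
    true_count = count_distance_outliers(points, radius, min_neighbors)
    return true_count + laplace_noise(sensitivity / epsilon, rng)


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0)]
    print(count_distance_outliers(data, radius=2.0, min_neighbors=1))
    print(noisy_outlier_count(data, 2.0, 1, epsilon=1.0, sensitivity=1.0))
```

Because the noise scale is `sensitivity / epsilon`, a large global sensitivity bound translates directly into a noisier count, which is the utility problem the abstract attributes to global sensitivity-based methods.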