Differentially Private Analysis of Outliers

被引:8
|
作者
Okada, Rina [1 ]
Fukuchi, Kazuto [1 ]
Sakuma, Jun [1 ]
机构
[1] Univ Tsukuba, Tsukuba, Ibaraki 3058577, Japan
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II | 2015年 / 9285卷
关键词
Differential privacy; Outlier detection; Smooth sensitivity; NOISE;
D O I
10.1007/978-3-319-23525-7_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an investigation of differentially private analysis of distance-based outliers. Outlier detection aims to identify instances that are apparently distant from other instances. Meanwhile, the objective of differential privacy is to conceal the presence (or absence) of any particular instance. Outlier detection and privacy protection are therefore intrinsically conflicting tasks. In this paper, we present differentially private queries for counting outliers that appear in a given subspace, instead of reporting the outliers detected. Our analysis of the global sensitivity of outlier counts reveals that regular global sensitivity-based methods can make the outputs too noisy, particularly when the dimensionality of the given subspace is high. Noting that the counts of outliers are typically expected to be small compared to the number of data, we introduce a mechanism based on the smooth upper bound of the local sensitivity. This study is the first trial to ensure differential privacy for distance-based outlier analysis. The experimentally obtained results show that our method achieves better utility than global sensitivity-based methods do.
引用
收藏
页码:458 / 473
页数:16
相关论文
共 50 条
  • [1] Differentially Private Nonlinear Canonical Correlation Analysis
    Shen, Yanning
    2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
  • [2] Differentially Private Linear Regression Analysis via Truncating Technique
    Liu, Yifei
    Wang, Ning
    Wang, Zhigang
    Wang, Xiaodong
    Gao, Yun
    Ji, Xiaopeng
    Wei, Zhiqiang
    Qiao, Jun
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 249 - 260
  • [3] DIFFERENTIALLY-PRIVATE CANONICAL CORRELATION ANALYSIS
    Imtiaz, Hafiz
    Sarwate, Anand D.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 283 - 287
  • [4] DIFFERENTIALLY PRIVATE DISTRIBUTED PRINCIPAL COMPONENT ANALYSIS
    Imtiaz, Hafiz
    Sarwate, Anand D.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2206 - 2210
  • [5] Investigating Visual Analysis of Differentially Private Data
    Zhang, Dan
    Sarvghad, Ali
    Miklau, Gerome
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1786 - 1796
  • [6] Differentially-Private Network Trace Analysis
    McSherry, Frank
    Mahajan, Ratul
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) : 123 - 134
  • [7] Differentially Private Data Publishing and Analysis: A Survey
    Zhu, Tianqing
    Li, Gang
    Zhou, Wanlei
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (08) : 1619 - 1638
  • [8] Differentially private block coordinate descent
    Riaz, Shazia
    Ali, Saqib
    Wang, Guojun
    Anees, Asad
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (01) : 283 - 295
  • [9] Differentially Private Naive Bayes Classification
    Vaidya, Jaideep
    Basu, Anirban
    Shafiq, Basit
    Hong, Yuan
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2013, : 571 - 576
  • [10] DIFFERENTIALLY PRIVATE LEARNING OF GEOMETRIC CONCEPTS
    Kaplan, Haim
    Mansour, Yishay
    Matias, Yossi
    Stemmer, Uri
    SIAM JOURNAL ON COMPUTING, 2022, 51 (04) : 952 - 974