Differentially private random decision forests using smooth sensitivity

被引:52
作者
Fletcher, Sam [1 ]
Islam, Md Zahidul [1 ]
机构
[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW, Australia
关键词
Privacy; Data mining; Decision tree; Decision forest; Differential privacy; Smooth sensitivity; ALGORITHM;
D O I
10.1016/j.eswa.2017.01.034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new differentially-private decision forest algorithm that minimizes both the number of queries required, and the sensitivity of those queries. To do so, we build an ensemble of random decision trees that avoids querying the private data except to find the majority class label in the leaf nodes. Rather than using a count query to return the class counts like the current state-of-the-art, we use the Exponential Mechanism to only output the class label itself. This drastically reduces the sensitivity of the query - often by several orders of magnitude - which in turn reduces the amount of noise that must be added to preserve privacy. Our improved sensitivity is achieved by using "smooth sensitivity", which takes into account the specific data used in the query rather than assuming the worst-case scenario. We also extend work done on the optimal depth of random decision trees to handle continuous features, not just discrete features. This, along with several other improvements, allows us to create a differentially private decision forest with substantially higher predictive power than the current state-of-the-art. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 31
页数:16
相关论文
共 47 条
  • [1] Agrawal R, 2000, SIGMOD REC, V29, P439, DOI 10.1145/335191.335438
  • [2] [Anonymous], J KING SAUD U COMPUT
  • [3] [Anonymous], 1948, UN DECL HUM RIGTS
  • [4] [Anonymous], 2011, Pei. data mining concepts and techniques, DOI 10.1016/C2009-0-61819-5
  • [5] [Anonymous], 2014, J COMPUT SCI-NETH
  • [6] [Anonymous], 2013, ALGORITHMIC FDN DIFF
  • [7] Bache K., 2013, UCI Machine Learning Repository
  • [8] Bhaskar R., 2010, KDD, P503, DOI DOI 10.1145/1835804.1835869
  • [9] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [10] A Learning Theory Approach to Noninteractive Database Privacy
    Blum, Avrim
    Ligett, Katrina
    Roth, Aaron
    [J]. JOURNAL OF THE ACM, 2013, 60 (02)