Differentially Private Naive Bayes Classification

被引:74
作者
Vaidya, Jaideep [1 ]
Basu, Anirban [2 ]
Shafiq, Basit [3 ]
Hong, Yuan [4 ]
机构
[1] Rutgers State Univ, 1 Washington Pk, Newark, NJ 07102 USA
[2] KDDI R&D Lab Inc, Saitama 3568502, Japan
[3] Lahore Univ Management Sci, Lahore 54792, Pakistan
[4] SUNY Albany, Albany, NY 12222 USA
来源
2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1 | 2013年
关键词
Differential Privacy; Naive Bayes Classification; NOISE;
D O I
10.1109/WI-IAT.2013.80
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Privacy and security concerns often prevent the sharing of users' data or even of the knowledge gained from it, thus deterring valuable information from being utilized. Privacy-preserving knowledge discovery, if done correctly, can alleviate this problem. One of the most important and widely used data mining techniques is that of classification. We consider the model where a single provider has centralized access to a dataset and would like to release a classifier while protecting privacy to the best extent possible. Recently, the model of differential privacy has been developed which provides a strong privacy guarantee even if adversaries hold arbitrary prior knowledge. In this paper, we apply this rigorous privacy model to develop a Naive Bayes classifier, which is often used as a baseline and consistently provides reasonable classification performance. We experimentally evaluate the proposed approach, and discuss how it could be potentially deployed in PaaS clouds.
引用
收藏
页码:571 / 576
页数:6
相关论文
共 22 条
[1]  
Agrawal R, 2000, SIGMOD REC, V29, P439, DOI 10.1145/335191.335438
[2]  
[Anonymous], 2008, ACM T KNOWL DISCOV D, DOI DOI 10.1145/1409620.1409624
[3]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[4]  
Bhaskar R., 2010, KDD, P503, DOI DOI 10.1145/1835804.1835869
[5]  
Blake C. L., 1998, Uci repository of machine learning databases
[6]  
Cormode G., 2011, P 17 ACM SIGKDD INT, P1253
[7]  
Duan Yitao, 2009, Proceedings of the 18th ACM conference on Information and knowledge management, P1517
[8]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
[9]  
Dwork C, 2006, LECT NOTES COMPUT SC, V4004, P486
[10]   Calibrating noise to sensitivity in private data analysis [J].
Dwork, Cynthia ;
McSherry, Frank ;
Nissim, Kobbi ;
Smith, Adam .
THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284