DiffPRFs: Random forest under differential privacy

被引:0
作者
Mu H.-R. [1 ]
Ding L.-P. [1 ]
Song Y.-N. [1 ]
Lu G.-Q. [1 ]
机构
[1] National Engineering Research Center of Fundamental Software, Institute of Software, Chinese Academy of Sciences, Beijing
来源
| 1600年 / Editorial Board of Journal on Communications卷 / 37期
关键词
Data mining; Differential privacy; Privacy protection; Random forest;
D O I
10.11959/j.issn.1000-436x.2016169
中图分类号
学科分类号
摘要
A differential privacy algorithm DiffPRFs based on random forests was proposed. Exponential mechanism was used to select split point and split attribute in each decision tree building process, and noise was added according to Laplace mechanism. Differential privacy protection requirement was satisfied through overall process. Compared to existed algorithms, the proposed method does not require pre-discretization of continuous attributes which significantly reduces the performance cost of preprocessing in large multi-dimensional dataset. Classification is achieved conveniently and efficiently while maintains the high accuracy. Experimental results demonstrate the effectiveness and superiority of the algorithm compared to other classification algorithms. © 2016, Editorial Board of Journal on Communications. All right reserved.
引用
收藏
页码:175 / 182
页数:7
相关论文
共 16 条
  • [1] Dwork C., Differential privacy, The 33rd International Colloquium on Automata, Languages and Programming, pp. 1-12, (2006)
  • [2] Dwork C., A firm foundation for private data analysis, Communications of the ACM, 54, 1, pp. 86-95, (2011)
  • [3] Zhang X.J., Meng X.F., Differential privacy in data publication and analysis, Chinese Journal of Computers, 37, 4, pp. 927-949, (2014)
  • [4] Dwork C., Mcsherry F., Nissim K., Et al., Calibrating noise to sensitivity in private data analysis, Theory of Cryptography, pp. 265-284, (2006)
  • [5] Mcsherry F., Talwar K., Mechanism design via differential privacy, Foundations of Computer Science, pp. 94-103, (2007)
  • [6] Fan M., Meng X.F., Data Minify: Concepts and Techniques, (2012)
  • [7] Blum A., Dwork C., Mcsherry F., Et al., Practical privacy: the SuLQ framework, The 24th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp. 128-138, (2005)
  • [8] Friedman A., Schuster A., Data mining with differential privacy, The 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 493-502, (2010)
  • [9] Mohammed N., Chen R., Fung B., Et al., Differentially private data release for data mining, The 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 493-501, (2011)
  • [10] Zhu T., Xiong P., Xiang Y., Et al., An effective deferentially private data releasing algorithm for decision tree, Trust, Security and Privacy in Computing and Communications (TrustCom), 2013 12th IEEE International Conference, pp. 388-395, (2013)