A Differentially Private (Random) Decision Tree without Noise from k-Anonymity

被引:0
作者
Waseda, Atsushi [1 ]
Nojima, Ryo [2 ,3 ]
Wang, Lihua [3 ]
机构
[1] Tokyo Univ Informat Sci, Fac Informat, Dept Informat, 4-1 Onaridai,Wakaba Ku, Chiba 2658501, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, 2-150 Iwakura Cho, Ibaraki, 5678570, Japan
[3] Natl Inst Informat & Commun Technol NICT, Cybersecur Res Inst, 4-2-1 Nukui Kitamachi, Koganei 1848795, Japan
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 17期
关键词
privacy preservation; (random) decision tree; k-anonymity; & ell; -diversity; differential privacy;
D O I
10.3390/app14177625
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper focuses on the relationship between decision trees, a typical machine learning method, and data anonymization. It is known that information leaked from trained decision trees can be evaluated using well-studied data anonymization techniques and that decision trees can be strengthened using k-anonymity and & ell;-diversity; unfortunately, however, this does not seem sufficient for differential privacy. In this paper, we show how one might apply k-anonymity to a (random) decision tree, which is a variant of the decision tree. Surprisingly, this results in differential privacy, which means that security is amplified from k-anonymity to differential privacy without the addition of noise.
引用
收藏
页数:15
相关论文
共 22 条
[1]  
[Anonymous], 1987, Mushroom UCI Machine Learning Repository
[2]  
[Anonymous], 2007, ACM T KNOWL DISCOV D, DOI [10.1145/1217299.1217302, DOI 10.1145/1217299.1217302]
[3]  
Becker Barry, 1996, Adult. UCI Machine Learning Repository
[4]  
Blum A., 2005, P 24 ACM SIGMOD SIGA, P128, DOI [DOI 10.1145/1065167.1065184, 10.1145/1065167.1065184]
[5]  
Ciriani V, 2008, ADV DATABASE SYST, V34, P105
[6]   The Algorithmic Foundations of Differential Privacy [J].
Dwork, Cynthia ;
Roth, Aaron .
FOUNDATIONS AND TRENDS IN THEORETICAL COMPUTER SCIENCE, 2013, 9 (3-4) :211-406
[7]  
Fan W, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P51
[8]   Decision Tree Classification with Differential Privacy: A Survey [J].
Fletcher, Sam ;
Islam, Md Zahidul .
ACM COMPUTING SURVEYS, 2019, 52 (04)
[9]   A Differentially Private Random Decision Forest Using Reliable Signal-to-Noise Ratios [J].
Fletcher, Sam ;
Islam, Md Zahidul .
AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 :192-203
[10]  
Friedman A., 2010, P 16 ACM SIGKDD INT, P493, DOI [DOI 10.1145/1835804.1835868, 10.1145/1835804.1835868]