A differential privacy preserving algorithm for greedy decision tree

被引:0
作者
Yang, Shudan [1 ]
Li, Nan [1 ]
Sun, Daozhu [1 ]
Du, Qiming [1 ]
Liu, Wenfu [1 ]
机构
[1] Strateg Support Force Informat Engn Univ, Coll Cyberspace Secur, State Key Lab Math Engn & Adv Comp, Zhengzhou, Henan, Peoples R China
来源
2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021) | 2021年
基金
中国国家自然科学基金;
关键词
Differential privacy; Decision tree; Random forest; Budget allocation;
D O I
10.1109/ICBASE53849.2021.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the contradiction between data application and privacy protection has become increasingly prominent, and differential privacy is considered an effective technology to resolve this contradiction. Therefore, we propose a differential privacy protection algorithm for greedy decision trees. Firstly, the privacy budget is allocated according to the structural characteristics of the tree. Secondly, the algorithm meets c-differential privacy by adding Laplacian perturbation to the leaf nodes and exponential mechanism perturbation to the intermediate nodes. Finally, The integration strategy is used to construct random forest to avoid the instability of single tree. Experimental results show that this algorithm achieves a better balance between data availability and invisibility compared with other algorithms.
引用
收藏
页码:229 / 237
页数:9
相关论文
共 20 条
  • [1] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [2] Borhan N, BUDGET ALLOCATION DI
  • [3] Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1007/BF00058655
  • [4] A novel framework of fuzzy oblique decision tree construction for pattern classification
    Cai, Yuliang
    Zhang, Huaguang
    He, Qiang
    Duan, Jie
    [J]. APPLIED INTELLIGENCE, 2020, 50 (09) : 2959 - 2975
  • [5] Cordero AsunciA3n, 2020, APPL INTELL, DOI [10.1007/s10489-020-01765-6, DOI 10.1007/S10489-020-01765-6]
  • [6] Predicting breast cancer survivability: a comparison of three data mining methods
    Delen, D
    Walker, G
    Kadam, A
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 34 (02) : 113 - 127
  • [7] Dwork C, 2006, LECT NOTES COMPUT SC, V4052, P1
  • [8] Effective privacy preserving data publishing by vectorization
    Eom, Chris Soo-Hyun
    Lee, Charles Cheolgi
    Lee, Wookey
    Leung, Carson K.
    [J]. INFORMATION SCIENCES, 2020, 527 : 311 - 328
  • [9] Fletcher S., 2015, P 13 AUSTR DAT MIN C
  • [10] Differentially private random decision forests using smooth sensitivity
    Fletcher, Sam
    Islam, Md Zahidul
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 78 : 16 - 31