PrivateCTGAN: Adapting GAN for Privacy-Aware Tabular Data Sharing

被引:0
作者
Lopes, Frederico [1 ]
Soares, Carlos [1 ,2 ,3 ]
Cortez, Paulo [4 ]
机构
[1] Univ Porto, Faculdade Engenharia, Porto, Portugal
[2] Fraunhofer AICOS Portugal, Porto, Portugal
[3] Lab Artificial Intelligence & Comp Sci LIACC, Porto, Portugal
[4] Univ Minho, Dept Informat Syst, ALGORITMI Ctr LASI, Guimaraes, Portugal
来源
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT II | 2025年 / 2134卷
关键词
Privacy; Data generation; Generative Adversarial Network; Synthetic Data;
D O I
10.1007/978-3-031-74627-7_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research addresses the challenge of generating synthetic data that resembles real-world data while preserving privacy. With privacy laws protecting sensitive information such as healthcare data, accessing sufficient training data becomes difficult, resulting in an increased difficulty in training Machine Learning models and in overall worst models. Recently, there has been an increased interest in the usage of Generative Adversarial Networks (GAN) to generate synthetic data since they enable researchers to generate more data to train their models. GANs, however, may not be suitable for privacy-sensitive data since they have no concern for the privacy of the generated data. We propose modifying the known Conditional Tabular GAN (CTGAN) model by incorporating a privacy-aware loss function, thus resulting in the Private CTGAN (PCTGAN) method. Several experiments were carried out using 10 public domain classification datasets and comparing PCTGAN with CTGAN and the state-of-the-art privacy-preserving model, the Differential Privacy CTGAN (DP-CTGAN). The results demonstrated that PCTGAN enables users to fine-tune the privacy fidelity trade-off by leveraging parameters, as well as that if desired, a higher level of privacy.
引用
收藏
页码:169 / 180
页数:12
相关论文
共 50 条
  • [41] Privacy-Aware Job Submission in the Cloud
    Al-Dulaimy, Auday
    Itani, Wassim
    Shamseddine, Maha
    Taheri, Javid
    2019 2ND IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (IEEEMENACOMM'19), 2019, : 204 - 209
  • [42] Privacy-Aware Smart Metering: A Survey
    Finster, Soeren
    Baumgart, Ingmar
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (02): : 1088 - 1101
  • [43] A privacy-aware framework for targeted advertising
    Wang, Wei
    Yang, Linlin
    Chen, Yanjiao
    Zhang, Qian
    COMPUTER NETWORKS, 2015, 79 : 17 - 29
  • [44] Privacy-Aware Distributed Hypothesis Testing
    Sreekumar, Sreejith
    Cohen, Asaf
    Gunduz, Deniz
    ENTROPY, 2020, 22 (06)
  • [45] Discrimination- and privacy-aware patterns
    Hajian, Sara
    Domingo-Ferrer, Josep
    Monreale, Anna
    Pedreschi, Dino
    Giannotti, Fosca
    DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (06) : 1733 - 1782
  • [46] GANs for Privacy-Aware Mobility Modeling
    Fontana, Ivan
    Langheinrich, Marc
    Gjoreski, Martin
    IEEE ACCESS, 2023, 11 : 29250 - 29262
  • [47] Privacy-Aware Authentication in the Internet of Things
    Gross, Hannes
    Hoelbl, Marko
    Slamanig, Daniel
    Spreitzer, Raphael
    CRYPTOLOGY AND NETWORK SECURITY, CANS 2015, 2015, 9476 : 32 - 39
  • [48] Privacy-aware Cloud Services Composition
    Ghafour, Samer Abdul
    Ghodous, Parisa
    Bonnet, Christine
    2015 IEEE 24TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, 2015, : 140 - 142
  • [49] A Data-Centric Approach for Privacy-Aware Business Process Enablement
    Short, Stuart
    Kaluvuri, Samuel Paul
    ENTERPRISE INTEROPERABILITY, 2011, 76 : 191 - 203
  • [50] TIMED PRIVACY-AWARE BUSINESS PROTOCOLS
    Mokhtari-Aslaoui, Karima
    Benbernou, Salima
    Sahri, Soror
    Andrikopoulos, Vasilios
    Leymann, Frank
    Hacid, Mohand-Said
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2012, 21 (02) : 85 - 109