PrivateCTGAN: Adapting GAN for Privacy-Aware Tabular Data Sharing

被引:0
作者
Lopes, Frederico [1 ]
Soares, Carlos [1 ,2 ,3 ]
Cortez, Paulo [4 ]
机构
[1] Univ Porto, Faculdade Engenharia, Porto, Portugal
[2] Fraunhofer AICOS Portugal, Porto, Portugal
[3] Lab Artificial Intelligence & Comp Sci LIACC, Porto, Portugal
[4] Univ Minho, Dept Informat Syst, ALGORITMI Ctr LASI, Guimaraes, Portugal
来源
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT II | 2025年 / 2134卷
关键词
Privacy; Data generation; Generative Adversarial Network; Synthetic Data;
D O I
10.1007/978-3-031-74627-7_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research addresses the challenge of generating synthetic data that resembles real-world data while preserving privacy. With privacy laws protecting sensitive information such as healthcare data, accessing sufficient training data becomes difficult, resulting in an increased difficulty in training Machine Learning models and in overall worst models. Recently, there has been an increased interest in the usage of Generative Adversarial Networks (GAN) to generate synthetic data since they enable researchers to generate more data to train their models. GANs, however, may not be suitable for privacy-sensitive data since they have no concern for the privacy of the generated data. We propose modifying the known Conditional Tabular GAN (CTGAN) model by incorporating a privacy-aware loss function, thus resulting in the Private CTGAN (PCTGAN) method. Several experiments were carried out using 10 public domain classification datasets and comparing PCTGAN with CTGAN and the state-of-the-art privacy-preserving model, the Differential Privacy CTGAN (DP-CTGAN). The results demonstrated that PCTGAN enables users to fine-tune the privacy fidelity trade-off by leveraging parameters, as well as that if desired, a higher level of privacy.
引用
收藏
页码:169 / 180
页数:12
相关论文
共 50 条
  • [31] Optimal Privacy-Aware Estimation
    Nekouei, Ehsan
    Sandberg, Henrik
    Skoglund, Mikael
    Johansson, Karl Henrik
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (05) : 2253 - 2266
  • [32] On privacy-aware eScience workflows
    Khalid Belhajjame
    Noura Faci
    Zakaria Maamar
    Vanilson Burégio
    Edvan Soares
    Mahmoud Barhamgi
    Computing, 2020, 102 : 1171 - 1185
  • [33] Privacy-Aware Traffic Monitoring
    Xie, Hairuo
    Kulik, Lars
    Tanin, Egemen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2010, 11 (01) : 61 - 70
  • [34] Privacy-Aware Contact Sharing for Groups in P2P Networks
    Meskanen, Tommi
    Kuusijarvi, Jarkko
    Niemi, Valtteri
    INTERNATIONAL JOURNAL OF EMBEDDED AND REAL-TIME COMMUNICATION SYSTEMS (IJERTCS), 2021, 12 (04): : 36 - 54
  • [35] Privacy-aware Linked Widgets
    Fernandez, Javier D.
    Ekaputra, Fajar J.
    Aryan, Peb Ruswono
    Kiesling, Elmar
    Azzam, Amr
    COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, : 508 - 514
  • [36] Privacy as a Service: Privacy-Aware Data Storage and Processing in Cloud Computing Architectures
    Itani, Wassim
    Kayssi, Ayman
    Chehab, Ali
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, PROCEEDINGS, 2009, : 711 - 716
  • [37] A privacy-aware data dissemination scheme for smart grid with abnormal data traceability
    Zhang, Xu
    Wen, Mi
    Lu, Kejie
    Lei, Jingsheng
    COMPUTER NETWORKS, 2017, 117 : 32 - 41
  • [38] The Empirical Implications of Privacy-Aware Choice
    Cummings, Rachel
    Echenique, Federico
    Wierman, Adam
    OPERATIONS RESEARCH, 2016, 64 (01) : 67 - 78
  • [39] Privacy-Aware Collaborative Spam Filtering
    Li, Kang
    Zhong, Zhenyu
    Ramaswamy, Lakshmish
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (05) : 725 - 739
  • [40] Towards Privacy-Aware Smart Surveillance
    Shirima, Emil
    Ghazinour, Kambiz
    FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2019, 2020, 12056 : 398 - 406