Synthetic Data Digital Twins and Data Trusts Control for Privacy in Health Data Sharing

Cited by: 0
Authors
Lomotey, Richard K. [1]
Kumi, Sandra [2]
Ray, Madhurima [3]
Deters, Ralph [2]
Affiliations
[1] Penn State Univ, Informat Sci & Tech, Monaca, PA 15061 USA
[2] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[3] Penn State Univ, Dept Comp Sci, Monaca, PA USA
Source
PROCEEDINGS OF THE 2024 ACM WORKSHOP ON SECURE AND TRUSTWORTHY CYBER-PHYSICAL SYSTEMS, SAT-CPS 2024 | 2024
Keywords
Synthetic Health Data; Digital Twins; Data Trusts; Machine Learning; Artificial Intelligence; Privacy; Middleware; FRAMEWORK;
DOI
10.1145/3643650.3658605
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Health data sharing is highly valuable for medical research because it can improve diagnostics, policy, medication, and more. At the same time, health data must be shared without compromising the privacy of patients and stakeholders. However, recent advances in AI/ML and sophisticated analytics have been shown to introduce biases that make it easy to identify patients from their healthcare data, which violates privacy. In this work, we seek to address this major issue by exploring two emerging topics that are gaining attention from industry, academia, and governments: digital twins and data trusts. First, we propose the use of digital twins (DTs) to generate synthetic records of patient heart rate data. The DTs are virtual replicas of the actual data and were created using two synthetic data generative models: Gaussian Copula (GC) and Tabular Variational Autoencoder (TVAE). GC and TVAE achieved maximum data quality scores of 88% and 96%, respectively. Next, we posit that the DTs should be shared with a data trusts layer. Data trusts are fiduciary frameworks that govern multi-party data sharing. The data trusts enforce access controls (location-based, role-based, and policy-based) on the synthetic health data and report to the data subject. Preliminary evaluations show that combining the two techniques (i.e., synthetic data digital twins and data trusts) enforces better privacy for health data access. The synthetic data provides stronger anonymization, while the data trusts provide easy auditing, tracking, and efficient reporting to the patient or data subject. The paper also details the architectural design of the data trusts and evaluates the efficiency of the access control techniques.
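The access-control idea in the abstract (location, role, and policy checks, with reporting to the data subject) can be illustrated with a minimal sketch. This is not the authors' middleware: the class names, field names, and allow-list values below are all hypothetical, and the three checks are reduced to simple set membership under the assumption that every check must pass before synthetic data is released.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class AccessRequest:
    """Hypothetical request for synthetic health data."""
    requester: str
    role: str       # role-based check, e.g. "researcher"
    location: str   # location-based check, e.g. a country code
    purpose: str    # policy-based check, e.g. declared purpose of use


@dataclass
class DataTrust:
    """Hypothetical data trust layer: decides on requests and logs each decision."""
    allowed_roles: frozenset
    allowed_locations: frozenset
    allowed_purposes: frozenset
    audit_log: list = field(default_factory=list)

    def decide(self, request):
        # Assumption: access is granted only if all three checks pass.
        checks = {
            "role": request.role in self.allowed_roles,
            "location": request.location in self.allowed_locations,
            "policy": request.purpose in self.allowed_purposes,
        }
        granted = all(checks.values())
        # Every decision is recorded so it can be audited and reported later.
        self.audit_log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "requester": request.requester,
            "granted": granted,
            "failed_checks": [name for name, ok in checks.items() if not ok],
        })
        return granted

    def report_for_subject(self):
        # Summary a data subject could receive about who requested access.
        return [
            f"{e['requester']}: {'granted' if e['granted'] else 'denied'}"
            for e in self.audit_log
        ]


# Illustrative allow-lists; the values are made up for this example.
trust = DataTrust(
    allowed_roles=frozenset({"researcher", "clinician"}),
    allowed_locations=frozenset({"CA", "US"}),
    allowed_purposes=frozenset({"medical-research"}),
)
ok = trust.decide(AccessRequest("alice", "researcher", "CA", "medical-research"))
blocked = trust.decide(AccessRequest("bob", "researcher", "FR", "medical-research"))
```

In this sketch the second request fails only the location check, and both decisions end up in the audit log that feeds the report to the data subject.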
Pages: 1-10
Page count: 10