Privacy-Preserving Deep Learning for the Detection of Protected Health Information in Real-World Data: Comparative Evaluation

被引:8
|
作者
Festag, Sven [1 ,2 ]
Spreckelsen, Cord [1 ,2 ]
机构
[1] Rhein Westfal TH Aachen, Med Fac, Dept Med Informat, Aachen, Germany
[2] Jena Univ Hosp, Inst Med Stat Comp & Data Sci, Bachstr 18, Jena, Germany
关键词
privacy-preserving protocols; neural networks; health informatics; distributed machine learning;
D O I
10.2196/14064
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Collaborative privacy-preserving training methods allow for the integration of locally stored private data sets into machine learning approaches while ensuring confidentiality and nondisclosure. Objective: In this work we assess the performance of a state-of-the-art neural network approach for the detection of protected health information in texts trained in a collaborative privacy-preserving way. Methods: The training adopts distributed selective stochastic gradient descent (ie, it works by exchanging local learning results achieved on private data sets). Five networks were trained on separated real-world clinical data sets by using the privacy-protecting protocol. In total, the data sets contain 1304 real longitudinal patient records for 296 patients. Results: These networks reached a mean F1 value of 0.955. The gold standard centralized training that is based on the union of all sets and does not take data security into consideration reaches a final value of 0.962. Conclusions: Using real-world clinical data, our study shows that detection of protected health information can be secured by collaborative privacy-preserving training. In general, the approach shows the feasibility of deep learning on distributed and confidential clinical data while ensuring data protection.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Privacy-preserving federated machine learning on FAIR health data: A real-world application
    Sinaci, A. Anil
    Gencturk, Mert
    Alvarez-Romero, Celia
    Erturkmen, Gokce Banu Laleci
    Martinez-Garcia, Alicia
    Escalona-Cuaresma, Maria Jose
    Parra-Calderon, Carlos Luis
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 24 : 136 - 145
  • [2] Privacy-preserving Real-world Video Anomaly Detection
    Noghre, Ghazal Alinezhad
    2023 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP, 2023, : 253 - 254
  • [3] Privacy-Preserving Synthetic Location Data in the Real World
    Cunningham, Teddy
    Cormode, Graham
    Ferhatosmanoglu, Hakan
    PROCEEDINGS OF 17TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATABASES, SSTD 2021, 2021, : 23 - 33
  • [4] Real-World Identification for an Extensible and Privacy-Preserving Mobile eID
    Hoelzl, Michael
    Roland, Michael
    Mayrhofer, Rene
    PRIVACY AND IDENTITY MANAGEMENT: THE SMART REVOLUTION, 2018, 526 : 354 - 370
  • [5] Unsupervised, Federated and Privacy-Preserving Detection of Anomalous Electricity Consumption in Real-World Scenarios
    Bos, Joppe W.
    De Vis, Michiel
    Faes, Charles
    Gonzalez-Deleito, Nicolas
    Hristoskova, Anna
    Klein, Sarah
    Rajendran, Sreeraj
    2022 IEEE SUSTAINABLE POWER AND ENERGY CONFERENCE (ISPEC), 2022,
  • [6] Privacy-Preserving Deep Learning on Big Data in Cloud
    Fan, Yongkai
    Zhang, Wanyu
    Bai, Jianrong
    Lei, Xia
    Li, Kuanching
    CHINA COMMUNICATIONS, 2023, 20 (11) : 176 - 186
  • [7] Privacy-Preserving in the Context of Data Mining and Deep Learning
    Altalhi, Amjaad
    Al-Saedi, Maram
    Alsuwat, Hatim
    Alsuwat, Emad
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (06): : 137 - 142
  • [8] Privacy-Preserving Deep Learning on Big Data in Cloud
    Yongkai Fan
    Wanyu Zhang
    Jianrong Bai
    Xia Lei
    Kuanching Li
    China Communications, 2023, 20 (11) : 176 - 186
  • [9] Evaluation of Synthetic Data for Privacy-Preserving Machine Learning
    Hittmeir, Markus
    Ekelhart, Andreas
    Mayer, Rudolf
    ERCIM NEWS, 2020, (123): : 30 - 31
  • [10] Privacy-Preserving Statistical Analysis Method by Splitting Server Roles for Distributed Real-World Data
    Ishii, Jun
    Maeomichi, Hiroyuki
    Tsutsui, Akihiro
    Yoda, Ikuo
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2014, E97B (09) : 1779 - 1789