Differential privacy preserved federated learning for prognostic modeling in COVID-19 patients using large multi-institutional chest CT dataset

被引:5
作者
Shiri, Isaac [1 ]
Salimi, Yazdan [1 ]
Sirjani, Nasim [2 ]
Razeghi, Behrooz [3 ]
Bagherieh, Sara [4 ]
Pakbin, Masoumeh [5 ]
Mansouri, Zahra [1 ]
Hajianfar, Ghasem [1 ]
Avval, Atlas Haddadi [6 ]
Askari, Dariush [7 ]
Ghasemian, Mohammadreza [8 ]
Sandoughdaran, Saleh [9 ]
Sohrabi, Ahmad [10 ]
Sadati, Elham [11 ]
Livani, Somayeh [12 ]
Iranpour, Pooya [13 ]
Kolahi, Shahriar [14 ]
Khosravi, Bardia [15 ]
Bijari, Salar [11 ]
Sayfollahi, Sahar [16 ]
Atashzar, Mohammad Reza [17 ]
Hasanian, Mohammad [18 ]
Shahhamzeh, Alireza [19 ]
Teimouri, Arash [13 ]
Goharpey, Neda [20 ]
Shirzad-Aski, Hesamaddin [21 ]
Karimi, Jalal [22 ]
Radmard, Amir Reza [23 ]
Rezaei-Kalantari, Kiara [24 ]
Oghli, Mostafa Ghelich [2 ]
Oveisi, Mehrdad [25 ]
Vafaei Sadr, Alireza [26 ]
Voloshynovskiy, Slava [3 ]
Zaidi, Habib [1 ,27 ,28 ,29 ,30 ]
机构
[1] Geneva Univ Hosp, Div Nucl Med & Mol Imaging, Geneva, Switzerland
[2] Med Fanavarn Plus Co, Res & Dev Dept, Karaj, Iran
[3] Univ Geneva, Dept Comp Sci, Geneva, Switzerland
[4] Isfahan Univ Med Sci, Sch Med, Esfahan, Iran
[5] Qom Univ Med Sci, Imaging Dept, Qom, Iran
[6] Mashhad Univ Med Sci, Sch Med, Mashhad, Iran
[7] Shahid Beheshti Univ Med Sci, Dept Radiol Technol, Tehran, Iran
[8] Qom Univ Med Sci, Shahid Beheshti Hosp, Dept Radiol, Qom, Iran
[9] Royal Surrey Cty Hosp, Dept Clin Oncol, Guildford, England
[10] Iran Univ Med Sci, Radin Makian Azma Mehr Ltd, Radinmehr Vet Lab, Gorgan, Iran
[11] Tarbiat Modares Univ, Fac Med Sci, Dept Med Phys, Tehran, Iran
[12] Golestan Univ Med Sci, Sayad Shirazi Hosp, Clin Res Dev Unit CRDU, Gorgan, Iran
[13] Shiraz Univ Med Sci, Med Imaging Res Ctr, Dept Radiol, Shiraz, Iran
[14] Univ Tehran Med Sci, Imam Khomeini Hosp, Adv Diagnost & Intervent Radiol Res Ctr ADIR, Dept Radiol,Sch Med, Tehran, Iran
[15] Univ Tehran Med Sci, Digest Dis Res Inst, Digest Dis Res Ctr, Tehran, Iran
[16] Iran Univ Med Sci, Fac Med, Dept Neurosurg, Tehran, Iran
[17] Fasa Univ Med Sci, Sch Med, Dept Immunol, Fasa, Iran
[18] Arak Univ Med Sci, Dept Radiol, Arak, Iran
[19] Qom Univ Med Sci, Clin Res Dev Ctr, Qom, Iran
[20] Shahid Beheshti Univ Med Sci, Shohada e Tajrish Hosp, Dept Radiat Oncol, Tehran, Iran
[21] Golestan Univ Med Sci, Infect Dis Res Ctr, Gorgan, Iran
[22] Fasa Univ Med Sci, Sch Med, Dept Infect Dis, Fasa, Iran
[23] Univ Tehran Med Sci, Shariati Hosp, Dept Radiol, Tehran, Iran
[24] Iran Univ Med Sci, Rajaie Cardiovasc Med & Res Ctr, Tehran, Iran
[25] Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
[26] Penn State Univ, Coll Med, Dept Publ Hlth Sci, Hershey, PA USA
[27] Univ Groningen, Univ Med Ctr Groningen, Dept Nucl Med & Mol Imaging, Groningen, Netherlands
[28] Univ Southern Denmark, Dept Nucl Med, Odense, Denmark
[29] Obuda Univ, Univ Res & Innovat Ctr, Budapest, Hungary
[30] Geneva Univ Hosp, Div Nucl Med & Mol Imaging, CH-1211 Geneva, Switzerland
基金
瑞士国家科学基金会;
关键词
COVID-19; CT; deep learning; federated learning; privacy; prognosis; ARTIFICIAL-INTELLIGENCE; CHALLENGES; SECURE;
D O I
10.1002/mp.16964
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BackgroundNotwithstanding the encouraging results of previous studies reporting on the efficiency of deep learning (DL) in COVID-19 prognostication, clinical adoption of the developed methodology still needs to be improved. To overcome this limitation, we set out to predict the prognosis of a large multi-institutional cohort of patients with COVID-19 using a DL-based model.PurposeThis study aimed to evaluate the performance of deep privacy-preserving federated learning (DPFL) in predicting COVID-19 outcomes using chest CT images.MethodsAfter applying inclusion and exclusion criteria, 3055 patients from 19 centers, including 1599 alive and 1456 deceased, were enrolled in this study. Data from all centers were split (randomly with stratification respective to each center and class) into a training/validation set (70%/10%) and a hold-out test set (20%). For the DL model, feature extraction was performed on 2D slices, and averaging was performed at the final layer to construct a 3D model for each scan. The DensNet model was used for feature extraction. The model was developed using centralized and FL approaches. For FL, we employed DPFL approaches. Membership inference attack was also evaluated in the FL strategy. For model evaluation, different metrics were reported in the hold-out test sets. In addition, models trained in two scenarios, centralized and FL, were compared using the DeLong test for statistical differences.ResultsThe centralized model achieved an accuracy of 0.76, while the DPFL model had an accuracy of 0.75. Both the centralized and DPFL models achieved a specificity of 0.77. The centralized model achieved a sensitivity of 0.74, while the DPFL model had a sensitivity of 0.73. A mean AUC of 0.82 and 0.81 with 95% confidence intervals of (95% CI: 0.79-0.85) and (95% CI: 0.77-0.84) were achieved by the centralized model and the DPFL model, respectively. The DeLong test did not prove statistically significant differences between the two models (p-value = 0.98). The AUC values for the inference attacks fluctuate between 0.49 and 0.51, with an average of 0.50 +/- 0.003 and 95% CI for the mean AUC of 0.500 to 0.501.ConclusionThe performance of the proposed model was comparable to centralized models while operating on large and heterogeneous multi-institutional datasets. In addition, the model was resistant to inference attacks, ensuring the privacy of shared data during the training process.
引用
收藏
页码:4736 / 4747
页数:12
相关论文
共 74 条
[1]   A Survey on Federated Learning: The Journey From Centralized to Distributed On-Site Learning and Beyond [J].
AbdulRahman, Sawsan ;
Tout, Hanine ;
Ould-Slimane, Hakima ;
Mourad, Azzam ;
Talhi, Chamseddine ;
Guizani, Mohsen .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (07) :5476-5497
[2]  
Andrew G, 2021, ADV NEUR IN, V34
[3]  
Bhagoji AN, 2019, PR MACH LEARN RES, V97
[4]  
Bonawitz K.A., 2016, arXiv
[5]  
Carbonell Guillermo, 2021, medRxiv, DOI 10.1101/2021.10.11.21264709
[6]  
Chen WC, 2022, PR MACH LEARN RES
[7]   Adaptive medical image encryption algorithm based on multiple chaotic mapping [J].
Chen, Xiao ;
Hu, Chun-Jie .
SAUDI JOURNAL OF BIOLOGICAL SCIENCES, 2017, 24 (08) :1821-1827
[8]   Federated learning based Covid-19 detection [J].
Chowdhury, Deepraj ;
Banerjee, Soham ;
Sannigrahi, Madhushree ;
Chakraborty, Arka ;
Das, Anik ;
Dey, Ajoy ;
Dwivedi, Ashutosh Dhar .
EXPERT SYSTEMS, 2023, 40 (05)
[9]   Federated learning for predicting clinical outcomes in patients with COVID-19 [J].
Dayan, Ittai ;
Roth, Holger R. ;
Zhong, Aoxiao ;
Harouni, Ahmed ;
Gentili, Amilcare ;
Abidin, Anas Z. ;
Liu, Andrew ;
Costa, Anthony Beardsworth ;
Wood, Bradford J. ;
Tsai, Chien-Sung ;
Wang, Chih-Hung ;
Hsu, Chun-Nan ;
Lee, C. K. ;
Ruan, Peiying ;
Xu, Daguang ;
Wu, Dufan ;
Huang, Eddie ;
Kitamura, Felipe Campos ;
Lacey, Griffin ;
de Antonio Corradi, Gustavo Cesar ;
Nino, Gustavo ;
Shin, Hao-Hsin ;
Obinata, Hirofumi ;
Ren, Hui ;
Crane, Jason C. ;
Tetreault, Jesse ;
Guan, Jiahui ;
Garrett, John W. ;
Kaggie, Joshua D. ;
Park, Jung Gil ;
Dreyer, Keith ;
Juluru, Krishna ;
Kersten, Kristopher ;
Rockenbach, Marcio Aloisio Bezerra Cavalcanti ;
Linguraru, Marius George ;
Haider, Masoom A. ;
AbdelMaseeh, Meena ;
Rieke, Nicola ;
Damasceno, Pablo F. ;
Silva, Pedro Mario Cruz E. ;
Wang, Pochuan ;
Xu, Sheng ;
Kawano, Shuichi ;
Sriswasdi, Sira ;
Park, Soo Young ;
Grist, Thomas M. ;
Buch, Varun ;
Jantarabenjakul, Watsamon ;
Wang, Weichung ;
Tak, Won Young .
NATURE MEDICINE, 2021, 27 (10) :1735-+
[10]   The Algorithmic Foundations of Differential Privacy [J].
Dwork, Cynthia ;
Roth, Aaron .
FOUNDATIONS AND TRENDS IN THEORETICAL COMPUTER SCIENCE, 2013, 9 (3-4) :211-406