Developing a prototype for federated analysis to enhance privacy and enable trustworthy access to COVID-19 research data

Cited by: 1
Authors
Oskoui, Solmaz Eradat [1]
Retford, Matthew [2]
Forde, Eoghan [1]
Barnes, Rodrigo [1]
Hunter, Karen J. [2]
Wozencraft, Anne [2]
Thompson, Simon [3]
Orton, Chris [3]
Ford, David [3]
Heys, Sharon [3]
Kennedy, Julie [3]
Mcnerney, Cynthia [3]
Peng, Jeffrey [3]
Ghanbariadolat, Hamed [3]
Rees, Sarah [3]
Mulholland, Rachel H. [4]
Sheikh, Aziz [4]
Burgner, David [5,6]
Brockway, Meredith [7,8]
Azad, Meghan B. [9,10]
Rodriguez, Natalie [9,10]
Zoega, Helga [11]
Stock, Sarah J. [4]
Calvert, Clara [4,12]
Miller, Jessica E. [5]
Fiorentino, Nicole [9]
Racine, Amy [13]
Haggstrom, Jonas [13]
Postlethwaite, Neil [2]
Affiliations
[1] Aridhia Informat, Glasgow, Scotland
[2] Hlth Data Res UK HDR UK, London, England
[3] Swansea Univ, Med Sch, Swansea, Wales
[4] Univ Edinburgh, Usher Inst, Edinburgh, Scotland
[5] Royal Childrens Hosp, Murdoch Childrens Res Inst, Infect Immun & Global Hlth Theme, Parkville, Vic, Australia
[6] Univ Melbourne, Dept Paediat, Parkville, Vic, Australia
[7] Univ Calgary, Fac Nursing, Calgary, AB, Canada
[8] Alberta Childrens Hosp Res Inst, Calgary, AB, Canada
[9] Univ Manitoba, Pediat & Child Hlth, Winnipeg, MB, Canada
[10] Childrens Hosp Res Inst Manitoba, Winnipeg, MB, Canada
[11] Univ Iceland, Fac Med, Ctr Publ Hlth Sci, Reykjavik, Iceland
[12] London Sch Hyg & Trop Med, Dept Populat Hlth, London, England
[13] Cytel, Cambridge, MA, USA
Funding
Wellcome Trust (UK)
Keywords
Federated Networks; Federated Analytics; COVID-19; Health Data Research; Privacy-Preserving; Secondary Data; Data Re-use
DOI
10.1016/j.ijmedinf.2024.105708
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Background: The use of federated networks can reduce the risk of disclosure for sensitive datasets by removing the requirement to physically transfer data. Federated networks support federated analytics, a type of privacy-enhancing technology, enabling trustworthy data analysis without the movement of source data.

Objectives: To set out the methodology used by the International COVID-19 Data Alliance (ICODA) and its partners, the Secure Anonymised Information Linkage (SAIL) Databank and Aridhia Informatics in piloting a federated network infrastructure and consequently testing federated analytics using test data provided from an ICODA project, the International Perinatal Outcome in the Pandemic (iPOP) Study. To share the challenges and benefits of using a federated network infrastructure to enable trustworthy analysis of health-related data from multiple countries and sources.

Results: This project successfully developed a federated network between the SAIL Databank and the ICODA Workbench and piloted the use of federated analysis using aggregate-level model outputs as test data from the iPOP Study, a one-year, multi-country COVID-19 research project. This integration is a first step in implementing the necessary technical, governance and user experiences for future research studies to build upon, including those using individual-level datasets from multiple data nodes.

Conclusions: Creating federated networks requires extensive investment from a data governance, technology, training, resources, timing and funding perspective. For future initiatives, the establishment of a federated network should be built into medium to long term plans to provide researchers with a secure and robust data analysis platform to perform joint multi-site collaboration. Federated networks can unlock the enormous potential of national and international health datasets through enabling collaborative research that addresses critical public health challenges, whilst maintaining privacy and trustworthiness by preventing direct access to the source data.
Pages: 13