The need for multimodal health data modeling: A practical approach for a federated-learning healthcare platform

被引:25
作者
Cremonesi, Francesco [1 ,8 ]
Planat, Vincent [2 ]
Kalokyri, Varvara [3 ]
Kondylakis, Haridimos [3 ]
Sanavia, Tiziana [4 ]
Resinas, Victor Miguel Mateos [5 ]
Singh, Babita [6 ]
Uribe, Silvia [7 ]
机构
[1] Univ Cote dAzur, Epione Res Project, Inria Sophia Antipolis Mediteranee, Nice, France
[2] Dedalus, Global Consulting, Le Plessis Robinson, France
[3] Fdn Res & Technol Hellas, Inst Comp Sci, Iraklion, Greece
[4] Univ Torino, Dept Med Sci, Turin, Italy
[5] Dedalus Healthcare, Malaga, Spain
[6] Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Barcelona, Spain
[7] Univ Politecn Madrid, Escuela Tecn Super Ingn Sistemas Informat, Madrid, Spain
[8] Datawizard srl, Rome, Italy
关键词
Federated learning; Data model; Healthcare; Medical research; Omics; Lessons learned; MEDICAL DATA; ARCHITECTURE; INFORMATICS; PRIVACY;
D O I
10.1016/j.jbi.2023.104338
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Federated learning initiatives in healthcare are being developed to collaboratively train predictive models without the need to centralize sensitive personal data. GenoMed4All is one such project, with the goal of connecting European clinical and -omics data repositories on rare diseases through a federated learning platform. Currently, the consortium faces the challenge of a lack of well-established international datasets and interoperability standards for federated learning applications on rare diseases. This paper presents our practical approach to select and implement a Common Data Model (CDM) suitable for the federated training of predictive models applied to the medical domain, during the initial design phase of our federated learning platform. We describe our selection process, composed of identifying the consortium's needs, reviewing our functional and technical architecture specifications, and extracting a list of business requirements. We review the state of the art and evaluate three widely-used approaches (FHIR, OMOP and Phenopackets) based on a checklist of requirements and specifications. We discuss the pros and cons of each approach considering the use cases specific to our consortium as well as the generic issues of implementing a European federated learning healthcare platform. A list of lessons learned from the experience in our consortium is discussed, from the importance of establishing the proper communication channels for all stakeholders to technical aspects related to -omics data. For federated learning projects focused on secondary use of health data for predictive modeling, encompassing multiple data modalities, a phase of data model convergence is sorely needed to gather different data representations developed in the context of medical research, interoperability of clinical care software, imaging, and -omics analysis into a coherent, unified data model. Our work identifies this need and presents our experience and a list of actionable lessons learned for future work in this direction.
引用
收藏
页数:12
相关论文
共 47 条
[11]  
Deist TM, 2017, CLIN TRANSL RAD ONCO, V4, P24, DOI 10.1016/j.ctro.2016.12.004
[12]   HL7 Clinical Document Architecture, Release 2 [J].
Dolin, RH ;
Alschuler, L ;
Boyer, S ;
Beebe, C ;
Behlen, FM ;
Biron, PV ;
Shabo, A .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2006, 13 (01) :30-39
[13]   Portal of medical data models: information infrastructure for medical research and healthcare [J].
Dugas, Martin ;
Neuhaus, Philipp ;
Meidt, Alexandra ;
Doods, Justin ;
Storck, Michael ;
Bruland, Philipp ;
Varghese, Julian .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[14]  
fhir, About us
[15]   Launching PCORnet, a national patient-centered clinical research network [J].
Fleurence, Rachael L. ;
Curtis, Lesley H. ;
Califf, Robert M. ;
Platt, Richard ;
Selby, Joe V. ;
Brown, Jeffrey S. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (04) :578-582
[16]   Achieving a Nationwide Learning Health System [J].
Friedman, Charles P. ;
Wong, Adam K. ;
Blumenthal, David .
SCIENCE TRANSLATIONAL MEDICINE, 2010, 2 (57)
[17]  
Genereaux Brad, 2021, IHE radiology white paper-AI interoperability in imaging
[18]   KETOS: Clinical decision support and machine learning as a service - A training and deployment platform based on Docker, OMOP-CDM, and FHIR Web Services [J].
Gruendner, Julian ;
Schwachhofer, Thorsten ;
Sippl, Phillip ;
Wolf, Nicolas ;
Erpenbeck, Marcel ;
Gulden, Christian ;
Kapsner, Lorenz A. ;
Zierk, Jakob ;
Mate, Sebastian ;
Stuerzl, Michael ;
Croner, Roland ;
Prokosch, Hans-Ulrich ;
Toddenroth, Dennis .
PLOS ONE, 2019, 14 (10)
[19]  
Huser V, 2018, BIOCOMPUT-PAC SYM, P628
[20]  
International Organization for Standardization, 2017, 11615 ISO