EHR-based prediction modelling meets multimodal deep learning: A systematic review of structured and textual data fusion methods

被引:1
作者
Teles, Ariel Soares [1 ,2 ]
de Moura, Ivan Rodrigues [3 ]
Silva, Francisco [4 ]
Roberts, Angus [1 ]
Stahl, Daniel
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Biostat & Hlth Informat, London, England
[2] Fed Inst Maranhao, Sao Luis, Maranhao, Brazil
[3] Fed Inst Piaui, Teresina, Piaui, Brazil
[4] Univ Fed Maranhao, Maranhao, Maranhao, Brazil
关键词
Multimodal; Deep learning; Electronic Health Records; Predictive modelling; Data fusion; Natural Language Processing; Medical AI; ELECTRONIC HEALTH RECORDS; ARTIFICIAL-INTELLIGENCE; DISEASE PREDICTION; INFORMATION; AGREEMENT; KAPPA; RISK;
D O I
10.1016/j.inffus.2025.102981
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronic Health Records (EHRs) have transformed healthcare by digitally consolidating patient medical history, encompassing structured data (e.g., demographic data, lab results), and unstructured textual data (e.g., clinical notes). These data hold significant potential for predictive modelling, and recent studies have dedicated efforts to leverage the different modalities in a cohesive and effective manner to improve predictive accuracy. This Systematic Literature Review (SLR) addresses the application of Multimodal Deep Learning (MDL) methods in EHR-based prediction modelling, specifically through the fusion of structured and textual data. Following PRISMA guidelines, we conducted a comprehensive literature search across six article databases, using a carefully designed search string. After applying inclusion and exclusion criteria, we selected 77 primary studies. Data extraction was standardized using a structured form based on the CHARMS checklist. We categorized and analysed the fusion strategies employed across the studies. By combining structured and textual data at the input level, early fusion enabled models to learn joint feature representations from the beginning, whether in vectorized representations or data textualization. Intermediate fusion, which delays integration, was particularly useful for tasks where each modality provides unique insights that need to be processed independently before being combined. Late fusion enabled modularity by integrating outputs from unimodal models, which is suitable when EHR structured and textual data have varying quality or reliability. We also identified trends and open issues that need attention. This review contributes a comprehensive understanding of EHR data fusion practices using MDL, highlighting potential pathways for future research and development in health informatics.
引用
收藏
页数:24
相关论文
共 183 条
[51]   Combining clinical notes with structured electronic health records enhances the prediction of mental health crises [J].
Garriga, Roger ;
Buda, Teodora Sandra ;
Guerreiro, Joao ;
Iglesias, Jesus Omana ;
Aguerri, Inaki Estella ;
Matic, Aleksandar .
CELL REPORTS MEDICINE, 2023, 4 (11)
[52]  
Gemma Team, 2024, Arxiv, DOI [arXiv:2408.00118, DOI 10.48550/ARXIV.2408.00118]
[53]   Evaluating the accuracy of a state-of-the-art large language model for prediction of admissions from the emergency room [J].
Glicksberg, Benjamin S. ;
Timsina, Prem ;
Patel, Dhaval ;
Sawant, Ashwin ;
Vaid, Akhil ;
Raut, Ganesh ;
Charney, Alexander W. ;
Apakama, Donald ;
Carr, Brendan G. ;
Freeman, Robert ;
Nadkarni, Girish N. ;
Klang, Eyal .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) :1921-1928
[54]   Machine Learning Algorithms to Predict Breast Cancer Recurrence Using Structured and Unstructured Sources from Electronic Health Records [J].
Gonzalez-Castro, Lorena ;
Chavez, Marcela ;
Duflot, Patrick ;
Bleret, Valerie ;
Martin, Alistair G. ;
Zobel, Marc ;
Nateqi, Jama ;
Lin, Simon ;
Pazos-Arias, Jose J. ;
Del Fiol, Guilherme ;
Lopez-Nores, Martin .
CANCERS, 2023, 15 (10)
[55]   Looking for low vision: Predicting visual prognosis by fusing structured and free-text data from electronic health records [J].
Gui, Haiwen ;
Tseng, Benjamin ;
Hu, Wendeng ;
Wang, Sophia Y. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 159
[56]  
Gupta M, 2022, PR MACH LEARN RES, V193, P311
[57]   PRISMA2020: An R package and Shiny app for producing PRISMA 2020-compliant flow diagrams, with interactivity for optimised digital transparency and Open Synthesis [J].
Haddaway, Neal R. ;
Page, Matthew J. ;
Pritchard, Chris C. ;
McGuinness, Luke A. .
CAMPBELL SYSTEMATIC REVIEWS, 2022, 18 (02)
[58]   Affordable and real-time antimicrobial resistance prediction from multimodal electronic health records [J].
Hardan, Shahad ;
Shaaban, Mai A. ;
Abdalla, Jehad ;
Yaqub, Mohammad .
SCIENTIFIC REPORTS, 2024, 14 (01)
[59]   A deep attention model to forecast the Length Of Stay and the in-hospital mortality right on admission from ICD codes and demographic data [J].
Harerimana, Gaspard ;
Kim, Jong Wook ;
Jang, Beakcheol .
JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 118
[60]   Multitask learning and benchmarking with clinical time series data [J].
Harutyunyan, Hrayr ;
Khachatrian, Hrant ;
Kale, David C. ;
Ver Steeg, Greg ;
Galstyan, Aram .
SCIENTIFIC DATA, 2019, 6 (1)