Learning Multimodal Explainable AI Models from Medical Images and Tabular Data: Proof of Concept

被引:0
作者
Malafaia, Mafalda [1 ]
Schlender, Thalea [2 ]
Bosman, Peter A. N. [1 ,3 ]
Alderliesten, Tanja [2 ]
机构
[1] Ctr Wiskunde & Informat, Evolutionary Intelligence Grp, Amsterdam, Netherlands
[2] Leiden Univ Med Ctr, Dept Radiat Oncol, Leiden, Netherlands
[3] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Delft, Netherlands
来源
MEDICAL IMAGING 2025: IMAGE PROCESSING | 2025年 / 13406卷
关键词
explainability; genetic programming; medical image analysis; multimodal learning;
D O I
10.1117/12.3040402
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical applications often involve several data modalities, particularly medical images and clinical information, which can be combined to enhance the decision-making process by improving accuracy. Multimodal learning approaches can leverage all available data for increased robustness in the resulting models, consequently outperforming unimodal approaches. Furthermore, AI frameworks must be human-verifiable and interpretable to be deployed in real-world situations, considering legal and privacy aspects. Due to the opaque nature of Deep Learning (DL) methods, interpretability is often limited despite their state-of-the-art performance in many tasks. Genetic Programming (GP) can provide compact and interpretable symbolic expressions for tabular data but is less effective for image analysis. We introduce MultiFIX: a new interpretability-focused pipeline for multimodal learning that leverages the strengths of DL and GP to explicitly engineer features from different data types and combine them to make the final prediction. The MultiFIX pipeline comprises two stages: the training stage, where a DL (black-box) model is trained using different training procedures to extract relevant features from each modality; and the inference stage, where the resulting model is transformed to be interpretable. Image features are explained with attention maps by Grad-CAM, and inherently interpretable symbolic expressions evolved with GP fully replace the tabular feature engineering block, and the fusion of the extracted features to predict the target label. To show the application potential of the presented pipeline, we demonstrate MultiFIX with a Melanoma Risk Assessment dataset. Results show that MultiFIX outperforms unimodal models while offering explanations that can be straightforwardly analysed and are consistent with the expectations.
引用
收藏
页数:7
相关论文
共 15 条
[1]   A review on multimodal medical image fusion: Compendious analysis of medical modalities, multimodal databases, fusion techniques and quality metrics [J].
Azam, Muhammad Adeel ;
Khan, Khan Bahadar ;
Salahuddin, Sana ;
Rehman, Eid ;
Khan, Sajid Ali ;
Khan, Muhammad Attique ;
Kadry, Seifedine ;
Gandomi, Amir H. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
[2]   Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication [J].
Azher, Zarif L. ;
Suvarna, Anish ;
Chen, Ji-Qing ;
Zhang, Ze ;
Christensen, Brock C. ;
Salas, Lucas A. ;
Vaickus, Louis J. ;
Levy, Joshua J. .
BIODATA MINING, 2023, 16 (01)
[3]   Using Feature Clustering for GP-Based Feature Construction on High-Dimensional Data [J].
Binh Tran ;
Xue, Bing ;
Zhang, Mengjie .
GENETIC PROGRAMMING, EUROGP 2017, 2017, 10196 :210-226
[4]  
Ha Q, 2020, Arxiv, DOI [arXiv:2010.05351, 10.48550/arXiv.2010.05351]
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]  
Holzinger A, 2017, Arxiv, DOI arXiv:1712.09923
[7]   A Review of Fusion Methods for Omics and Imaging Data [J].
Huang, Weixian ;
Tan, Kaiwen ;
Zhang, Ziye ;
Hu, Jinlong ;
Dong, Shoubin .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) :74-93
[8]   Multimodal machine learning in precision health: A scoping review [J].
Kline, Adrienne ;
Wang, Hanyin ;
Li, Yikuan ;
Dennis, Saya ;
Hutch, Meghan ;
Xu, Zhenxing ;
Wang, Fei ;
Cheng, Feixiong ;
Luo, Yuan .
NPJ DIGITAL MEDICINE, 2022, 5 (01)
[9]  
La Cava W., 2021, P NEUR INF PROC SYST, V1
[10]   A patient-centric dataset of images and metadata for identifying melanomas using clinical context [J].
Rotemberg, Veronica ;
Kurtansky, Nicholas ;
Betz-Stablein, Brigid ;
Caffery, Liam ;
Chousakos, Emmanouil ;
Codella, Noel ;
Combalia, Marc ;
Dusza, Stephen ;
Guitera, Pascale ;
Gutman, David ;
Halpern, Allan ;
Helba, Brian ;
Kittler, Harald ;
Kose, Kivanc ;
Langer, Steve ;
Lioprys, Konstantinos ;
Malvehy, Josep ;
Musthaq, Shenara ;
Nanda, Jabpani ;
Reiter, Ofer ;
Shih, George ;
Stratigos, Alexander ;
Tschandl, Philipp ;
Weber, Jochen ;
Soyer, H. Peter .
SCIENTIFIC DATA, 2021, 8 (01)