DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency

被引：0

作者：

Yao, Wenfang ^{[1
]}

Yin, Kejing ^{[2
]}

Cheung, William K. ^{[2
]}

Liu, Jia ^{[3
]}

Qin, Jing ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Sch Nursing, Hong Kong, Peoples R China

[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China

[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15 | 2024年

关键词：

DIAGNOSIS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The combination of electronic health records (EHR) and medical images is crucial for clinicians in making diagnoses and forecasting prognoses. Strategically fusing these two data modalities has great potential to improve the accuracy of machine learning models in clinical prediction tasks. However, the asynchronous and complementary nature of EHR and medical images presents unique challenges. Missing modalities due to clinical and administrative factors are inevitable in practice, and the significance of each data modality varies depending on the patient and the prediction target, resulting in inconsistent predictions and suboptimal model performance. To address these challenges, we propose DrFuse to achieve effective clinical multi-modal fusion. It tackles the missing modality issue by disentangling the features shared across modalities and those unique within each modality. Furthermore, we address the modal inconsistency issue via a diseasewise attention layer that produces the patient- and diseasewise weighting for each modality to make the final prediction. We validate the proposed method using real-world large-scale datasets, MIMIC-IV and MIMIC-CXR. Experimental results show that the proposed method significantly outperforms the state-of-the-art models.

引用

页码：16416 / 16424

页数：9

共 50 条

[1] Exploiting Multi-modal Fusion for Robust Face Representation Learning with Missing Modality
Zhu, Yizhe
Sun, Xin
Zhou, Xi
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 283 - 294
[2] Efficient disentangled representation learning for multi-modal finger biometrics
Yang, Weili
Huang, Junduan
Luo, Dacan
Kang, Wenxiong
PATTERN RECOGNITION, 2024, 145
[3] Learnable Cross-modal Knowledge Distillation for Multi-modal Learning with Missing Modality
Wang, Hu
Ma, Congbo
Zhang, Jianpeng
Zhang, Yuan
Avery, Jodie
Hull, Louise
Carneiro, Gustavo
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 216 - 226
[4] SSDMM-VAE: variational multi-modal disentangled representation learning
Arnab Kumar Mondal
Ajay Sailopal
Parag Singla
Prathosh AP
Applied Intelligence, 2023, 53 : 8467 - 8481
[5] SSDMM-VAE: variational multi-modal disentangled representation learning
Mondal, Arnab Kumar
Sailopal, Ajay
Singla, Parag
Ap, Prathosh
APPLIED INTELLIGENCE, 2023, 53 (07) : 8467 - 8481
[6] Missing-modality enabled multi-modal fusion architecture for medical data
Wang, Muyu
Fan, Shiyu
Li, Yichen
Xie, Zhongrang
Chen, Hui
JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 164
[7] Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning
Jiang, Qian
Chen, Changyou
Zhao, Han
Chen, Liqun
Ping, Qing
Tran, Son Dinh
Xu, Yi
Zeng, Belinda
Chilimbi, Trishul
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7661 - 7671
[8] Multi-modal Network Representation Learning
Zhang, Chuxu
Jiang, Meng
Zhang, Xiangliang
Ye, Yanfang
Chawla, Nitesh, V
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3557 - 3558
[9] Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
Liang, Weixin
Zhang, Yuhui
Kwon, Yongchan
Yeung, Serena
Zou, James
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[10] Unified Multi-Modal Image Synthesis for Missing Modality Imputation
Zhang, Yue
Peng, Chengtao
Wang, Qiuli
Song, Dan
Li, Kaiyan
Zhou, S. Kevin
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 4 - 18

← 1 2 3 4 5 →