DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency

被引:0
|
作者
Yao, Wenfang [1 ]
Yin, Kejing [2 ]
Cheung, William K. [2 ]
Liu, Jia [3 ]
Qin, Jing [1 ]
机构
[1] Hong Kong Polytech Univ, Sch Nursing, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15 | 2024年
关键词
DIAGNOSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The combination of electronic health records (EHR) and medical images is crucial for clinicians in making diagnoses and forecasting prognoses. Strategically fusing these two data modalities has great potential to improve the accuracy of machine learning models in clinical prediction tasks. However, the asynchronous and complementary nature of EHR and medical images presents unique challenges. Missing modalities due to clinical and administrative factors are inevitable in practice, and the significance of each data modality varies depending on the patient and the prediction target, resulting in inconsistent predictions and suboptimal model performance. To address these challenges, we propose DrFuse to achieve effective clinical multi-modal fusion. It tackles the missing modality issue by disentangling the features shared across modalities and those unique within each modality. Furthermore, we address the modal inconsistency issue via a diseasewise attention layer that produces the patient- and diseasewise weighting for each modality to make the final prediction. We validate the proposed method using real-world large-scale datasets, MIMIC-IV and MIMIC-CXR. Experimental results show that the proposed method significantly outperforms the state-of-the-art models.
引用
收藏
页码:16416 / 16424
页数:9
相关论文
共 50 条
  • [1] Exploiting Multi-modal Fusion for Robust Face Representation Learning with Missing Modality
    Zhu, Yizhe
    Sun, Xin
    Zhou, Xi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 283 - 294
  • [2] Efficient disentangled representation learning for multi-modal finger biometrics
    Yang, Weili
    Huang, Junduan
    Luo, Dacan
    Kang, Wenxiong
    PATTERN RECOGNITION, 2024, 145
  • [3] Learnable Cross-modal Knowledge Distillation for Multi-modal Learning with Missing Modality
    Wang, Hu
    Ma, Congbo
    Zhang, Jianpeng
    Zhang, Yuan
    Avery, Jodie
    Hull, Louise
    Carneiro, Gustavo
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 216 - 226
  • [4] SSDMM-VAE: variational multi-modal disentangled representation learning
    Arnab Kumar Mondal
    Ajay Sailopal
    Parag Singla
    Prathosh AP
    Applied Intelligence, 2023, 53 : 8467 - 8481
  • [5] SSDMM-VAE: variational multi-modal disentangled representation learning
    Mondal, Arnab Kumar
    Sailopal, Ajay
    Singla, Parag
    Ap, Prathosh
    APPLIED INTELLIGENCE, 2023, 53 (07) : 8467 - 8481
  • [6] Missing-modality enabled multi-modal fusion architecture for medical data
    Wang, Muyu
    Fan, Shiyu
    Li, Yichen
    Xie, Zhongrang
    Chen, Hui
    JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 164
  • [7] Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning
    Jiang, Qian
    Chen, Changyou
    Zhao, Han
    Chen, Liqun
    Ping, Qing
    Tran, Son Dinh
    Xu, Yi
    Zeng, Belinda
    Chilimbi, Trishul
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7661 - 7671
  • [8] Multi-modal Network Representation Learning
    Zhang, Chuxu
    Jiang, Meng
    Zhang, Xiangliang
    Ye, Yanfang
    Chawla, Nitesh, V
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3557 - 3558
  • [9] Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
    Liang, Weixin
    Zhang, Yuhui
    Kwon, Yongchan
    Yeung, Serena
    Zou, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] Unified Multi-Modal Image Synthesis for Missing Modality Imputation
    Zhang, Yue
    Peng, Chengtao
    Wang, Qiuli
    Song, Dan
    Li, Kaiyan
    Zhou, S. Kevin
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 4 - 18