PMFN-SSL: Self-supervised learning-based progressive multimodal fusion network for cancer diagnosis and prognosis

被引:3
作者
Li, Le [1 ,2 ]
Pan, Hudan [3 ]
Liang, Yong [1 ,4 ]
Shao, Mingwen [5 ]
Xie, Shengli [6 ]
Lu, Shanghui [2 ]
Liao, Shuilin [2 ]
机构
[1] Peng Cheng Lab, Shenzhen, Peoples R China
[2] Macau Univ Sci & Technol, Sch Fac Innovat Engn, Macau, Peoples R China
[3] Guangzhou Univ Chinese Med, Affiliated Hosp 2, State Key Lab Tradit Chinese Med Syndrome, Guangzhou, Peoples R China
[4] Pazhou Lab Huangpu, Guangzhou, Peoples R China
[5] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China
[6] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China
基金
美国国家科学基金会;
关键词
Multimodal learning; Self-supervised learning; Survival analysis; Grade prediction; ARTIFICIAL-INTELLIGENCE; SURVIVAL PREDICTION; CLASSIFICATION; IMAGES;
D O I
10.1016/j.knosys.2024.111502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The integration of digital pathology images and genetic data is a developing field in cancer research, presenting potential opportunities for predicting survival and classifying grades through multiple source data. However, obtaining comprehensive annotations proves challenging in practical medical settings, and the extraction of features from high -resolution pathology images is hindered by inter-domain disparities. Current data fusion methods ignore the spatio-temporal incongruity among multimodal data. To address the above challenges, we propose a novel self-supervised transformer-based pathology feature extraction strategy, and construct an interpretable Progressive Multimodal Fusion Network (PMFN-SSL) for cancer diagnosis and prognosis. Our contributions are mainly divided into three aspects. Firstly, we propose a joint patch sampling strategy based on the information entropy and HSV components of an image, which reduces the demand for sample annotations and avoid image quality degradation caused by manual contamination. Secondly, a self-supervised transformerbased feature extraction module for pathology images is proposed and innovatively leverages partially weakly supervised labeling to align the extracted features with downstream medical tasks. Further, we improve the existing multimodal feature fusion model with an progressive fusion strategy to reduce the inconsistency between multimodal data due to differences in collection of temporal and spatial. Abundant ablation and comparison experiments demonstrate that the proposed data preprocessing method and multimodal fusion paradigm strengthen the quality of feature extraction and improve the prediction based on real cancer grading and prognosis. Code and trained models are made available at: https://github.com/Mercuriiio/PMFN-SSL.
引用
收藏
页数:12
相关论文
共 75 条
  • [1] Automated classification of brain tumor type in whole-slide digital pathology images using local representative tiles
    Barker, Jocelyn
    Hoogi, Assaf
    Depeursinge, Adrien
    Rubin, Daniel L.
    [J]. MEDICAL IMAGE ANALYSIS, 2016, 30 : 60 - 71
  • [2] Survival analysis and Cox regression
    Benitez-Parejo, N.
    Rodriguez del Aguila, M. M.
    Perez-Vicente, S.
    [J]. ALLERGOLOGIA ET IMMUNOPATHOLOGIA, 2011, 39 (06) : 362 - 373
  • [3] Gliomas With 1p/19q Codeletion: a.k.a. Oligodendroglioma
    Cairncross, Gregory
    Jenkins, Robert
    [J]. CANCER JOURNAL, 2008, 14 (06) : 352 - 357
  • [4] Chen RJ, 2021, Arxiv, DOI arXiv:2108.02278
  • [5] Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images
    Chen, Richard J.
    Lu, Ming Y.
    Weng, Wei-Hung
    Chen, Tiffany Y.
    Williamson, Drew F. K.
    Manz, Trevor
    Shady, Maha
    Mahmood, Faisal
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3995 - 4005
  • [6] Chen RJ, 2022, IEEE T MED IMAGING, V41, P757, DOI [10.1109/TMI.2020.3021387, 10.1109/TITS.2020.3030218]
  • [7] Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review
    Dhiman, Paula
    Ma, Jie
    Navarro, Constanza L. Andaur
    Speich, Benjamin
    Bullock, Garrett
    Damen, Johanna A. A.
    Hooft, Lotty
    Kirtley, Shona
    Riley, Richard D.
    Van Calster, Ben
    Moons, Karel G. M.
    Collins, Gary S.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [8] Multi-scale Prototypical Transformer forWhole Slide Image Classification
    Ding, Saisai
    Wang, Jun
    Li, Juncheng
    Shi, Jun
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 602 - 611
  • [9] Donglin Di, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12265), P428, DOI 10.1007/978-3-030-59722-1_41
  • [10] Dou Q, 2018, Arxiv, DOI arXiv:1804.10916