PMFN-SSL: Self-supervised learning-based progressive multimodal fusion network for cancer diagnosis and prognosis

Cited by: 3

Authors:

Li, Le ^{[1,2]}

Pan, Hudan ^{[3]}

Liang, Yong ^{[1,4]}

Shao, Mingwen ^{[5]}

Xie, Shengli ^{[6]}

Lu, Shanghui ^{[2]}

Liao, Shuilin ^{[2]}

Affiliations:

[1] Peng Cheng Lab, Shenzhen, Peoples R China

[2] Macau Univ Sci & Technol, Sch Fac Innovat Engn, Macau, Peoples R China

[3] Guangzhou Univ Chinese Med, Affiliated Hosp 2, State Key Lab Tradit Chinese Med Syndrome, Guangzhou, Peoples R China

[4] Pazhou Lab Huangpu, Guangzhou, Peoples R China

[5] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China

[6] Guangdong Univ Technol, Sch Automat, Guangzhou, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2024 / Vol. 289

Funding:

National Science Foundation (USA);

Keywords:

Multimodal learning; Self-supervised learning; Survival analysis; Grade prediction; ARTIFICIAL-INTELLIGENCE; SURVIVAL PREDICTION; CLASSIFICATION; IMAGES;

DOI:

10.1016/j.knosys.2024.111502

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The integration of digital pathology images and genetic data is a developing field in cancer research, presenting potential opportunities for predicting survival and classifying grades from multi-source data. However, obtaining comprehensive annotations proves challenging in practical medical settings, and the extraction of features from high-resolution pathology images is hindered by inter-domain disparities. Current data fusion methods ignore the spatio-temporal incongruity among multimodal data. To address the above challenges, we propose a novel self-supervised transformer-based pathology feature extraction strategy, and construct an interpretable Progressive Multimodal Fusion Network (PMFN-SSL) for cancer diagnosis and prognosis. Our contributions are mainly divided into three aspects. Firstly, we propose a joint patch sampling strategy based on the information entropy and HSV components of an image, which reduces the demand for sample annotations and avoids image quality degradation caused by manual contamination. Secondly, a self-supervised transformer-based feature extraction module for pathology images is proposed and innovatively leverages partially weakly supervised labeling to align the extracted features with downstream medical tasks. Further, we improve the existing multimodal feature fusion model with a progressive fusion strategy to reduce the inconsistency between multimodal data due to temporal and spatial differences in data collection. Abundant ablation and comparison experiments demonstrate that the proposed data preprocessing method and multimodal fusion paradigm strengthen the quality of feature extraction and improve prediction on real cancer grading and prognosis tasks. Code and trained models are made available at: https://github.com/Mercuriiio/PMFN-SSL.
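The patch sampling idea mentioned in the abstract (scoring patches by information entropy and HSV components) can be illustrated with a minimal sketch. This is not the authors' released implementation: the function names, the use of mean saturation as the HSV criterion, the patch size, and both thresholds are assumptions chosen only for illustration. The intent is to show how low-information or near-colourless patches (for example, blank slide background) might be filtered out before feature extraction.

```python
import colorsys
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy in bits of a sequence of discrete intensity values."""
    counts = Counter(values)
    total = len(values)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def mean_saturation(pixels):
    """Mean HSV saturation in [0, 1] of a sequence of 8-bit (r, g, b) pixels."""
    sats = [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)[1] for r, g, b in pixels]
    return sum(sats) / len(sats)


def select_patches(image, patch=16, min_entropy=4.0, min_saturation=0.05):
    """Return the top-left (row, col) of every patch passing both thresholds.

    `image` is a list of rows, each row a list of (r, g, b) tuples.
    The thresholds are hypothetical values, not taken from the paper.
    """
    keep = []
    height, width = len(image), len(image[0])
    for r in range(0, height - patch + 1, patch):
        for c in range(0, width - patch + 1, patch):
            pixels = [image[y][x] for y in range(r, r + patch) for x in range(c, c + patch)]
            # Integer grayscale so the entropy is computed over discrete levels.
            gray = [(red + green + blue) // 3 for red, green, blue in pixels]
            if shannon_entropy(gray) >= min_entropy and mean_saturation(pixels) >= min_saturation:
                keep.append((r, c))
    return keep
```

In this sketch a uniformly white patch has zero entropy and zero saturation, so it is rejected, while a patch with varied, coloured content passes both checks. A real pipeline would operate on whole-slide image tiles and would likely use an array library rather than nested lists.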

Pages: 12

References (75 in total):

[1] Automated classification of brain tumor type in whole-slide digital pathology images using local representative tiles
Barker, Jocelyn
Hoogi, Assaf
Depeursinge, Adrien
Rubin, Daniel L.
[J]. MEDICAL IMAGE ANALYSIS, 2016, 30 : 60 - 71
[2] Survival analysis and Cox regression
Benitez-Parejo, N.
Rodriguez del Aguila, M. M.
Perez-Vicente, S.
[J]. ALLERGOLOGIA ET IMMUNOPATHOLOGIA, 2011, 39 (06) : 362 - 373
[3] Gliomas With 1p/19q Codeletion: a.k.a. Oligodendroglioma
Cairncross, Gregory
Jenkins, Robert
[J]. CANCER JOURNAL, 2008, 14 (06) : 352 - 357
[4] Chen RJ, 2021, Arxiv, DOI arXiv:2108.02278
[5] Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images
Chen, Richard J.
Lu, Ming Y.
Weng, Wei-Hung
Chen, Tiffany Y.
Williamson, Drew F. K.
Manz, Trevor
Shady, Maha
Mahmood, Faisal
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3995 - 4005
[6] Chen RJ, 2022, IEEE T MED IMAGING, V41, P757, DOI [10.1109/TMI.2020.3021387, 10.1109/TITS.2020.3030218]
[7] Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review
Dhiman, Paula
Ma, Jie
Navarro, Constanza L. Andaur
Speich, Benjamin
Bullock, Garrett
Damen, Johanna A. A.
Hooft, Lotty
Kirtley, Shona
Riley, Richard D.
Van Calster, Ben
Moons, Karel G. M.
Collins, Gary S.
[J]. BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
[8] Multi-scale Prototypical Transformer for Whole Slide Image Classification
Ding, Saisai
Wang, Jun
Li, Juncheng
Shi, Jun
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 602 - 611
[9] Donglin Di, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12265), P428, DOI 10.1007/978-3-030-59722-1_41
[10] Dou Q, 2018, Arxiv, DOI arXiv:1804.10916
