Pathology-Knowledge Enhanced Multi-instance Prompt Learning for Few-Shot Whole Slide Image Classification

被引:1
|
作者
Qu, Linhao [1 ]
Yang, Dingkang [2 ]
Huang, Dan [3 ]
Guo, Qinhao [4 ]
Luo, Rongkui [5 ]
Zhang, Shaoting [1 ]
Wang, Xiaosong [1 ]
机构
[1] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
[2] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[3] Fudan Univ, Shanghai Canc Ctr, Dept Pathol, Shanghai, Peoples R China
[4] Fudan Univ, Shanghai Canc Ctr, Dept Gynecol Oncol, Shanghai, Peoples R China
[5] Fudan Univ, Zhongshan Hosp, Dept Pathol, Shanghai, Peoples R China
来源
COMPUTER VISION - ECCV 2024, PT XI | 2025年 / 15069卷
基金
国家重点研发计划;
关键词
Pathology image analysis; Prompt learning;
D O I
10.1007/978-3-031-73247-8_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current multi-instance learning algorithms for pathology image analysis often require a substantial number of Whole Slide Images for effective training but exhibit suboptimal performance in scenarios with limited learning data. In clinical settings, restricted access to pathology slides is inevitable due to patient privacy concerns and the prevalence of rare or emerging diseases. The emergence of the Few-shot Weakly Supervised WSI Classification accommodates the significant challenge of the limited slide data and sparse slide-level labels for diagnosis. Prompt learning based on the pre-trained models (e.g., CLIP) appears to be a promising scheme for this setting; however, current research in this area is limited, and existing algorithms often focus solely on patch-level prompts or confine themselves to language prompts. This paper proposes amulti-instance prompt learning framework enhanced with pathology knowledge, i.e., integrating visual and textual prior knowledge into prompts at both patch and slide levels. The training process employs a combination of static and learnable prompts, effectively guiding the activation of pre-trained models and further facilitating the diagnosis of key pathology patterns. Lightweight Messenger (self-attention) and Summary (attention-pooling) layers are introduced to model relationships between patches and slides within the same patient data. Additionally, alignment-wise contrastive losses ensure the feature-level alignment between visual and textual learnable prompts for both patches and slides. Our method demonstrates superior performance in three challenging clinical tasks, significantly outperforming comparative few-shot methods.
引用
收藏
页码:196 / 212
页数:17
相关论文
共 50 条
  • [1] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    Information Sciences, 2022, 611 : 464 - 475
  • [2] Multi-instance attention network for few-shot learning
    Qin, Zhili
    Wang, Han
    Mawuli, Cobbinah Bernard
    Han, Wei
    Zhang, Rui
    Yang, Qinli
    Shao, Junming
    INFORMATION SCIENCES, 2022, 611 : 464 - 475
  • [3] Knowledge-Enhanced Prompt Learning for Few-Shot Text Classification
    Liu, Jinshuo
    Yang, Lu
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (04)
  • [4] MuRCL: Multi-Instance Reinforcement Contrastive Learning for Whole Slide Image Classification
    Zhu, Zhonghang
    Yu, Lequan
    Wu, Wei
    Yu, Rongshan
    Zhang, Defu
    Wang, Liansheng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1337 - 1348
  • [5] Multi-scale multi-instance contrastive learning for whole slide image classification
    Zhang, Jianan
    Hao, Fang
    Liu, Xueyu
    Yao, Shupei
    Wu, Yongfei
    Li, Ming
    Zheng, Wen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [6] Clustering-Based Multi-instance Learning Network for Whole Slide Image Classification
    Wu, Wei
    Zhu, Zhonghang
    Magnier, Baptiste
    Wang, Liansheng
    COMPUTATIONAL MATHEMATICS MODELING IN CANCER ANALYSIS, CMMCA 2022, 2022, 13574 : 100 - 109
  • [7] Second-order multi-instance learning model for whole slide image classification
    Wang, Qian
    Zou, Ying
    Zhang, Jianxin
    Liu, Bin
    PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (14):
  • [8] RMDL: Recalibrated multi-instance deep learning for whole slide gastric image classification
    Wang, Shujun
    Zhu, Yaxi
    Yu, Lequan
    Chen, Hao
    Lin, Huangjing
    Wan, Xiangbo
    Fan, Xinjuan
    Heng, Pheng-Ann
    MEDICAL IMAGE ANALYSIS, 2019, 58
  • [9] Enhanced Prompt Learning for Few-shot Text Classification Method
    Li R.
    Wei Z.
    Fan Y.
    Ye S.
    Zhang G.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2024, 60 (01): : 1 - 12
  • [10] Enhancing whole slide image classification through label denoising in a multi-instance learning framework
    Wang, Rui
    Gu, Yun
    Yang, Jie
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105