Pathology-Knowledge Enhanced Multi-instance Prompt Learning for Few-Shot Whole Slide Image Classification

被引：1

作者：

Qu, Linhao ^{[1
]}

Yang, Dingkang ^{[2
]}

Huang, Dan ^{[3
]}

Guo, Qinhao ^{[4
]}

Luo, Rongkui ^{[5
]}

Zhang, Shaoting ^{[1
]}

Wang, Xiaosong ^{[1
]}

机构：

[1] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

[2] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China

[3] Fudan Univ, Shanghai Canc Ctr, Dept Pathol, Shanghai, Peoples R China

[4] Fudan Univ, Shanghai Canc Ctr, Dept Gynecol Oncol, Shanghai, Peoples R China

[5] Fudan Univ, Zhongshan Hosp, Dept Pathol, Shanghai, Peoples R China

来源：

COMPUTER VISION - ECCV 2024, PT XI | 2025年 / 15069卷

基金：

国家重点研发计划;

关键词：

Pathology image analysis; Prompt learning;

D O I：

10.1007/978-3-031-73247-8_12

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current multi-instance learning algorithms for pathology image analysis often require a substantial number of Whole Slide Images for effective training but exhibit suboptimal performance in scenarios with limited learning data. In clinical settings, restricted access to pathology slides is inevitable due to patient privacy concerns and the prevalence of rare or emerging diseases. The emergence of the Few-shot Weakly Supervised WSI Classification accommodates the significant challenge of the limited slide data and sparse slide-level labels for diagnosis. Prompt learning based on the pre-trained models (e.g., CLIP) appears to be a promising scheme for this setting; however, current research in this area is limited, and existing algorithms often focus solely on patch-level prompts or confine themselves to language prompts. This paper proposes amulti-instance prompt learning framework enhanced with pathology knowledge, i.e., integrating visual and textual prior knowledge into prompts at both patch and slide levels. The training process employs a combination of static and learnable prompts, effectively guiding the activation of pre-trained models and further facilitating the diagnosis of key pathology patterns. Lightweight Messenger (self-attention) and Summary (attention-pooling) layers are introduced to model relationships between patches and slides within the same patient data. Additionally, alignment-wise contrastive losses ensure the feature-level alignment between visual and textual learnable prompts for both patches and slides. Our method demonstrates superior performance in three challenging clinical tasks, significantly outperforming comparative few-shot methods.

引用

页码：196 / 212

页数：17

共 50 条

[1] Multi-instance attention network for few-shot learning
Qin, Zhili
Wang, Han
Mawuli, Cobbinah Bernard
Han, Wei
Zhang, Rui
Yang, Qinli
Shao, Junming
Information Sciences, 2022, 611 : 464 - 475
[2] Multi-instance attention network for few-shot learning
Qin, Zhili
Wang, Han
Mawuli, Cobbinah Bernard
Han, Wei
Zhang, Rui
Yang, Qinli
Shao, Junming
INFORMATION SCIENCES, 2022, 611 : 464 - 475
[3] Knowledge-Enhanced Prompt Learning for Few-Shot Text Classification
Liu, Jinshuo
Yang, Lu
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (04)
[4] MuRCL: Multi-Instance Reinforcement Contrastive Learning for Whole Slide Image Classification
Zhu, Zhonghang
Yu, Lequan
Wu, Wei
Yu, Rongshan
Zhang, Defu
Wang, Liansheng
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1337 - 1348
[5] Multi-scale multi-instance contrastive learning for whole slide image classification
Zhang, Jianan
Hao, Fang
Liu, Xueyu
Yao, Shupei
Wu, Yongfei
Li, Ming
Zheng, Wen
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
[6] Clustering-Based Multi-instance Learning Network for Whole Slide Image Classification
Wu, Wei
Zhu, Zhonghang
Magnier, Baptiste
Wang, Liansheng
COMPUTATIONAL MATHEMATICS MODELING IN CANCER ANALYSIS, CMMCA 2022, 2022, 13574 : 100 - 109
[7] Second-order multi-instance learning model for whole slide image classification
Wang, Qian
Zou, Ying
Zhang, Jianxin
Liu, Bin
PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (14):
[8] RMDL: Recalibrated multi-instance deep learning for whole slide gastric image classification
Wang, Shujun
Zhu, Yaxi
Yu, Lequan
Chen, Hao
Lin, Huangjing
Wan, Xiangbo
Fan, Xinjuan
Heng, Pheng-Ann
MEDICAL IMAGE ANALYSIS, 2019, 58
[9] Enhanced Prompt Learning for Few-shot Text Classification Method
Li R.
Wei Z.
Fan Y.
Ye S.
Zhang G.
Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2024, 60 (01): : 1 - 12
[10] Enhancing whole slide image classification through label denoising in a multi-instance learning framework
Wang, Rui
Gu, Yun
Yang, Jie
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105

← 1 2 3 4 5 →