CoD-MIL: Chain-of-Diagnosis Prompting Multiple Instance Learning for Whole Slide Image Classification

被引：0

作者：

Shi, Jiangbo ^{[1
]}

Li, Chen ^{[1
]}

Gong, Tieliang

Wang, Chunbao ^{[2
]}

Fu, Huazhu ^{[3
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China

[2] Xi An Jiao Tong Univ, Affiliated Hosp 1, Dept Pathol, Xian 710061, Shaanxi, Peoples R China

[3] ASTAR, Inst High Performance Comp IHPC, Singapore 138632, Singapore

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2025年 / 44卷 / 03期

基金：

新加坡国家研究基金会;

关键词：

Pathology; Tumors; Feature extraction; Visualization; Image classification; Training; Electronic mail; Cognition; Cancer; Hospitals; Histopathology; whole slide image analysis; multiple instance learning; vision language model; TRANSFORMER;

D O I：

10.1109/TMI.2024.3485120

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Multiple instance learning (MIL) has emerged as a prominent paradigm for processing the whole slide image with pyramid structure and giga-pixel size in digital pathology. However, existing attention-based MIL methods are primarily trained on the image modality and a pre-defined label set, leading to limited generalization and interpretability. Recently, vision language models (VLM) have achieved promising performance and transferability, offering potential solutions to the limitations of MIL-based methods. Pathological diagnosis is an intricate process that requires pathologists to examine the WSI step-by-step. In the field of natural language process, the chain-of-thought (CoT) prompting method is widely utilized to imitate the human reasoning process. Inspired by the CoT prompt and pathologists' clinic knowledge, we propose a chain-of-diagnosis prompting multiple instance learning (CoD-MIL) framework for whole slide image classification. Specifically, the chain-of-diagnosis text prompt decomposes the complex diagnostic process in WSI into progressive sub-processes from low to high magnification. Additionally, we propose a text-guided contrastive masking module to accurately localize the tumor region by masking the most discriminative instances and introducing the guidance of normal tissue texts in a contrastive way. Extensive experiments conducted on three real-world subtyping datasets demonstrate the effectiveness and superiority of CoD-MIL.

引用

页码：1218 / 1229

页数：12

共 50 条

[31] ReMix: A General and Efficient Framework for Multiple Instance Learning Based Whole Slide Image Classification
Yang, Jiawei
Chen, Hanbo
Zhao, Yu
Yang, Fan
Zhang, Yao
He, Lei
Yao, Jianhua
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 35 - 45
[32] CAMIL: channel attention-based multiple instance learning for whole slide image classification
Mao, Jinyang
Xu, Junlin
Tang, Xianfang
Liu, Yongjin
Zhao, Heaven
Tian, Geng
Yang, Jialiang
BIOINFORMATICS, 2025, 41 (02)
[33] A universal multiple instance learning framework for whole slide image analysis
Zhang X.
Liu C.
Zhu H.
Wang T.
Du Z.
Ding W.
Computers in Biology and Medicine, 2024, 178
[34] Self-supervised comparative learning based improved multiple instance learning for whole slide image classification
Yao, Luhan
Wang, Hongyu
Hao, Yingguang
PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 1353 - 1357
[35] H2-MIL: Exploring Hierarchical Representation with Heterogeneous Multiple Instance Learning for Whole Slide Image Analysis
Hou, Wentai
Yu, Lequan
Lin, Chengxuan
Huang, Helong
Yu, Rongshan
Qin, Jing
Wang, Liansheng
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 933 - 941
[36] Dual-Attention Multiple Instance Learning Framework for Pathology Whole-Slide Image Classification
Liu, Dehua
Li, Chengming
Hu, Xiping
Hu, Bin
ELECTRONICS, 2024, 13 (22)
[37] ProMIL: A weakly supervised multiple instance learning for whole slide image classification based on class proxy
Li, Xiaoyu
Yang, Bei
Chen, Tiandong
Gao, Zheng
Huang, Mengjie
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
[38] Pseudo-label attention-based multiple instance learning for whole slide image classification
He, Jing
Wang, Ping
Cai, Jingwen
Tang, Dan
Yao, Shaowen
Liu, Renyang
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
[39] SparseConvMIL: Sparse Convolutional Context-Aware Multiple Instance Learning for Whole Slide Image Classification
Lerousseau, Marvin
Vakalopoulou, Maria
Deutsch, Eric
Paragios, Nikos
MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 129 - 139
[40] Multiple instance learning-based two-stage metric learning network for whole slide image classification
Li, Xiaoyu
Yang, Bei
Chen, Tiandong
Gao, Zheng
Li, Huijie
VISUAL COMPUTER, 2024, 40 (08): : 5717 - 5732

← 1 2 3 4 5 →