CoD-MIL: Chain-of-Diagnosis Prompting Multiple Instance Learning for Whole Slide Image Classification

被引：0

作者：

Shi, Jiangbo ^{[1
]}

Li, Chen ^{[1
]}

Gong, Tieliang

Wang, Chunbao ^{[2
]}

Fu, Huazhu ^{[3
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China

[2] Xi An Jiao Tong Univ, Affiliated Hosp 1, Dept Pathol, Xian 710061, Shaanxi, Peoples R China

[3] ASTAR, Inst High Performance Comp IHPC, Singapore 138632, Singapore

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2025年 / 44卷 / 03期

基金：

新加坡国家研究基金会;

关键词：

Pathology; Tumors; Feature extraction; Visualization; Image classification; Training; Electronic mail; Cognition; Cancer; Hospitals; Histopathology; whole slide image analysis; multiple instance learning; vision language model; TRANSFORMER;

D O I：

10.1109/TMI.2024.3485120

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Multiple instance learning (MIL) has emerged as a prominent paradigm for processing the whole slide image with pyramid structure and giga-pixel size in digital pathology. However, existing attention-based MIL methods are primarily trained on the image modality and a pre-defined label set, leading to limited generalization and interpretability. Recently, vision language models (VLM) have achieved promising performance and transferability, offering potential solutions to the limitations of MIL-based methods. Pathological diagnosis is an intricate process that requires pathologists to examine the WSI step-by-step. In the field of natural language process, the chain-of-thought (CoT) prompting method is widely utilized to imitate the human reasoning process. Inspired by the CoT prompt and pathologists' clinic knowledge, we propose a chain-of-diagnosis prompting multiple instance learning (CoD-MIL) framework for whole slide image classification. Specifically, the chain-of-diagnosis text prompt decomposes the complex diagnostic process in WSI into progressive sub-processes from low to high magnification. Additionally, we propose a text-guided contrastive masking module to accurately localize the tumor region by masking the most discriminative instances and introducing the guidance of normal tissue texts in a contrastive way. Extensive experiments conducted on three real-world subtyping datasets demonstrate the effectiveness and superiority of CoD-MIL.

引用

页码：1218 / 1229

页数：12

共 50 条

[21] Iteratively Coupled Multiple Instance Learning from Instance to Bag Classifier for Whole Slide Image Classification
Wang, Hongyi
Luo, Luyang
Wang, Fang
Tong, Ruofeng
Chen, Yen-Wei
Hu, Hongjie
Lin, Lanfen
Chen, Hao
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 467 - 476
[22] TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification
Yang, Litao
Mehta, Deval
Liu, Sidong
Mahapatra, Dwarikanath
Di Ieva, Antonio
Ge, Zongyuan
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1655 - 1665
[23] TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification
Shao, Zhuchen
Bian, Hao
Chen, Yang
Wang, Yifeng
Zhang, Jian
Ji, Xiangyang
Zhang, Yongbing
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
[24] PHIM-MIL: Multiple instance learning with prototype similarity-guided feature fusion and hard instance mining for whole slide image classification
Xie, Yining
Liu, Zequn
Zhao, Jing
Ma, Jiayi
INFORMATION FUSION, 2025, 117
[25] ProtoMIL: Multiple Instance Learning with Prototypical Parts for Whole-Slide Image Classification
Rymarczyk, Dawid
Pardyl, Adam
Kraus, Jaroslaw
Kaczynska, Aneta
Skomorowski, Marek
Zielinski, Bartosz
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 13713 : 421 - 436
[26] Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier Is All You Need
Qu, Linhao
Ma, Yingfan
Luo, Xiaoyuan
Guo, Qinhao
Wang, Manning
Song, Zhijian
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9732 - 9744
[27] LNPL-MIL: Learning from Noisy Pseudo Labels for Promoting Multiple Instance Learning in Whole Slide Image
Shao, Zhuchen
Wang, Yifeng
Chen, Yang
Bian, Hao
Liu, Shaohui
Wang, Haoqian
Zhang, Yongbing
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21438 - 21448
[28] Feature Re-calibration Based Multiple Instance Learning for Whole Slide Image Classification
Chikontwe, Philip
Nam, Soo Jeong
Go, Heounjeong
Kim, Meejeong
Sung, Hyun Jung
Park, Sang Hyun
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 420 - 430
[29] PAMIL: Prototype Attention-Based Multiple Instance Learning for Whole Slide Image Classification
Liu, Jiashuai
Mao, Anyu
Niu, Yi
Zhang, Xianli
Gong, Tieliang
Li, Chen
Gao, Zeyu
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IV, 2024, 15004 : 362 - 372
[30] Registration-enhanced multiple instance learning for cervical cancer whole slide image classification
He, Qiming
Wang, Chengjiang
Zeng, Siqi
Liang, Zhendong
Duan, Hufei
Yang, Jingying
Pan, Feiyang
He, Yonghong
Huang, Wenting
Guan, Tian
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)

← 1 2 3 4 5 →