CoD-MIL: Chain-of-Diagnosis Prompting Multiple Instance Learning for Whole Slide Image Classification

被引:0
|
作者
Shi, Jiangbo [1 ]
Li, Chen [1 ]
Gong, Tieliang
Wang, Chunbao [2 ]
Fu, Huazhu [3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Affiliated Hosp 1, Dept Pathol, Xian 710061, Shaanxi, Peoples R China
[3] ASTAR, Inst High Performance Comp IHPC, Singapore 138632, Singapore
基金
新加坡国家研究基金会;
关键词
Pathology; Tumors; Feature extraction; Visualization; Image classification; Training; Electronic mail; Cognition; Cancer; Hospitals; Histopathology; whole slide image analysis; multiple instance learning; vision language model; TRANSFORMER;
D O I
10.1109/TMI.2024.3485120
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multiple instance learning (MIL) has emerged as a prominent paradigm for processing the whole slide image with pyramid structure and giga-pixel size in digital pathology. However, existing attention-based MIL methods are primarily trained on the image modality and a pre-defined label set, leading to limited generalization and interpretability. Recently, vision language models (VLM) have achieved promising performance and transferability, offering potential solutions to the limitations of MIL-based methods. Pathological diagnosis is an intricate process that requires pathologists to examine the WSI step-by-step. In the field of natural language process, the chain-of-thought (CoT) prompting method is widely utilized to imitate the human reasoning process. Inspired by the CoT prompt and pathologists' clinic knowledge, we propose a chain-of-diagnosis prompting multiple instance learning (CoD-MIL) framework for whole slide image classification. Specifically, the chain-of-diagnosis text prompt decomposes the complex diagnostic process in WSI into progressive sub-processes from low to high magnification. Additionally, we propose a text-guided contrastive masking module to accurately localize the tumor region by masking the most discriminative instances and introducing the guidance of normal tissue texts in a contrastive way. Extensive experiments conducted on three real-world subtyping datasets demonstrate the effectiveness and superiority of CoD-MIL.
引用
收藏
页码:1218 / 1229
页数:12
相关论文
共 50 条
  • [21] Iteratively Coupled Multiple Instance Learning from Instance to Bag Classifier for Whole Slide Image Classification
    Wang, Hongyi
    Luo, Luyang
    Wang, Fang
    Tong, Ruofeng
    Chen, Yen-Wei
    Hu, Hongjie
    Lin, Lanfen
    Chen, Hao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 467 - 476
  • [22] TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification
    Yang, Litao
    Mehta, Deval
    Liu, Sidong
    Mahapatra, Dwarikanath
    Di Ieva, Antonio
    Ge, Zongyuan
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1655 - 1665
  • [23] TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification
    Shao, Zhuchen
    Bian, Hao
    Chen, Yang
    Wang, Yifeng
    Zhang, Jian
    Ji, Xiangyang
    Zhang, Yongbing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [24] PHIM-MIL: Multiple instance learning with prototype similarity-guided feature fusion and hard instance mining for whole slide image classification
    Xie, Yining
    Liu, Zequn
    Zhao, Jing
    Ma, Jiayi
    INFORMATION FUSION, 2025, 117
  • [25] ProtoMIL: Multiple Instance Learning with Prototypical Parts for Whole-Slide Image Classification
    Rymarczyk, Dawid
    Pardyl, Adam
    Kraus, Jaroslaw
    Kaczynska, Aneta
    Skomorowski, Marek
    Zielinski, Bartosz
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 13713 : 421 - 436
  • [26] Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier Is All You Need
    Qu, Linhao
    Ma, Yingfan
    Luo, Xiaoyuan
    Guo, Qinhao
    Wang, Manning
    Song, Zhijian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9732 - 9744
  • [27] LNPL-MIL: Learning from Noisy Pseudo Labels for Promoting Multiple Instance Learning in Whole Slide Image
    Shao, Zhuchen
    Wang, Yifeng
    Chen, Yang
    Bian, Hao
    Liu, Shaohui
    Wang, Haoqian
    Zhang, Yongbing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21438 - 21448
  • [28] Feature Re-calibration Based Multiple Instance Learning for Whole Slide Image Classification
    Chikontwe, Philip
    Nam, Soo Jeong
    Go, Heounjeong
    Kim, Meejeong
    Sung, Hyun Jung
    Park, Sang Hyun
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 420 - 430
  • [29] PAMIL: Prototype Attention-Based Multiple Instance Learning for Whole Slide Image Classification
    Liu, Jiashuai
    Mao, Anyu
    Niu, Yi
    Zhang, Xianli
    Gong, Tieliang
    Li, Chen
    Gao, Zeyu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IV, 2024, 15004 : 362 - 372
  • [30] Registration-enhanced multiple instance learning for cervical cancer whole slide image classification
    He, Qiming
    Wang, Chengjiang
    Zeng, Siqi
    Liang, Zhendong
    Duan, Hufei
    Yang, Jingying
    Pan, Feiyang
    He, Yonghong
    Huang, Wenting
    Guan, Tian
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)