CoD-MIL: Chain-of-Diagnosis Prompting Multiple Instance Learning for Whole Slide Image Classification

Authors
Shi, Jiangbo [1 ]
Li, Chen [1 ]
Gong, Tieliang
Wang, Chunbao [2 ]
Fu, Huazhu [3 ]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Affiliated Hosp 1, Dept Pathol, Xian 710061, Shaanxi, Peoples R China
[3] ASTAR, Inst High Performance Comp IHPC, Singapore 138632, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Pathology; Tumors; Feature extraction; Visualization; Image classification; Training; Electronic mail; Cognition; Cancer; Hospitals; Histopathology; whole slide image analysis; multiple instance learning; vision language model; TRANSFORMER;
DOI
10.1109/TMI.2024.3485120
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Multiple instance learning (MIL) has emerged as a prominent paradigm for processing whole slide images (WSIs), which have a pyramid structure and gigapixel size, in digital pathology. However, existing attention-based MIL methods are primarily trained on the image modality and a pre-defined label set, leading to limited generalization and interpretability. Recently, vision language models (VLMs) have achieved promising performance and transferability, offering potential solutions to the limitations of MIL-based methods. Pathological diagnosis is an intricate process that requires pathologists to examine the WSI step by step. In the field of natural language processing, the chain-of-thought (CoT) prompting method is widely used to imitate the human reasoning process. Inspired by CoT prompting and pathologists' clinical knowledge, we propose a chain-of-diagnosis prompting multiple instance learning (CoD-MIL) framework for whole slide image classification. Specifically, the chain-of-diagnosis text prompt decomposes the complex diagnostic process for a WSI into progressive sub-processes from low to high magnification. Additionally, we propose a text-guided contrastive masking module that localizes the tumor region accurately by masking the most discriminative instances and introducing the guidance of normal tissue texts in a contrastive way. Extensive experiments conducted on three real-world subtyping datasets demonstrate the effectiveness and superiority of CoD-MIL.
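The abstract mentions attention-based MIL and the masking of the most discriminative instances. The following is a minimal, generic sketch of those two ideas only; it is not the CoD-MIL implementation. The function names (`attention_pool`, `mask_top_k`), the linear scoring vector `w`, and the choice of NumPy are assumptions made for illustration, and the text guidance and contrastive terms described in the abstract are not modeled.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax; entries equal to -inf receive weight 0."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def attention_pool(features, w, mask=None):
    """Generic attention-based MIL pooling (illustrative, not the paper's model).

    features: (n_instances, dim) array of patch embeddings for one slide (bag).
    w:        (dim,) hypothetical linear scoring vector.
    mask:     optional boolean array; True entries are excluded from pooling.
    Returns the bag embedding and the per-instance attention weights.
    """
    scores = features @ w
    if mask is not None:
        # Masked instances get -inf so that their softmax weight is exactly 0.
        scores = np.where(mask, -np.inf, scores)
    attn = softmax(scores)
    return attn @ features, attn


def mask_top_k(attn, k):
    """Boolean mask that marks the k instances with the highest attention."""
    mask = np.zeros(attn.shape, dtype=bool)
    mask[np.argsort(attn)[-k:]] = True
    return mask
```

One way to use this sketch: pool once, mask the top-k instances, then pool again so that the second pass has to rely on less dominant instances. How the masked pass feeds into a training objective is left open here, because the abstract does not specify it.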
Pages: 1218-1229
Page count: 12