DT-MIL: Deformable Transformer for Multi-instance Learning on Histopathological Image

被引:66
|
作者
Li, Hang [1 ,2 ]
Yang, Fan [2 ]
Zhao, Yu [2 ]
Xing, Xiaohan [2 ,3 ]
Zhang, Jun [2 ]
Gao, Mingxuan [1 ,2 ]
Huang, Junzhou [2 ]
Wang, Liansheng [1 ]
Yao, Jianhua [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[2] Tencent, AI Lab, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII | 2021年 / 12908卷
基金
国家重点研发计划;
关键词
Deformable transformer; Multi-instance learning; Key-value attention; Histopathological image analysis; CANCER;
D O I
10.1007/978-3-030-87237-3_20
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Learning informative representations is crucial for classification and prediction tasks on histopathological images. Due to the huge image size, whole-slide histopathological image analysis is normally addressed with multi-instance learning (MIL) scheme. However, the weakly supervised nature of MIL leads to the challenge of learning an effective whole-slide-level representation. To tackle this issue, we present a novel embedded-space MIL model based on deformable transformer (DT) architecture and convolutional layers, which is termed DT-MIL. The DT architecture enables our MIL model to update each instance feature by globally aggregating instance features in a bag simultaneously and encoding the position context information of instances during bag representation learning. Compared with other state-of-the-art MIL models, our model has the following advantages: (1) generating the bag representation in a fully trainable way, (2) representing the bag with a high-level and nonlinear combination of all instances instead of fixed pooling-based methods (e.g. max pooling and average pooling) or simply attention-based linear aggregation, and (3) encoding the position relationship and context information during bag embedding phase. Besides our proposed DT-MIL, we also develop other possible transformer-based MILs for comparison. Extensive experiments show that our DT-MIL outperforms the state-of-the-art methods and other transformer-based MIL architectures in histopathological image classification and prediction tasks. An open-source implementation of our approach can be found at https://github.com/yfzon/DT-MIL.
引用
收藏
页码:206 / 216
页数:11
相关论文
共 50 条
  • [1] MIST: Multi-instance selective transformer for histopathological subtype prediction
    Zhao, Rongchang
    Xi, Zijun
    Liu, Huanchi
    Jian, Xiangkun
    Zhang, Jian
    Zhang, Zijian
    Li, Shuo
    MEDICAL IMAGE ANALYSIS, 2024, 97
  • [2] Automated skin biopsy histopathological image annotation using multi-instance representation and learning
    Gang Zhang
    Jian Yin
    Ziping Li
    Xiangyang Su
    Guozheng Li
    Honglai Zhang
    BMC Medical Genomics, 6
  • [3] Automated skin biopsy histopathological image annotation using multi-instance representation and learning
    Zhang, Gang
    Yin, Jian
    Li, Ziping
    Su, Xiangyang
    Li, Guozheng
    Zhang, Honglai
    BMC MEDICAL GENOMICS, 2013, 6
  • [4] Multi-instance Learning based on Instance Consistency for Image Retrieval
    Zhang, Miao
    Wu, Zhize
    Wan, Shouhong
    Yue, Lihua
    Yin, Bangjie
    NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [5] PT-MIL:Parallel transformer based on multi-instance learning for osteoporosis detection in panoramic oral radiography
    黄欣然
    YANG Hongjie
    CHEN Hu
    ZHANG Yi
    廖培希
    中国体视学与图像分析, 2023, 28 (04) : 410 - 418
  • [6] A multi-instance learning based approach to image retrieval
    Dai, Hong-Bin
    Zhang, Min-Ling
    Zhou, Zhi-Hua
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2006, 19 (02): : 179 - 185
  • [7] Deep Multi-Instance Multi-Label Learning for Image Annotation
    Guo, Hai-Feng
    Han, Lixin
    Su, Shoubao
    Sun, Zhou-Bao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (03)
  • [8] HLFSRNN-MIL: A Hybrid Multi-Instance Learning Model for 3D CT Image Classification
    Chen, Huilong
    Zhang, Xiaoxia
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [9] Joint multi-label multi-instance learning for image classification
    Zha, Zheng-Jun
    Hua, Xian-Sheng
    Mei, Tao
    Wang, Jingdong
    Qi, Guo-Jun
    Wang, Zengfu
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 333 - +
  • [10] Image classification with multi-view multi-instance metric learning
    Tang, Jingjing
    Li, Dewei
    Tian, Yingjie
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189