DT-MIL: Deformable Transformer for Multi-instance Learning on Histopathological Image

Cited by: 66

Authors:

Li, Hang [1,2]

Yang, Fan [2]

Zhao, Yu [2]

Xing, Xiaohan [2,3]

Zhang, Jun [2]

Gao, Mingxuan [1,2]

Huang, Junzhou [2]

Wang, Liansheng [1]

Yao, Jianhua [2]

Affiliations:

[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China

[2] Tencent, AI Lab, Shenzhen, Peoples R China

[3] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

Source:

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII | 2021 / Vol. 12908

Funding:

National Key Research and Development Program of China;

Keywords:

Deformable transformer; Multi-instance learning; Key-value attention; Histopathological image analysis; CANCER;

DOI:

10.1007/978-3-030-87237-3_20

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Learning informative representations is crucial for classification and prediction tasks on histopathological images. Because of the huge image size, whole-slide histopathological image analysis is normally addressed with a multi-instance learning (MIL) scheme. However, the weakly supervised nature of MIL makes it challenging to learn an effective whole-slide-level representation. To tackle this issue, we present a novel embedded-space MIL model based on a deformable transformer (DT) architecture and convolutional layers, termed DT-MIL. The DT architecture enables our MIL model to update each instance feature by globally aggregating the instance features in a bag simultaneously, and to encode the position context information of instances during bag representation learning. Compared with other state-of-the-art MIL models, our model has the following advantages: (1) it generates the bag representation in a fully trainable way, (2) it represents the bag with a high-level, nonlinear combination of all instances instead of fixed pooling-based methods (e.g., max pooling and average pooling) or simple attention-based linear aggregation, and (3) it encodes the position relationship and context information during the bag embedding phase. Besides our proposed DT-MIL, we also develop other possible transformer-based MILs for comparison. Extensive experiments show that our DT-MIL outperforms the state-of-the-art methods and other transformer-based MIL architectures in histopathological image classification and prediction tasks. An open-source implementation of our approach can be found at https://github.com/yfzon/DT-MIL.
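
For orientation, the baseline bag-aggregation strategies that the abstract contrasts DT-MIL against (max pooling, average pooling, and attention-based linear aggregation) can be sketched in a few lines of plain Python. This is a minimal illustration only, not the DT-MIL model or the authors' code: the toy bag, the function names, and the scoring vector `w` are hypothetical, and in practice `w` would be learned rather than fixed.

```python
import math


def max_pool(bag):
    """Fixed pooling: element-wise maximum over all instance vectors."""
    return [max(col) for col in zip(*bag)]


def mean_pool(bag):
    """Fixed pooling: element-wise average over all instance vectors."""
    return [sum(col) / len(bag) for col in zip(*bag)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(bag, w):
    """Attention-based linear aggregation (simplified).

    Each instance h_k gets a score w . h_k; the bag representation is the
    softmax-weighted sum of the instances. `w` is a hypothetical scoring
    vector standing in for learned attention parameters.
    """
    scores = [sum(wi * hi for wi, hi in zip(w, h)) for h in bag]
    weights = softmax(scores)
    dim = len(bag[0])
    pooled = [sum(a * h[d] for a, h in zip(weights, bag)) for d in range(dim)]
    return pooled, weights


# A toy "bag" of three 2-dimensional instance features (assumed values).
bag = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]

print(max_pool(bag))
print(mean_pool(bag))
pooled, weights = attention_pool(bag, w=[0.5, -0.5])
print(pooled, weights)
```

All three baselines produce the bag vector as a fixed or linear combination of instance features, and none of them uses instance positions. According to the abstract, DT-MIL instead builds a nonlinear combination and encodes position context.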


Pages: 206-216

Page count: 11

50 items in total

[1] MIST: Multi-instance selective transformer for histopathological subtype prediction
Zhao, Rongchang
Xi, Zijun
Liu, Huanchi
Jian, Xiangkun
Zhang, Jian
Zhang, Zijian
Li, Shuo
MEDICAL IMAGE ANALYSIS, 2024, 97
[2] Automated skin biopsy histopathological image annotation using multi-instance representation and learning
Gang Zhang
Jian Yin
Ziping Li
Xiangyang Su
Guozheng Li
Honglai Zhang
BMC Medical Genomics, 6
[3] Automated skin biopsy histopathological image annotation using multi-instance representation and learning
Zhang, Gang
Yin, Jian
Li, Ziping
Su, Xiangyang
Li, Guozheng
Zhang, Honglai
BMC MEDICAL GENOMICS, 2013, 6
[4] Multi-instance Learning based on Instance Consistency for Image Retrieval
Zhang, Miao
Wu, Zhize
Wan, Shouhong
Yue, Lihua
Yin, Bangjie
NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
[5] PT-MIL: Parallel transformer based on multi-instance learning for osteoporosis detection in panoramic oral radiography
黄欣然
YANG Hongjie
CHEN Hu
ZHANG Yi
廖培希
Chinese Journal of Stereology and Image Analysis, 2023, 28 (04) : 410 - 418
[6] A multi-instance learning based approach to image retrieval
Dai, Hong-Bin
Zhang, Min-Ling
Zhou, Zhi-Hua
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2006, 19 (02) : 179 - 185
[7] Deep Multi-Instance Multi-Label Learning for Image Annotation
Guo, Hai-Feng
Han, Lixin
Su, Shoubao
Sun, Zhou-Bao
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (03)
[8] HLFSRNN-MIL: A Hybrid Multi-Instance Learning Model for 3D CT Image Classification
Chen, Huilong
Zhang, Xiaoxia
APPLIED SCIENCES-BASEL, 2024, 14 (14)
[9] Joint multi-label multi-instance learning for image classification
Zha, Zheng-Jun
Hua, Xian-Sheng
Mei, Tao
Wang, Jingdong
Qi, Guo-Jun
Wang, Zengfu
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008 : 333 - +
[10] Image classification with multi-view multi-instance metric learning
Tang, Jingjing
Li, Dewei
Tian, Yingjie
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
