Convolution-Free Medical Image Segmentation Using Transformers

被引:74
作者
Karimi, Davood [1 ]
Vasylechko, Serge Didenko
Gholipour, Ali
机构
[1] Boston Childrens Hosp, Dept Radiol, Computat Radiol Lab CRL, Boston, MA 02115 USA
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I | 2021年 / 12901卷
基金
美国国家卫生研究院;
关键词
Segmentation; Deep learning; Transformers; Attention;
D O I
10.1007/978-3-030-87193-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Like other applications in computer vision, medical image segmentation and his email address have been most successfully addressed using deep learning models that rely on the convolution operation as their main building block. Convolutions enjoy important properties such as sparse interactions, weight sharing, and translation equivariance. These properties give convolutional neural networks (CNNs) a strong and useful inductive bias for vision tasks. However, the convolution operation also has important shortcomings: it performs a fixed operation on every test image regardless of the content and it cannot efficiently model long-range interactions. In this work we show that a network based on self-attention between neighboring patches and without any convolution operations can achieve better results. Given a 3D image block, our network divides it into n(3) 3D patches, where n = 3 or 5 and computes a 1D embedding for each patch. The network predicts the segmentation map for the center patch of the block based on the self-attention between these patch embeddings. We show that the proposed model can achieve higher segmentation accuracies than a state of the art CNN. For scenarios with very few labeled images, we propose methods for pre-training the network on large corpora of unlabeled images. Our experiments show that with pre-training the advantage of our proposed network over CNNs can be significant when labeled training data is small.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [1] TransDeepLab: Convolution-Free Transformer-Based DeepLab v3+for Medical Image Segmentation
    Azad, Reza
    Heidari, Moein
    Shariatnia, Moein
    Aghdam, Ehsan Khodapanah
    Karimijafarbigloo, Sanaz
    Adeli, Ehsan
    Merhof, Dorit
    PREDICTIVE INTELLIGENCE IN MEDICINE (PRIME 2022), 2022, 13564 : 91 - 102
  • [2] 3D Medical image segmentation using parallel transformers
    Yan, Qingsen
    Liu, Shengqiang
    Xu, Songhua
    Dong, Caixia
    Li, Zongfang
    Shi, Javen Qinfeng
    Zhang, Yanning
    Dai, Duwei
    PATTERN RECOGNITION, 2023, 138
  • [3] Transformers in medical image segmentation: a narrative review
    Khan, Rabeea Fatma
    Lee, Byoung-Dai
    Lee, Mu Sook
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2023, 13 (12) : 8747 - 8767
  • [4] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [5] INTERACTIVE IMAGE SEGMENTATION WITH TRANSFORMERS
    Faizov, Boris
    Shakhuro, Vlad
    Konushin, Anton
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1171 - 1175
  • [6] TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation
    Zhang, Yundong
    Liu, Huiye
    Hu, Qiang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 14 - 24
  • [7] SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers
    Zhou, Yunlai
    Wang, Bing
    Yang, Jie
    Yang, Ying
    Tian, Xuedong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
  • [8] LiteTrans: Reconstruct Transformer with Convolution for Medical Image Segmentation
    Xu, Shuying
    Quan, Hongyan
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 300 - 313
  • [9] TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers
    Liu, Di
    Gao, Yunhe
    Zhangli, Qilong
    Han, Ligong
    He, Xiaoxiao
    Xia, Zhaoyang
    Wen, Song
    Chang, Qi
    Yan, Zhennan
    Zhou, Mu
    Metaxas, Dimitris
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 485 - 495
  • [10] HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
    Heidari, Moein
    Kazerouni, Amirhossein
    Soltany, Milad
    Azad, Reza
    Aghdam, Ehsan Khodapanah
    Cohen-Adad, Julien
    Merhof, Dorit
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6191 - 6201