Convolution-Free Medical Image Segmentation Using Transformers

被引:74
作者
Karimi, Davood [1 ]
Vasylechko, Serge Didenko
Gholipour, Ali
机构
[1] Boston Childrens Hosp, Dept Radiol, Computat Radiol Lab CRL, Boston, MA 02115 USA
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I | 2021年 / 12901卷
基金
美国国家卫生研究院;
关键词
Segmentation; Deep learning; Transformers; Attention;
D O I
10.1007/978-3-030-87193-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Like other applications in computer vision, medical image segmentation and his email address have been most successfully addressed using deep learning models that rely on the convolution operation as their main building block. Convolutions enjoy important properties such as sparse interactions, weight sharing, and translation equivariance. These properties give convolutional neural networks (CNNs) a strong and useful inductive bias for vision tasks. However, the convolution operation also has important shortcomings: it performs a fixed operation on every test image regardless of the content and it cannot efficiently model long-range interactions. In this work we show that a network based on self-attention between neighboring patches and without any convolution operations can achieve better results. Given a 3D image block, our network divides it into n(3) 3D patches, where n = 3 or 5 and computes a 1D embedding for each patch. The network predicts the segmentation map for the center patch of the block based on the self-attention between these patch embeddings. We show that the proposed model can achieve higher segmentation accuracies than a state of the art CNN. For scenarios with very few labeled images, we propose methods for pre-training the network on large corpora of unlabeled images. Our experiments show that with pre-training the advantage of our proposed network over CNNs can be significant when labeled training data is small.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [31] Boosting medical image segmentation via conditional-synergistic convolution and lesion decoupling
    Yang, Huakun
    Chen, Qian
    Fu, Keren
    Zhu, Lei
    Jin, Lujia
    Qiu, Bensheng
    Ren, Qiushi
    Du, Hongwei
    Lu, Yanye
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2022, 101
  • [32] Advances in medical image analysis with vision Transformers: A review
    Azad, Reza
    Kazerouni, Amirhossein
    Heidari, Moein
    Aghdam, Ehsan Khodapanah
    Molaei, Amirali
    Jia, Yiwei
    Jose, Abin
    Roy, Rijo
    Merhof, Dorit
    MEDICAL IMAGE ANALYSIS, 2024, 91
  • [33] A new method for segmentation of medical image using convolutional neural network
    Luo, Fugui
    Qin, Yunchu
    Li, Mingzhen
    Song, Qian
    JOURNAL OF OPTICS-INDIA, 2024, 53 (04): : 3411 - 3420
  • [34] Medical Ultrasound Image Segmentation Using U-Net Architecture
    Shereena, V. B.
    Raju, G.
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 361 - 372
  • [35] Medical Image Segmentation Review: The Success of U-Net
    Azad, Reza
    Aghdam, Ehsan Khodapanah
    Rauland, Amelie
    Jia, Yiwei
    Avval, Atlas Haddadi
    Bozorgpour, Afshin
    Karimijafarbigloo, Sanaz
    Cohen, Joseph Paul
    Adeli, Ehsan
    Merhof, Dorit
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10076 - 10095
  • [36] Medical Image Segmentation on Attention-based Gaussian Blurring
    Mao, Yue-Tian
    Liu, Qing-Song
    Fang, Yin-Feng
    Jiang, Guo-Zhang
    Jiang, Du
    SENSORS AND MATERIALS, 2024, 36 (11) : 4683 - 4694
  • [37] Advantages of transformer and its application for medical image segmentation: a survey
    Pu, Qiumei
    Xi, Zuoxin
    Yin, Shuai
    Zhao, Zhe
    Zhao, Lina
    BIOMEDICAL ENGINEERING ONLINE, 2024, 23 (01)
  • [38] TU-Net: U-shaped Structure Based on Transformers for Medical Image Segmentation
    Zhao, Jiamei
    Wu, Dikang
    Wang, Zhifang
    DATA SCIENCE (ICPCSEE 2022), PT I, 2022, 1628 : 376 - 386
  • [39] ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation
    Lin, Xian
    Yan, Zengqiang
    Deng, Xianbo
    Zheng, Chuansheng
    Yu, Li
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 642 - 651
  • [40] Medical Image Segmentation and Localization using Deformable Templates
    Spiller, J. M.
    Marwala, T.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2006, VOL 14, PTS 1-6, 2007, 14 : 2292 - +