D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引:0
作者
Wu, Yixuan [1 ]
Liao, Kuanlun [2 ]
Chen, Jintai [2 ]
Wang, Jinhong [2 ]
Chen, Danny Z. [3 ]
Gao, Honghao [4 ,5 ]
Wu, Jian [6 ,7 ]
机构
[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea
[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China
[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;
D O I
10.1007/s00521-022-07859-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.
引用
收藏
页码:1931 / 1944
页数:14
相关论文
共 50 条
  • [21] 3D U2-Net: A 3D Universal U-Net for Multi-domain Medical Image Segmentation
    Huang, Chao
    Han, Hu
    Yao, Qingsong
    Zhu, Shankuan
    Zhou, S. Kevin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 291 - 299
  • [22] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565
  • [23] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387
  • [24] RockFormer: A U-Shaped Transformer Network for Martian Rock Segmentation
    Liu, Haiqiang
    Yao, Meibao
    Xiao, Xueming
    Xiong, Yonggang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [25] TU-Net: U-shaped Structure Based on Transformers for Medical Image Segmentation
    Zhao, Jiamei
    Wu, Dikang
    Wang, Zhifang
    DATA SCIENCE (ICPCSEE 2022), PT I, 2022, 1628 : 376 - 386
  • [26] SHIP SEGMENTATION ON HIGH-RESOLUTION SAR IMAGE BY A 3D DILATED MULTISCALE U-NET
    Li, Jichao
    Guo, Chubing
    Gou, Shuiping
    Chen, Yuanbo
    Wang, Miao
    Chen, Jia-Wei
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2575 - 2578
  • [27] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    He, Jianfei
    Xu, Canhui
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554
  • [28] TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation
    Li, Zheng
    Zhang, Jinhui
    Wei, Siyi
    Gao, Yueyang
    Cao, Chengwei
    Wu, Zhiwei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6803 - 6814
  • [29] Medical image segmentation using 3D MRI data
    Voronin, V.
    Marchuk, V.
    Semenishchev, E.
    Cen, Yigang
    Agaian, S.
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2017, 2017, 10221
  • [30] 3D Medical image segmentation using parallel transformers
    Yan, Qingsen
    Liu, Shengqiang
    Xu, Songhua
    Dong, Caixia
    Li, Zongfang
    Shi, Javen Qinfeng
    Zhang, Yanning
    Dai, Duwei
    PATTERN RECOGNITION, 2023, 138