D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引:0
作者
Wu, Yixuan [1 ]
Liao, Kuanlun [2 ]
Chen, Jintai [2 ]
Wang, Jinhong [2 ]
Chen, Danny Z. [3 ]
Gao, Honghao [4 ,5 ]
Wu, Jian [6 ,7 ]
机构
[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea
[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China
[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;
D O I
10.1007/s00521-022-07859-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.
引用
收藏
页码:1931 / 1944
页数:14
相关论文
共 50 条
  • [31] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    Jianfei He
    Canhui Xu
    Applied Intelligence, 2023, 53 : 28542 - 28554
  • [32] Medical Image Segmentation by Improved 3D Adaptive Thresholding
    Kim, Cheol-Hwan
    Lee, Yun-Jung
    2015 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC), 2015, : 263 - 265
  • [33] Transformer-Based Cascade U-shaped Network for Action Segmentation
    Bao, Wenxia
    Lin, An
    Huang, Hua
    Yang, Xianjun
    Chen, Hemu
    2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 157 - 161
  • [34] A Separate 3D Convolutional Neural Network Architecture for 3D Medical Image Semantic Segmentation
    Dong, Shidu
    Liu, Zhi
    Wang, Huaqiu
    Zhang, Yihao
    Cui, Shaoguo
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (08) : 1705 - 1716
  • [35] DEU-Net: Dual Encoder U-Net for 3D Medical Image Segmentation
    Zhou, Yuxiang
    Kang, Xin
    Ren, Fuji
    Nakagawa, Satoshi
    Shan, Xiao
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2735 - 2741
  • [36] Diffusion Transformer U-Net for Medical Image Segmentation
    Chowdary, G. Jignesh
    Yin, Zhaozheng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 622 - 631
  • [37] LUCF-Net: Lightweight U-Shaped Cascade Fusion Network for Medical Image Segmentation
    She, Qingshan
    Sun, Songkai
    Ma, Yuliang
    Li, Rihui
    Zhang, Yingchun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (03) : 2088 - 2099
  • [38] UT-MT: A Semi-Supervised Model of Fusion Transformer for 3D Medical Image Segmentation
    Liu, Xianchang
    Liu, Peishun
    Wang, Jinyu
    Wang, Qinshuo
    Guo, Qing
    Tang, Ruichun
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 190 - 196
  • [39] Deformable M-reps for 3D medical image segmentation
    Pizer, SM
    Fletcher, PT
    Joshi, S
    Thall, A
    Chen, JZ
    Fridman, Y
    Fritsch, DS
    Gash, AG
    Glotzer, JM
    Jiroutek, MR
    Lu, CL
    Muller, KE
    Tracton, G
    Yushkevich, P
    Chaney, EL
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 55 (2-3) : 85 - 106
  • [40] 3D Medical Image Segmentation Based on Rough Set Theory
    CHEN Shi-hao
    Chinese Journal of Biomedical Engineering, 2007, (01) : 39 - 46