D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引：0

作者：

Wu, Yixuan ^{[1
]}

Liao, Kuanlun ^{[2
]}

Chen, Jintai ^{[2
]}

Wang, Jinhong ^{[2
]}

Chen, Danny Z. ^{[3
]}

Gao, Honghao ^{[4
,5
]}

Wu, Jian ^{[6
,7
]}

机构：

[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China

[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea

[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China

[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;

D O I：

10.1007/s00521-022-07859-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.

引用

页码：1931 / 1944

页数：14

共 50 条

[31] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
Jianfei He
Canhui Xu
Applied Intelligence, 2023, 53 : 28542 - 28554
[32] Medical Image Segmentation by Improved 3D Adaptive Thresholding
Kim, Cheol-Hwan
Lee, Yun-Jung
2015 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC), 2015, : 263 - 265
[33] Transformer-Based Cascade U-shaped Network for Action Segmentation
Bao, Wenxia
Lin, An
Huang, Hua
Yang, Xianjun
Chen, Hemu
2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 157 - 161
[34] A Separate 3D Convolutional Neural Network Architecture for 3D Medical Image Semantic Segmentation
Dong, Shidu
Liu, Zhi
Wang, Huaqiu
Zhang, Yihao
Cui, Shaoguo
JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (08) : 1705 - 1716
[35] DEU-Net: Dual Encoder U-Net for 3D Medical Image Segmentation
Zhou, Yuxiang
Kang, Xin
Ren, Fuji
Nakagawa, Satoshi
Shan, Xiao
2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2735 - 2741
[36] Diffusion Transformer U-Net for Medical Image Segmentation
Chowdary, G. Jignesh
Yin, Zhaozheng
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 622 - 631
[37] LUCF-Net: Lightweight U-Shaped Cascade Fusion Network for Medical Image Segmentation
She, Qingshan
Sun, Songkai
Ma, Yuliang
Li, Rihui
Zhang, Yingchun
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (03) : 2088 - 2099
[38] UT-MT: A Semi-Supervised Model of Fusion Transformer for 3D Medical Image Segmentation
Liu, Xianchang
Liu, Peishun
Wang, Jinyu
Wang, Qinshuo
Guo, Qing
Tang, Ruichun
2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 190 - 196
[39] Deformable M-reps for 3D medical image segmentation
Pizer, SM
Fletcher, PT
Joshi, S
Thall, A
Chen, JZ
Fridman, Y
Fritsch, DS
Gash, AG
Glotzer, JM
Jiroutek, MR
Lu, CL
Muller, KE
Tracton, G
Yushkevich, P
Chaney, EL
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 55 (2-3) : 85 - 106
[40] 3D Medical Image Segmentation Based on Rough Set Theory
CHEN Shi-hao
Chinese Journal of Biomedical Engineering, 2007, (01) : 39 - 46

← 1 2 3 4 5 →