D-former: a U-shaped Dilated Transformer for 3D medical image segmentation

被引：0

作者：

Wu, Yixuan ^{[1
]}

Liao, Kuanlun ^{[2
]}

Chen, Jintai ^{[2
]}

Wang, Jinhong ^{[2
]}

Chen, Danny Z. ^{[3
]}

Gao, Honghao ^{[4
,5
]}

Wu, Jian ^{[6
,7
]}

机构：

[1] Zhejiang Univ, Sch Med, Hangzhou 310030, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China

[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA

[4] Gachon Univ, Coll Future Ind, Seongnam 13120, South Korea

[5] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

[6] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Hangzhou 310058, Peoples R China

[7] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Medical image analysis; Segmentation; Transformer; Long-range dependency; Position encoding; NETWORKS; ATTENTION;

D O I：

10.1007/s00521-022-07859-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Computer-aided medical image segmentation has been applied widely in diagnosis and treatment to obtain clinically useful information of shapes and volumes of target organs and tissues. In the past several years, convolutional neural network (CNN)-based methods (e.g., U-Net) have dominated this area, but still suffered from inadequate long-range information capturing. Hence, recent work presented computer vision Transformer variants for medical image segmentation tasks and obtained promising performances. Such Transformers modeled long-range dependency by computing pair-wise patch relations. However, they incurred prohibitive computational costs, especially on 3D medical images (e.g., CT and MRI). In this paper, we propose a new method called Dilated Transformer, which conducts self-attention alternately in local and global scopes for pair-wise patch relations capturing. Inspired by dilated convolution kernels, we conduct the global self-attention in a dilated manner, enlarging receptive fields without increasing the patches involved and thus reducing computational costs. Based on this design of Dilated Transformer, we construct a U-shaped encoder-decoder hierarchical architecture called D-Former for 3D medical image segmentation. Experiments on the Synapse and ACDC datasets show that our D-Former model, trained from scratch, outperforms various competitive CNN-based or Transformer-based segmentation models at a low computational cost without time-consuming per-training process.

引用

页码：1931 / 1944

页数：14

共 50 条

[21] 3D U2-Net: A 3D Universal U-Net for Multi-domain Medical Image Segmentation
Huang, Chao
Han, Hu
Yao, Qingsong
Zhu, Shankuan
Zhou, S. Kevin
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 291 - 299
[22] A hybrid framework for 3D medical image segmentation
Chen, T
Metaxas, D
MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565
[23] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
Yu, Bin
Zhou, Quan
Zhang, Xuming
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387
[24] RockFormer: A U-Shaped Transformer Network for Martian Rock Segmentation
Liu, Haiqiang
Yao, Meibao
Xiao, Xueming
Xiong, Yonggang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[25] TU-Net: U-shaped Structure Based on Transformers for Medical Image Segmentation
Zhao, Jiamei
Wu, Dikang
Wang, Zhifang
DATA SCIENCE (ICPCSEE 2022), PT I, 2022, 1628 : 376 - 386
[26] SHIP SEGMENTATION ON HIGH-RESOLUTION SAR IMAGE BY A 3D DILATED MULTISCALE U-NET
Li, Jichao
Guo, Chubing
Gou, Shuiping
Chen, Yuanbo
Wang, Miao
Chen, Jia-Wei
IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2575 - 2578
[27] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
He, Jianfei
Xu, Canhui
APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554
[28] TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation
Li, Zheng
Zhang, Jinhui
Wei, Siyi
Gao, Yueyang
Cao, Chengwei
Wu, Zhiwei
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6803 - 6814
[29] Medical image segmentation using 3D MRI data
Voronin, V.
Marchuk, V.
Semenishchev, E.
Cen, Yigang
Agaian, S.
MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2017, 2017, 10221
[30] 3D Medical image segmentation using parallel transformers
Yan, Qingsen
Liu, Shengqiang
Xu, Songhua
Dong, Caixia
Li, Zongfang
Shi, Javen Qinfeng
Zhang, Yanning
Dai, Duwei
PATTERN RECOGNITION, 2023, 138

← 1 2 3 4 5 →