SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation

Cited by: 5
Authors
Perera, Shehan [1]
Navard, Pouyan [1]
Yilmaz, Alper [1]
Affiliations
[1] Ohio State Univ, Photogrammetr Comp Vis Lab, Columbus, OH 43210 USA
DOI
10.1109/CVPRW63382.2024.00503
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The adoption of Vision Transformer (ViT)-based architectures represents a significant advancement in 3D medical image (MI) segmentation, surpassing traditional Convolutional Neural Network (CNN) models by enhancing global contextual understanding. While this paradigm shift has significantly improved 3D segmentation performance, state-of-the-art architectures are extremely large and complex and require large-scale computing resources for training and deployment. Furthermore, with the limited datasets often encountered in medical imaging, larger models can present hurdles in both generalization and convergence. In response to these challenges, and to demonstrate that lightweight models are a valuable area of research in 3D medical imaging, we present SegFormer3D, a hierarchical Transformer that calculates attention across multiscale volumetric features. Additionally, SegFormer3D avoids complex decoders and uses an all-MLP decoder to aggregate local and global attention features to produce highly accurate segmentation masks. The proposed memory-efficient Transformer preserves the performance characteristics of a significantly larger model in a compact design. SegFormer3D democratizes deep learning for 3D medical image segmentation by offering a model with 33x fewer parameters and a 13x reduction in GFLOPS compared to the current state-of-the-art (SOTA). We benchmark SegFormer3D against the current SOTA models on three widely used datasets, Synapse, BraTS, and ACDC, achieving competitive results. Code: https://github.com/OSUPCVLab/SegFormer3D.git
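The abstract describes attention computed over multiscale volumetric features. As a rough illustration of why a hierarchical design can be cheaper, the sketch below counts tokens per stage and the resulting number of query-key pairs for plain self-attention. The input volume size, the per-stage strides, and both function names are assumptions chosen for illustration; they are not taken from the abstract or from the SegFormer3D code.

```python
# Illustrative arithmetic only. The volume size and strides below are
# hypothetical and are NOT values reported in the abstract.

def tokens_per_stage(volume_shape, strides):
    """Return the token count at each stage of a hierarchical encoder.

    Every stage divides depth, height, and width by its stride, so the
    token count is the product of the three downsampled axes.
    """
    d, h, w = volume_shape
    counts = []
    for s in strides:
        d, h, w = d // s, h // s, w // s
        counts.append(d * h * w)
    return counts


def attention_pairs(num_tokens):
    """Query-key pairs for plain self-attention (quadratic in tokens)."""
    return num_tokens * num_tokens


volume = (128, 128, 128)   # hypothetical input volume (D, H, W)
strides = (4, 2, 2, 2)     # hypothetical downsampling factor per stage

counts = tokens_per_stage(volume, strides)
pairs = [attention_pairs(n) for n in counts]

for stage, (n, p) in enumerate(zip(counts, pairs), start=1):
    print(f"stage {stage}: {n} tokens, {p} query-key pairs")
```

Under these assumed settings the token count drops by a factor of 8 at each stage after the first, so the quadratic attention cost drops by a factor of 64 per stage. This only motivates the multiscale idea; it says nothing about the actual parameter or GFLOPS figures quoted in the abstract.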
Pages: 4981-4988
Number of pages: 8
Related papers
50 in total
  • [1] EFFICIENT 3D TRANSFORMER WITH CLUSTER-BASED DOMAIN-ADVERSARIAL LEARNING FOR 3D MEDICAL IMAGE SEGMENTATION
    Zhang, Haoran
    Chen, Hao
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [2] FATUnetr: fully attention Transformer for 3D medical image segmentation
    Li, QingFeng
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1415 - 1419
  • [3] nnFormer: Volumetric Medical Image Segmentation via a 3D Transformer
    Zhou, Hong-Yu
    Guo, Jiansen
    Zhang, Yinghao
    Han, Xiaoguang
    Yu, Lequan
    Wang, Liansheng
    Yu, Yizhou
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4036 - 4045
  • [4] Efficient combined algorithm of Transformer and U-Net for 3D medical image segmentation
    Zhang, Mingyan
    Wang, Aixia
    Yang, Gang
    Li, Jingjiao
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4377 - 4382
  • [5] 3D Medical Axial Transformer: A Lightweight Transformer Model for 3D Brain Tumor Segmentation
    Liu, Cheng
    Kiryu, Hisanori
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 799 - 813
  • [6] A 3D Medical Image Segmentation Framework Fusing Convolution and Transformer Features
    Zhu, Fazhan
    Lv, Jiaxing
    Lu, Kun
    Wang, Wenyan
    Cong, Hongshou
    Zhang, Jun
    Chen, Peng
    Zhao, Yuan
    Wu, Ziheng
    INTELLIGENT COMPUTING THEORIES AND APPLICATION (ICIC 2022), PT I, 2022, 13393 : 772 - 786
  • [7] MCRformer: Morphological constraint reticular transformer for 3D medical image segmentation
    Li, Jun
    Chen, Nan
    Zhou, Han
    Lai, Taotao
    Dong, Heng
    Feng, Chunhui
    Chen, Riqing
    Yang, Changcai
    Cai, Fanggang
    Wei, Lifang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [8] DAST: Differentiable Architecture Search with Transformer for 3D Medical Image Segmentation
    Yang, Dong
    Xu, Ziyue
    He, Yufan
    Nath, Vishwesh
    Li, Wenqi
    Myronenko, Andriy
    Hatamizadeh, Ali
    Zhao, Can
    Roth, Holger R.
    Xu, Daguang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 747 - 756
  • [9] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861
  • [10] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180