CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION

被引:4
|
作者
Gu, Pengfei [1 ]
Zhang, Yejia [1 ]
Wang, Chaoli [1 ]
Chen, Danny Z. [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
D O I
10.1109/ISBI53787.2023.10230838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural network (CNN) based methods have achieved great successes in medical image segmentation, but their capability to learn global representations is still limited due to using small effective receptive fields of convolution operations. Transformer based methods are capable of modelling long-range dependencies of information for capturing global representations, yet their ability to model local context is lacking. Integrating CNN and Transformer to learn both local and global representations while exploring multi-scale features is instrumental in further improving medical image segmentation. In this paper, we propose a hierarchical CNN and Transformer hybrid architecture, called ConvFormer, for medical image segmentation. ConvFormer is based on several simple yet effective designs. (1) A feed forward module of Deformable Transformer (DeTrans) is re-designed to introduce local information, called Enhanced DeTrans. (2) A residual-shaped hybrid stem based on a combination of convolutions and Enhanced DeTrans is developed to capture both local and global representations to enhance representation ability. (3) Our encoder utilizes the residual-shaped hybrid stem in a hierarchical manner to generate feature maps in different scales, and an additional Enhanced DeTrans encoder with residual connections is built to exploit multi-scale features with feature maps of different scales as input. Experiments on several datasets show that our ConvFormer, trained from scratch, outperforms various CNN- or Transformerbased architectures, achieving state-of-the-art performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] UCTNet: Uncertainty-guided CNN-Transformer hybrid networks for medical image segmentation
    Guo, Xiayu
    Lin, Xian
    Yang, Xin
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    PATTERN RECOGNITION, 2024, 152
  • [32] RAMIS: Increasing robustness and accuracy in medical image segmentation with hybrid CNN-transformer synergy
    Gu, Jia
    Tian, Fangzheng
    Oh, Il-Seok
    NEUROCOMPUTING, 2025, 618
  • [33] EFFICIENT BINARY CNN FOR MEDICAL IMAGE SEGMENTATION
    Brahma, Kaustav
    Kumar, Viksit
    Samir, Anthony E.
    Chandrakasan, Anantha P.
    Eldar, Yonina C.
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 817 - 821
  • [34] LcmUNet: A Lightweight Network Combining CNN and MLP for Real-Time Medical Image Segmentation
    Zhang, Shuai
    Niu, Yanmin
    BIOENGINEERING-BASEL, 2023, 10 (06):
  • [35] HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
    Sun, Qixuan
    Fang, Nianhua
    Liu, Zhuo
    Zhao, Liang
    Wen, Youpeng
    Lin, Hongxiang
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [36] Segmentation Method of Magnetoelectric Brain Image Based on the Transformer and the CNN
    Liu, Xiaoli
    Cheng, Xiaorong
    INFORMATION, 2022, 13 (10)
  • [37] CNN and Transformer Fusion for Remote Sensing Image Semantic Segmentation
    Chen, Xin
    Li, Dongfen
    Liu, Mingzhe
    Jia, Jiaru
    REMOTE SENSING, 2023, 15 (18)
  • [38] Medical Image Segmentation Using Transformer Networks
    Karimi, Davood
    Dou, Haoran
    Gholipour, Ali
    IEEE ACCESS, 2022, 10 : 29322 - 29332
  • [39] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [40] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Oinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85