CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION

被引:4
|
作者
Gu, Pengfei [1 ]
Zhang, Yejia [1 ]
Wang, Chaoli [1 ]
Chen, Danny Z. [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
D O I
10.1109/ISBI53787.2023.10230838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural network (CNN) based methods have achieved great successes in medical image segmentation, but their capability to learn global representations is still limited due to using small effective receptive fields of convolution operations. Transformer based methods are capable of modelling long-range dependencies of information for capturing global representations, yet their ability to model local context is lacking. Integrating CNN and Transformer to learn both local and global representations while exploring multi-scale features is instrumental in further improving medical image segmentation. In this paper, we propose a hierarchical CNN and Transformer hybrid architecture, called ConvFormer, for medical image segmentation. ConvFormer is based on several simple yet effective designs. (1) A feed forward module of Deformable Transformer (DeTrans) is re-designed to introduce local information, called Enhanced DeTrans. (2) A residual-shaped hybrid stem based on a combination of convolutions and Enhanced DeTrans is developed to capture both local and global representations to enhance representation ability. (3) Our encoder utilizes the residual-shaped hybrid stem in a hierarchical manner to generate feature maps in different scales, and an additional Enhanced DeTrans encoder with residual connections is built to exploit multi-scale features with feature maps of different scales as input. Experiments on several datasets show that our ConvFormer, trained from scratch, outperforms various CNN- or Transformerbased architectures, achieving state-of-the-art performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [22] FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation
    Xu, Shoukun
    Xiao, Dehao
    Yuan, Baohua
    Liu, Yi
    Wang, Xueyuan
    Li, Ning
    Shi, Lin
    Chen, Jialu
    Zhang, Ju-Xiao
    Wang, Yanhao
    Cao, Jianfeng
    Shao, Yeqin
    Jiang, Mingjie
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [23] Mutual Retinex: Combining Transformer and CNN for Image Enhancement
    Jiang, Kui
    Wang, Qiong
    An, Zhaoyi
    Wang, Zheng
    Zhang, Cong
    Lin, Chia-Wen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2240 - 2252
  • [24] MixSegNet: A Novel Crack Segmentation Network Combining CNN and Transformer
    Zhou, Yang
    Ali, Raza
    Mokhtar, Norrima
    Harun, Sulaiman Wadi
    Iwahashi, Masahiro
    IEEE ACCESS, 2024, 12 : 111535 - 111545
  • [25] LTMSegnet: Lightweight multi-scale medical image segmentation combining Transformer and MLP
    Huang, Xin
    Tang, Hongxiang
    Ding, Yan
    Li, Yuanyuan
    Zhu, Zhiqin
    Yang, Pan
    Computers in Biology and Medicine, 2024, 183
  • [26] LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation
    Lin, Qiqin
    Yao, Junfeng
    Hong, Qingqi
    Cao, Xianpeng
    Zhou, Rongzhou
    Xie, Weixing
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 223 - 234
  • [27] Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
    Luo, Xiangde
    Hu, Minhao
    Song, Tao
    Wang, Guotai
    Zhang, Shaoting
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 172, 2022, 172 : 820 - 833
  • [28] Hybrid 3D Medical Image Segmentation Using CNN and Frequency Transformer Fusion
    Labbihi, Ismayl
    Meslouhi, Othmane El
    Elassad, Zouhair Elamrani Abou
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [29] Aggregated Mutual Learning between CNN and Transformer for semi-supervised medical image segmentation
    Xu, Zhenghua
    Wang, Hening
    Yang, Runhe
    Yang, Yuchen
    Liu, Weipeng
    Lukasiewicz, Thomas
    KNOWLEDGE-BASED SYSTEMS, 2025, 311
  • [30] ScribFormer: Transformer Makes CNN Work Better for Scribble-Based Medical Image Segmentation
    Li, Zihan
    Zheng, Yuan
    Shan, Dandan
    Yang, Shuzhou
    Li, Qingde
    Wang, Beizhan
    Zhang, Yuanting
    Hong, Qingqi
    Shen, Dinggang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (06) : 2254 - 2265