CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION

被引:4
|
作者
Gu, Pengfei [1 ]
Zhang, Yejia [1 ]
Wang, Chaoli [1 ]
Chen, Danny Z. [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
D O I
10.1109/ISBI53787.2023.10230838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural network (CNN) based methods have achieved great successes in medical image segmentation, but their capability to learn global representations is still limited due to using small effective receptive fields of convolution operations. Transformer based methods are capable of modelling long-range dependencies of information for capturing global representations, yet their ability to model local context is lacking. Integrating CNN and Transformer to learn both local and global representations while exploring multi-scale features is instrumental in further improving medical image segmentation. In this paper, we propose a hierarchical CNN and Transformer hybrid architecture, called ConvFormer, for medical image segmentation. ConvFormer is based on several simple yet effective designs. (1) A feed forward module of Deformable Transformer (DeTrans) is re-designed to introduce local information, called Enhanced DeTrans. (2) A residual-shaped hybrid stem based on a combination of convolutions and Enhanced DeTrans is developed to capture both local and global representations to enhance representation ability. (3) Our encoder utilizes the residual-shaped hybrid stem in a hierarchical manner to generate feature maps in different scales, and an additional Enhanced DeTrans encoder with residual connections is built to exploit multi-scale features with feature maps of different scales as input. Experiments on several datasets show that our ConvFormer, trained from scratch, outperforms various CNN- or Transformerbased architectures, achieving state-of-the-art performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658
  • [42] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [43] Coformer: Collaborative Transformer for Medical Image Segmentation
    Gao, Yufei
    Zhang, Shichao
    Zhang, Dandan
    Shi, Yucheng
    Zhao, Guohua
    Shi, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 240 - 250
  • [44] ENHANCING HYBRID CNN-TRANSFORMER VIA FREQUENCY-BASED BRIDGING FOR MEDICAL IMAGE SEGMENTATION
    Zeng Xinyi
    Tang Cheng
    Zeng Pinxian
    Cui Jiaqi
    Yan Binyu
    Wang Peng
    Wang Yan
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [45] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387
  • [46] Dual-path network combining CNN and transformer for pavement crack segmentation
    Wang, Jin
    Zeng, Zhigao
    Sharma, Pradip Kumar
    Alfarraj, Osama
    Tolba, Amr
    Zhang, Jianming
    Wang, Lei
    AUTOMATION IN CONSTRUCTION, 2024, 158
  • [47] Automatically Designing CNN Architectures for Medical Image Segmentation
    Mortazi, Aliasghar
    Bagci, Ulas
    MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 98 - 106
  • [48] DSC-Net: A Novel Interactive Two-Stream Network by Combining Transformer and CNN for Ultrasound Image Segmentation
    Hu, Kai
    Zhu, Yadong
    Zhou, Tianxin
    Zhang, Yuan
    Cao, Chunhong
    Xiao, Fen
    Gao, Xieping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [49] CNN-Transformer Hybrid Architecture for Underwater Sonar Image Segmentation
    Lei, Juan
    Wang, Huigang
    Lei, Zelin
    Li, Jiayuan
    Rong, Shaowei
    REMOTE SENSING, 2025, 17 (04)
  • [50] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    He, Jianfei
    Xu, Canhui
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554