Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation

被引:0
作者
Li, Shijie [1 ]
Gong, Yu [1 ]
Xiang, Qingyuan [1 ]
Li, Zheng [1 ,2 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Tianfu Engn Oriented Numercial Simulat & Software, Chengdu 610207, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV | 2025年 / 15044卷
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Hierarchical decoder; Attention mechanism; PLUS PLUS;
D O I
10.1007/978-981-97-8496-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the success of Transformers, hybrid Transformer and CNN methods gain considerable popularity in medical image segmentation. These methods utilize a hybrid architecture that combines Transformers and CNNs to fuse global and local information, supplemented by a pyramid structure to facilitate multi-scale interaction. However, they encounter two primary limitations: (i) Transformer struggle to capture complete global information due to the sliding window nature of the convolutional operator, and (ii) the pyramid structure within single decoder fails to provide sufficient multi-scale interaction necessary for restoring detailed features at higher levels. In this paper, we introduce the Hierarchical Decoder with Parallel Transformer and CNN (HiPar), a novel architecture designed to address these limitations. Firstly, we present a parallel structure of Transformer and CNN to maximize the capture of both global and local features. Subsequently, we propose a hierarchical decoder to model multi-scale information and progressively restore spatial details. Additionally, we incorporate lightweight components to enhance the efficiency of feature representation. Extensive experiments demonstrate that our HiPar achieves state-of-the-art results on three popular medical image segmentation benchmarks: Synapse, ACDC and GlaS.
引用
收藏
页码:133 / 147
页数:15
相关论文
共 50 条
[31]   Hybrid Transformer and Convolution for Medical Image Segmentation [J].
Wang, Fan ;
Wang, Bo .
2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, :156-159
[32]   ATFormer: Advanced transformer for medical image segmentation [J].
Chen, Yong ;
Lu, Xuesong ;
Xie, Oinlan .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
[33]   Coformer: Collaborative Transformer for Medical Image Segmentation [J].
Gao, Yufei ;
Zhang, Shichao ;
Zhang, Dandan ;
Shi, Yucheng ;
Zhao, Guohua ;
Shi, Lei .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 :240-250
[34]   The Fully Convolutional Transformer for Medical Image Segmentation [J].
Tragakis, Athanasios ;
Kaul, Chaitanya ;
Murray-Smith, Roderick ;
Husmeier, Dirk .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :3649-3658
[35]   Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation [J].
Rahman, Md Mostafijur ;
Marculescu, Radu .
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 :1526-1544
[36]   FDR-TransUNet: A novel encoder-decoder architecture with vision transformer for improved medical image segmentation [J].
Zhang, Chaoyang ;
Sun, Shibao ;
Hu, Wenmao ;
Zhao, Pengcheng .
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
[37]   FFUNet: A novel feature fusion makes strong decoder for medical image segmentation [J].
Xie, Junsong ;
Zhu, Renju ;
Wu, Zezhi ;
Ouyang, Jinling .
IET SIGNAL PROCESSING, 2022, 16 (05) :501-514
[38]   Rethinking the Encoder–decoder Structure in Medical Image Segmentation from Releasing Decoder Structure [J].
Jiajia Ni ;
Wei Mu ;
An Pan ;
Zhengming Chen .
Journal of Bionic Engineering, 2024, 21 :1511-1521
[39]   CTRANSNET: CONVOLUTIONAL NEURAL NETWORK COMBINED WITH TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION [J].
Zhang, Zhixin ;
Jiang, Shuhao ;
Pan, Xuhua .
COMPUTING AND INFORMATICS, 2023, 42 (02) :392-410
[40]   FCT-Net: Efficient Bridge Fusion Incorporating CNN-Transformer Network for Medical Image Segmentation [J].
Zhou, Bowen ;
Dong, Xingbo ;
Zhao, Xiaowei ;
Li, Chenglong ;
Jin, Zhe ;
Wang, Huabin .
IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES, 2025, 9 (06) :762-775