Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation

被引:0
作者
Li, Shijie [1 ]
Gong, Yu [1 ]
Xiang, Qingyuan [1 ]
Li, Zheng [1 ,2 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Tianfu Engn Oriented Numercial Simulat & Software, Chengdu 610207, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV | 2025年 / 15044卷
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Hierarchical decoder; Attention mechanism; PLUS PLUS;
D O I
10.1007/978-981-97-8496-7_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the success of Transformers, hybrid Transformer and CNN methods gain considerable popularity in medical image segmentation. These methods utilize a hybrid architecture that combines Transformers and CNNs to fuse global and local information, supplemented by a pyramid structure to facilitate multi-scale interaction. However, they encounter two primary limitations: (i) Transformer struggle to capture complete global information due to the sliding window nature of the convolutional operator, and (ii) the pyramid structure within single decoder fails to provide sufficient multi-scale interaction necessary for restoring detailed features at higher levels. In this paper, we introduce the Hierarchical Decoder with Parallel Transformer and CNN (HiPar), a novel architecture designed to address these limitations. Firstly, we present a parallel structure of Transformer and CNN to maximize the capture of both global and local features. Subsequently, we propose a hierarchical decoder to model multi-scale information and progressively restore spatial details. Additionally, we incorporate lightweight components to enhance the efficiency of feature representation. Extensive experiments demonstrate that our HiPar achieves state-of-the-art results on three popular medical image segmentation benchmarks: Synapse, ACDC and GlaS.
引用
收藏
页码:133 / 147
页数:15
相关论文
共 50 条
[41]   MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation [J].
Xie, Shiao ;
Huang, Huimin ;
Niu, Ziwei ;
Lin, Lanfen ;
Chen, Yen-Wei .
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :1913-1918
[42]   HT-Net: hierarchical context-attention transformer network for medical ct image segmentation [J].
Mingjun Ma ;
Haiying Xia ;
Yumei Tan ;
Haisheng Li ;
Shuxiang Song .
Applied Intelligence, 2022, 52 :10692-10705
[43]   Hybrid CNN-Transformer model for medical image segmentation with pyramid convolution and multi-layer perceptron [J].
Liu, Xiaowei ;
Hu, Yikun ;
Chen, Jianguo .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
[44]   HT-Net: hierarchical context-attention transformer network for medical ct image segmentation [J].
Ma, Mingjun ;
Xia, Haiying ;
Tan, Yumei ;
Li, Haisheng ;
Song, Shuxiang .
APPLIED INTELLIGENCE, 2022, 52 (09) :10692-10705
[45]   Dense deep transformer for medical image segmentation: DDTraMIS [J].
Joshi, Abhilasha ;
Sharma, K. K. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) :18073-18089
[46]   Medical Image Segmentation Based on Transformer and HarDNet Structures [J].
Shen, Tongping ;
Xu, Huanqing .
IEEE ACCESS, 2023, 11 :16621-16630
[47]   Combining frequency transformer and CNNs for medical image segmentation [J].
Ismayl Labbihi ;
Othmane El Meslouhi ;
Mohamed Benaddy ;
Mustapha Kardouchi ;
Moulay Akhloufi .
Multimedia Tools and Applications, 2024, 83 :21197-21212
[48]   Advancements in medical image segmentation: A review of transformer models [J].
Kumar, S. S. .
COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
[49]   LiteTrans: Reconstruct Transformer with Convolution for Medical Image Segmentation [J].
Xu, Shuying ;
Quan, Hongyan .
BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 :300-313
[50]   A parallelly contextual convolutional transformer for medical image segmentation [J].
Feng, Yuncong ;
Su, Jianyu ;
Zheng, Jian ;
Zheng, Yupeng ;
Zhang, Xiaoli .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98