Sub-pixel multi-scale fusion network for medical image segmentation

Citations: 0
Authors
Jing Li [1]
Qiaohong Chen [1]
Xian Fang [1]
Affiliations
[1] School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou
Keywords
Adaptive gate; CNN; Medical image segmentation; Multi-scale fusion; Sub-pixel feature; Transformer
DOI
10.1007/s11042-024-20338-0
Abstract
CNNs and Transformers have significantly advanced the domain of medical image segmentation. The integration of their strengths facilitates rich feature extraction but also introduces the challenge of mixed multi-scale feature fusion. To overcome this issue, we propose an innovative deep medical image segmentation framework termed Sub-pixel Multi-scale Fusion Network (SMFNet), which effectively incorporates the sub-pixel multi-scale feature fusion results of CNN and Transformer into the architecture. In particular, our design consists of three effective and practical modules. First, we utilize the Sub-pixel Convolutional Module to synchronize the extracted features at multiple scales to a consistent resolution. Next, we develop the Three-level Enhancement Module to learn features from adjacent layers and perform information exchange. Lastly, we leverage the Hierarchical Adaptive Gate to fuse information from other contextual levels through the Sub-pixel Convolutional Module. Extensive experiments on the Synapse, ACDC, and ISIC 2018 datasets demonstrate the effectiveness of the proposed SMFNet, and our method is superior to other competitive CNN-based or Transformer-based segmentation methods. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
Pages: 89355-89373
Page count: 18