MP-FocalUNet: Multiscale parallel focal self-attention U-Net for medical image segmentation

被引:0
作者
Wang, Chuan [1 ]
Jiang, Mingfeng [1 ]
Li, Yang [1 ]
Wei, Bo [1 ]
Li, Yongming [2 ]
Wang, Pin [2 ]
Yang, Guang [3 ,4 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Chongqing Univ, Coll Commun Engn, Chongqing, Peoples R China
[3] Royal Brompton Hosp, Cardiovasc Res Ctr, London SW3 6NP, England
[4] Imperial Coll London, Natl Heart & Lung Inst, London SW7 2AZ, England
关键词
Focal self-attention mechanism; Medical image segmentation; Multiscale; Deep learning;
D O I
10.1016/j.cmpb.2024.108562
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: Medical image segmentation has been significantly improved in recent years with the progress of Convolutional Neural Networks (CNNs). Due to the inherent limitations of convolutional operations, CNNs perform poorly in learning the correlation information between global and long-range features. To solve this problem, some existing solutions rely on building deep encoders and down-sampling operations, but such methods are prone to produce redundant network structures and lose local details. Therefore, medical image segmentation tasks require better solutions to improve the modeling of the global context, while maintaining a strong grasp of the low-level details. Methods: We propose a novel multiscale parallel branch architecture (MP-FocalUNet). On the encoder side of MPFocalUNet, dual-scale sub-networks are used to extract information of different scales. A cross-scale "Feature Fusion" (FF) module was proposed to explore the potential of dual branch networks and fully utilize feature representations at different scales. On the decoder side, combined with the traditional CNN in parallel, focal selfattention is used for long-distance modeling, which can effectively capture the global dependencies and underlying spatial details in a shallower way. Results: Our proposed method is evaluated on both abdominal organ segmentation datasets and automatic cardiac diagnosis challenge datasets. Our method consistently outperforms several state-of-the-art segmentation methods with an average Dice score of 82.45% (2.68% higher than HC-Net) and 91.44% (0.35% higher than HC-Net) on the abdominal organ datasets and the automatic cardiac diagnosis challenge datasets, respectively. Conclusions: Our MP-FocalUNet is a novel encoder-decoder based multiscale parallel branch Transformer network, which solves the problem of insufficient long-distance modeling in CNNs and fuses image information at different scales. Extensive experiments on abdominal and cardiac medical image segmentation tasks show that our MP-FocalUNet outperforms other state-of-the-art methods. In the future, our work will focus on designing more lightweight Transformer-based models and better learning pixel-level intrinsic structural features generated by patch division in visual Transformers.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] MCI Net: Mamba- Convolutional lightweight self-attention medical image segmentation network
    Zhang, Yelin
    Wang, Guanglei
    Ma, Pengchong
    Li, Yan
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (01):
  • [42] BCU-Net: Bridging ConvNeXt and U-Net for medical image segmentation
    Zhang, Hongbin
    Zhong, Xiang
    Li, Guangli
    Liu, Wei
    Liu, Jiawei
    Ji, Donghong
    Li, Xiong
    Wu, Jianguo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 159
  • [43] A Self-Spatial Adaptive Weighting Based U-Net for Image Segmentation
    Cho, Choongsang
    Lee, Young Han
    Park, Jongyoul
    Lee, Sangkeun
    ELECTRONICS, 2021, 10 (03) : 1 - 11
  • [44] Pyramidal Image Segmentation Based on U-Net for Automatic Multiscale Crater Extraction
    Hong, Zhonghua
    Fan, Ziyang
    Zhou, Ruyan
    Pan, Haiyan
    Zhang, Yun
    Han, Yanling
    Wang, Jing
    Yang, Shuhu
    Jin, Yanmin
    SENSORS AND MATERIALS, 2022, 34 (01) : 237 - 250
  • [45] TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation
    Chen, Bingzhi
    Liu, Yishu
    Zhang, Zheng
    Lu, Guangming
    Kong, Adams Wai Kin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 55 - 68
  • [46] Multi-Convolutional Channel Residual Spatial Attention U-Net for Industrial and Medical Image Segmentation
    Chen, Haoyu
    Kim, Kyungbaek
    IEEE ACCESS, 2024, 12 : 76089 - 76101
  • [47] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
    Amer, Alyaa
    Lambrou, Tryphon
    Ye, Xujiong
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [48] Residual-Attention UNet plus plus : A Nested Residual-Attention U-Net for Medical Image Segmentation
    Li, Zan
    Zhang, Hong
    Li, Zhengzhen
    Ren, Zuyue
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [49] Application of U-Net and Optimized Clustering in Medical Image Segmentation: A Review
    Shao, Jiaqi
    Chen, Shuwen
    Zhou, Jin
    Zhu, Huisheng
    Wang, Ziyi
    Brown, Mackenzie
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (03): : 2173 - 2219
  • [50] Shape-intensity-guided U-net for medical image segmentation
    Dong, Wenhui
    Du, Bo
    Xu, Yongchao
    NEUROCOMPUTING, 2024, 610