MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

Authors
Zhou, Jinzhi [1, 2]
He, Haoyang [1, 2]
Ma, Guangcen [1, 2]
Li, Saifeng [1, 2]
Zhang, Guopeng [1, 2]
Affiliations
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China
[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;
DOI
10.1007/s11760-025-03962-7
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Transformers have been widely studied in medical image segmentation. However, because high-quality annotated medical image data are scarce and computational efficiency is constrained, Transformer models struggle to extract diverse global features and are prone to attention collapse. This paper therefore proposes a lightweight network, MLCA-UNet, a medical image segmentation network that integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, a convolutional attention layer, and a decoding layer. First, to improve the diversity of medical image features and the representation of fine semantic detail in segmentation, a multiscale linear attention module is designed to capture features with different receptive fields. Second, to address the collapse that Transformers can exhibit when learning attention, a convolutional attention module is designed to provide self-attention and feature diversification. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, the ISIC skin lesion dataset, the BUSI breast ultrasound dataset, and the TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice similarity coefficient (Dice) on the ACDC and ISIC datasets, 92.36% and 90.81%, respectively. On the BUSI and TNSCUI datasets, it achieves the highest Intersection over Union (IoU) values, 73.27% and 77.52%, respectively. MLCA-UNet reaches this performance with favorable inference efficiency and parameter count, striking a balance between model size and accuracy.
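For readers unfamiliar with the two metrics reported in the abstract, the following is a minimal sketch of how Dice and IoU are commonly computed for a single pair of binary masks. It is not the authors' evaluation code: the function names `dice` and `iou`, the nested-list mask representation, and the convention of returning 1.0 when both masks are empty are all assumptions made for illustration. Per-class and per-dataset averaging, which published results usually involve, is omitted.

```python
from typing import Sequence, Tuple

# Hypothetical representation: a 2D binary mask as nested sequences, 1 = foreground.
Mask = Sequence[Sequence[int]]


def _counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted foreground, target foreground) pixel counts."""
    if len(pred) != len(target) or any(len(p) != len(t) for p, t in zip(pred, target)):
        raise ValueError("pred and target must have the same shape")
    inter = pred_sum = target_sum = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p, t = int(bool(p)), int(bool(t))
            inter += p & t
            pred_sum += p
            target_sum += t
    return inter, pred_sum, target_sum


def dice(pred: Mask, target: Mask) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|); assumed to be 1.0 when both masks are empty."""
    inter, p, t = _counts(pred, target)
    denom = p + t
    return 1.0 if denom == 0 else 2.0 * inter / denom


def iou(pred: Mask, target: Mask) -> float:
    """IoU = |A ∩ B| / |A ∪ B|; assumed to be 1.0 when both masks are empty."""
    inter, p, t = _counts(pred, target)
    union = p + t - inter
    return 1.0 if union == 0 else inter / union


if __name__ == "__main__":
    pred = [[1, 1, 0], [0, 1, 0]]
    target = [[1, 0, 0], [0, 1, 1]]
    print(f"Dice = {dice(pred, target):.4f}, IoU = {iou(pred, target):.4f}")
```

For a single mask pair the two metrics are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU. Averaged scores across images or datasets need not satisfy this identity exactly.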
Pages: 11
Related papers
50 in total
  • [11] Enhancing medical image segmentation with MA-UNet: a multi-scale attention framework
    Li, Hongzhi
    Ren, Zhanghao
    Zhu, Guoqing
    Liang, Yaoju
    Cui, Han
    Wang, Chaozeyu
    Wang, Jiaxi
    VISUAL COMPUTER, 2025,
  • [12] Windowed axial shuffle attention networks for medical image segmentation
    Yi, Yugen
    Wu, Xuan
    He, Yi
    Wu, Han
    Zhou, Bin
    Luo, Siwei
    Dai, Jiangyan
    Du, Yingkui
    Zhou, Wei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 102
  • [13] DENSELY CONNECTED SWIN-UNET FOR MULTISCALE INFORMATION AGGREGATION IN MEDICAL IMAGE SEGMENTATION
    Wang, Ziyang
    Su, Meiwen
    Zheng, Jian-Qing
    Liu, Yang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 940 - 944
  • [14] ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation
    Lin, Ailiang
    Xu, Jiayu
    Li, Jinxing
    Lu, Guangming
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 297 - 307
  • [15] CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation
    Gu, Ran
    Wang, Guotai
    Song, Tao
    Huang, Rui
    Aertsen, Michael
    Deprest, Jan
    Ourselin, Sebastien
    Vercauteren, Tom
    Zhang, Shaoting
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 699 - 711
  • [16] LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation
    Wang, Jinhong
    Chen, Jintai
    Chen, Danny
    Wu, Jian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 360 - 370
  • [17] MufiNet: Multiscale Fusion Residual Networks for Medical Image Segmentation
    Wang, Chun
    Wang, Zhi
    Xi, Wei
    Yang, Zhao
    Bai, Gairui
    Wang, Ruimeng
    Duan, Meichen
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [18] VIG-UNET: VISION GRAPH NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
    Jiang, Juntao
    Chen, Xiyu
    Tian, Guanzhong
    Liu, Yong
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [19] NFMPAtt-Unet: Neighborhood Fuzzy C-means Multi-scale Pyramid Hybrid Attention Unet for medical image segmentation
    Zhao, Xinpeng
    Xu, Weihua
    NEURAL NETWORKS, 2024, 178
  • [20] TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation
    He, Jingliu
    Ma, Yuqi
    Yang, Mingyue
    Yang, Wensong
    Wu, Chunming
    Chen, Shanxiong
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 8824 - 8839