MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

被引:0
|
作者
Zhou, Jinzhi [1 ,2 ]
He, Haoyang [1 ,2 ]
Ma, Guangcen [1 ,2 ]
Li, Saifeng [1 ,2 ]
Zhang, Guopeng [1 ,2 ]
机构
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China
[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;
D O I
10.1007/s11760-025-03962-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Transformers have been widely studied in medical image segmentation. However, due to the limitations of high-quality annotated medical image data and model computational efficiency, Transformer models struggle to extract diverse global features and are prone to attention collapse. Therefore, this paper proposes a lightweight network, MLCA-UNet (Medical image segmentation network integrating multiscale linear and convolutional attention), which integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, convolutional attention layer, and decoding layer. First, to improve the diversity of medical image features and the representation of segmentation semantic details, this paper designs a multiscale linear attention module to capture features with different receptive fields. Second, to address the collapse phenomenon caused by Transformers when learning attention, a convolutional attention module is designed to achieve self-attention and diversification of features. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, ISIC skin lesion dataset, BUSI breast ultrasound dataset, and TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice (dice similarity coefficient) on the ACDC and ISIC datasets, 92.36 and 90.81%, respectively. Additionally, on the BUSI dataset and TNSCUI dataset, it achieves the highest IOU (Intersection over Union) values of 73.27 and 77.52% respectively. MLCA-UNet achieves superior performance with better inference efficiency and parameter volume, striking a balance between parameter count and accuracy.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
    Chen, Shaolong
    Qiu, Changzhen
    Yang, Weiping
    Zhang, Zhiyong
    SENSORS, 2022, 22 (10)
  • [2] Improved UNet with Attention for Medical Image Segmentation
    AL Qurri, Ahmed
    Almekkawy, Mohamed
    SENSORS, 2023, 23 (20)
  • [3] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
    Garbaz, Anass
    Oukdach, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    COGNITIVE COMPUTATION, 2025, 17 (02)
  • [4] Efficient UNet fusion of convolutional neural networks and state space models for medical image segmentation
    Meng, Wenjie
    Mu, Aiming
    Wang, Huajun
    DIGITAL SIGNAL PROCESSING, 2025, 158
  • [5] DSKCA-UNet: Dynamic selective kernel channel attention for medical image segmentation
    Shen, Longfeng
    Wang, Qiong
    Zhang, Yingjie
    Qin, Fenglan
    Jin, Hengjun
    Zhao, Wei
    MEDICINE, 2023, 102 (39) : E35328
  • [6] DI-Unet: Dimensional interaction self-attention for medical image segmentation
    Wu, Yanlin
    Wang, Guanglei
    Wang, Zhongyang
    Wang, Hongrui
    Li, Yan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [7] MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation
    Khalaf, Muna
    Dhannoon, Ban N.
    BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1603 - 1611
  • [8] A Novel Elastomeric UNet for Medical Image Segmentation
    Cai, Sijing
    Wu, Yi
    Chen, Guannan
    FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
  • [9] DMSA-UNet: Dual Multi-Scale Attention makes UNet more strong for medical image segmentation
    Li, Xiang
    Fu, Chong
    Wang, Qun
    Zhang, Wenchao
    Sham, Chiu-Wing
    Chen, Junxin
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [10] Swin-HAUnet: A Swin-Hierarchical Attention Unet For Enhanced Medical Image Segmentation
    Chen, Jiarong
    Zhang, Xuyang
    Li, Rongwen
    Zhou, Peng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 371 - 385