MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

Citations: 0
Authors
Zhou, Jinzhi [1, 2]
He, Haoyang [1, 2]
Ma, Guangcen [1, 2]
Li, Saifeng [1, 2]
Zhang, Guopeng [1, 2]
Affiliations
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China
[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;
DOI
10.1007/s11760-025-03962-7
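Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract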
Transformers have been widely studied in medical image segmentation. However, because high-quality annotated medical image data are limited and computational efficiency is constrained, Transformer models struggle to extract diverse global features and are prone to attention collapse. This paper therefore proposes MLCA-UNet, a lightweight medical image segmentation network that integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, a convolutional attention layer, and a decoding layer. First, to improve the diversity of medical image features and the representation of semantic detail in segmentation, a multiscale linear attention module is designed to capture features with different receptive fields. Second, to address the collapse that Transformers exhibit when learning attention, a convolutional attention module is designed to achieve self-attention and feature diversification. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, the ISIC skin lesion dataset, the BUSI breast ultrasound dataset, and the TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice similarity coefficient (Dice) on the ACDC and ISIC datasets, 92.36% and 90.81%, respectively, and the highest Intersection over Union (IoU) on the BUSI and TNSCUI datasets, 73.27% and 77.52%, respectively. MLCA-UNet reaches this accuracy with favorable inference efficiency and parameter volume, striking a balance between parameter count and accuracy.
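The abstract reports results as Dice and Intersection over Union scores. As a reading aid, the following is a minimal sketch of how these two overlap metrics are commonly computed for a pair of binary masks. It is not taken from the paper: the function names, the nested-list mask representation, and the convention of returning 1.0 when both masks are empty are all assumptions made for illustration, and the paper's own evaluation protocol (for example, per-class or per-case averaging) may differ.

```python
from typing import Sequence, Tuple

# Hypothetical mask type: a 2D grid of 0/1 values (assumption for this sketch).
Mask = Sequence[Sequence[int]]


def overlap_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted area, target area) in pixels."""
    inter = pred_area = target_area = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p_on, t_on = bool(p), bool(t)
            inter += int(p_on and t_on)
            pred_area += int(p_on)
            target_area += int(t_on)
    return inter, pred_area, target_area


def dice(pred: Mask, target: Mask) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|)."""
    inter, a, b = overlap_counts(pred, target)
    if a + b == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * inter / (a + b)


def iou(pred: Mask, target: Mask) -> float:
    """IoU = |A ∩ B| / |A ∪ B|."""
    inter, a, b = overlap_counts(pred, target)
    union = a + b - inter
    if union == 0:
        return 1.0  # assumed convention, as above
    return inter / union


if __name__ == "__main__":
    pred = [[1, 1, 0],
            [0, 1, 0]]
    target = [[1, 0, 0],
              [0, 1, 1]]
    print(f"Dice = {dice(pred, target):.4f}")
    print(f"IoU  = {iou(pred, target):.4f}")
```

For a single pair of masks the two metrics are monotonically related by Dice = 2·IoU / (1 + IoU), so they rank one prediction the same way; averaged over a dataset they can still order methods differently, which is one reason papers often report both.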
Pages: 11