MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

被引:0
作者
Zhou, Jinzhi [1 ,2 ]
He, Haoyang [1 ,2 ]
Ma, Guangcen [1 ,2 ]
Li, Saifeng [1 ,2 ]
Zhang, Guopeng [1 ,2 ]
机构
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China
[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;
D O I
10.1007/s11760-025-03962-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Transformers have been widely studied in medical image segmentation. However, due to the limitations of high-quality annotated medical image data and model computational efficiency, Transformer models struggle to extract diverse global features and are prone to attention collapse. Therefore, this paper proposes a lightweight network, MLCA-UNet (Medical image segmentation network integrating multiscale linear and convolutional attention), which integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, convolutional attention layer, and decoding layer. First, to improve the diversity of medical image features and the representation of segmentation semantic details, this paper designs a multiscale linear attention module to capture features with different receptive fields. Second, to address the collapse phenomenon caused by Transformers when learning attention, a convolutional attention module is designed to achieve self-attention and diversification of features. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, ISIC skin lesion dataset, BUSI breast ultrasound dataset, and TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice (dice similarity coefficient) on the ACDC and ISIC datasets, 92.36 and 90.81%, respectively. Additionally, on the BUSI dataset and TNSCUI dataset, it achieves the highest IOU (Intersection over Union) values of 73.27 and 77.52% respectively. MLCA-UNet achieves superior performance with better inference efficiency and parameter volume, striking a balance between parameter count and accuracy.
引用
收藏
页数:11
相关论文
共 50 条
[41]   Reducing the Hausdorff Distance in Medical Image Segmentation With Convolutional Neural Networks [J].
Karimi, Davood ;
Salcudean, Septimiu E. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (02) :499-513
[42]   EMED-UNet: An Efficient Multi-Encoder-Decoder Based UNet for Medical Image Segmentation [J].
Shah, Kashish D. ;
Patel, Dhaval K. ;
Thaker, Minesh P. ;
Patel, Harsh A. ;
Saikia, Manob Jyoti ;
Ranger, Bryan J. .
IEEE ACCESS, 2023, 11 :95253-95266
[43]   Evaluation of multislice inputs to convolutional neural networks for medical image segmentation [J].
Vu, Minh H. ;
Grimbergen, Guus ;
Nyholm, Tufve ;
Lofstedt, Tommy .
MEDICAL PHYSICS, 2020, 47 (12) :6216-6231
[44]   RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation [J].
Tang, Hao ;
Huang, Guoheng ;
Cheng, Lianglun ;
Yuan, Xiaochen ;
Tao, Qi ;
Chen, Xuhang ;
Zhong, Guo ;
Yang, Xiaohui .
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) :8427-8443
[45]   MCI Net: Mamba- Convolutional lightweight self-attention medical image segmentation network [J].
Zhang, Yelin ;
Wang, Guanglei ;
Ma, Pengchong ;
Li, Yan .
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (01)
[46]   DSCA-Net: A depthwise separable convolutional neural network with attention mechanism for medical image segmentation [J].
Shan, Tong ;
Yan, Jiayong ;
Cui, Xiaoyao ;
Xie, Lijian .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) :365-382
[47]   Focusing the View: Enhancing U-Net with Convolutional Block Attention for Superior Medical Image Segmentation [J].
Nhu-Tai Do ;
Dat Nguyen Khanh ;
Tram-Tran Nguyen-Quynh ;
Quoc-Huy Nguyen .
INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 :156-165
[48]   A parallelly contextual convolutional transformer for medical image segmentation [J].
Feng, Yuncong ;
Su, Jianyu ;
Zheng, Jian ;
Zheng, Yupeng ;
Zhang, Xiaoli .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
[49]   ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation [J].
Li, Hao ;
Zhai, Di-Hua ;
Xia, Yuanqing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) :2083-2096
[50]   TMU: Transmission-Enhanced Mamba-UNet for Medical Image Segmentation [J].
Yang, Xiongfeng ;
Luo, Ziyang ;
Wu, Yanlin ;
Xie, Xueshuo ;
Nan, Li ;
Li, Tao .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024, 2024, 14871 :428-438