MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

被引：0

作者：

Zhou, Jinzhi ^{[1
,2
]}

He, Haoyang ^{[1
,2
]}

Ma, Guangcen ^{[1
,2
]}

Li, Saifeng ^{[1
,2
]}

Zhang, Guopeng ^{[1
,2
]}

机构：

[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China

[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2025年 / 19卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;

D O I：

10.1007/s11760-025-03962-7

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Transformers have been widely studied in medical image segmentation. However, due to the limitations of high-quality annotated medical image data and model computational efficiency, Transformer models struggle to extract diverse global features and are prone to attention collapse. Therefore, this paper proposes a lightweight network, MLCA-UNet (Medical image segmentation network integrating multiscale linear and convolutional attention), which integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, convolutional attention layer, and decoding layer. First, to improve the diversity of medical image features and the representation of segmentation semantic details, this paper designs a multiscale linear attention module to capture features with different receptive fields. Second, to address the collapse phenomenon caused by Transformers when learning attention, a convolutional attention module is designed to achieve self-attention and diversification of features. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, ISIC skin lesion dataset, BUSI breast ultrasound dataset, and TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice (dice similarity coefficient) on the ACDC and ISIC datasets, 92.36 and 90.81%, respectively. Additionally, on the BUSI dataset and TNSCUI dataset, it achieves the highest IOU (Intersection over Union) values of 73.27 and 77.52% respectively. MLCA-UNet achieves superior performance with better inference efficiency and parameter volume, striking a balance between parameter count and accuracy.

引用

页数：11

共 50 条

[1] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
Chen, Shaolong
Qiu, Changzhen
Yang, Weiping
Zhang, Zhiyong
SENSORS, 2022, 22 (10)
[2] Improved UNet with Attention for Medical Image Segmentation
AL Qurri, Ahmed
Almekkawy, Mohamed
SENSORS, 2023, 23 (20)
[3] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
Garbaz, Anass
Oukdach, Yassine
Charfi, Said
El Ansari, Mohamed
Koutti, Lahcen
Salihoun, Mouna
COGNITIVE COMPUTATION, 2025, 17 (02)
[4] Efficient UNet fusion of convolutional neural networks and state space models for medical image segmentation
Meng, Wenjie
Mu, Aiming
Wang, Huajun
DIGITAL SIGNAL PROCESSING, 2025, 158
[5] DSKCA-UNet: Dynamic selective kernel channel attention for medical image segmentation
Shen, Longfeng
Wang, Qiong
Zhang, Yingjie
Qin, Fenglan
Jin, Hengjun
Zhao, Wei
MEDICINE, 2023, 102 (39) : E35328
[6] DI-Unet: Dimensional interaction self-attention for medical image segmentation
Wu, Yanlin
Wang, Guanglei
Wang, Zhongyang
Wang, Hongrui
Li, Yan
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
[7] MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation
Khalaf, Muna
Dhannoon, Ban N.
BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1603 - 1611
[8] A Novel Elastomeric UNet for Medical Image Segmentation
Cai, Sijing
Wu, Yi
Chen, Guannan
FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
[9] DMSA-UNet: Dual Multi-Scale Attention makes UNet more strong for medical image segmentation
Li, Xiang
Fu, Chong
Wang, Qun
Zhang, Wenchao
Sham, Chiu-Wing
Chen, Junxin
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[10] Swin-HAUnet: A Swin-Hierarchical Attention Unet For Enhanced Medical Image Segmentation
Chen, Jiarong
Zhang, Xuyang
Li, Rongwen
Zhou, Peng
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 371 - 385

← 1 2 3 4 5 →