MLCA-UNet: medical image segmentation networks with multiscale linear and convolutional attention

Cited by: 0

Authors:

Zhou, Jinzhi [1,2]

He, Haoyang [1,2]

Ma, Guangcen [1,2]

Li, Saifeng [1,2]

Zhang, Guopeng [1,2]

Affiliations:

[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621000, Peoples R China

[2] Robot Technol Used Special Environm Key Lab Sichua, Mianyang 621000, Peoples R China

Source:

SIGNAL IMAGE AND VIDEO PROCESSING | 2025, Vol. 19, No. 6

Funding:

National Natural Science Foundation of China;

Keywords:

Medical image segmentation; UNet; Multi-scale linear attention; Convolutional visual transformer;

DOI:

10.1007/s11760-025-03962-7

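Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract: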

Transformers have been widely studied in medical image segmentation. However, because high-quality annotated medical image data are scarce and model computational efficiency is constrained, Transformer models struggle to extract diverse global features and are prone to attention collapse. Therefore, this paper proposes a lightweight network, MLCA-UNet (Medical image segmentation network integrating multiscale linear and convolutional attention), which integrates multiscale linear attention and convolutional attention. The network consists of an encoding layer, a convolutional attention layer, and a decoding layer. First, to improve the diversity of medical image features and the representation of segmentation semantic details, this paper designs a multiscale linear attention module to capture features with different receptive fields. Second, to address the collapse phenomenon that arises when Transformers learn attention, a convolutional attention module is designed to achieve self-attention and diversification of features. Finally, to verify the effectiveness of the proposed method, experiments were conducted on the ACDC cardiac dataset, the ISIC skin lesion dataset, the BUSI breast ultrasound dataset, and the TNSCUI thyroid nodule ultrasound dataset. The results show that MLCA-UNet outperforms existing mainstream segmentation networks. It achieves the best Dice (Dice similarity coefficient) on the ACDC and ISIC datasets, 92.36% and 90.81%, respectively. Additionally, on the BUSI and TNSCUI datasets, it achieves the highest IoU (Intersection over Union) values, 73.27% and 77.52%, respectively. MLCA-UNet achieves superior performance with better inference efficiency and a smaller parameter volume, striking a balance between parameter count and accuracy.
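The abstract reports results as Dice and IoU scores. As a reminder of what those two numbers measure, the following is a minimal sketch of the standard definitions for a pair of binary masks. It is not the authors' evaluation code: the function name `dice_and_iou`, the flat 0/1 list representation, and the convention of scoring two empty masks as 1.0 are assumptions made for illustration.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as flat sequences of 0/1.

    Hypothetical helper for illustration; real pipelines usually operate on
    arrays and average over classes or cases.
    """
    if len(pred) != len(target):
        raise ValueError("masks must contain the same number of pixels")

    # Pixels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    if union == 0:
        # Both masks are empty. Scoring this as perfect agreement is a
        # convention assumed here, not something stated in the abstract.
        return 1.0, 1.0

    dice = 2.0 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy example: 3 predicted pixels, 3 true pixels, 2 of them shared.
pred = [1, 1, 1, 0, 0, 0]
target = [0, 1, 1, 1, 0, 0]
dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.4f}, IoU = {iou:.4f}")
```

For a single pair of masks the two scores are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU. This is one reason the Dice figures quoted for ACDC and ISIC are not directly comparable with the IoU figures quoted for BUSI and TNSCUI.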


Pages: 11
