AESC-TransUnet:Attention Enhanced Selective Channel Transformer U-Net for Medical Image Segmentation

被引：0

作者：

Huang, Wenlei ^{[1
,2
]}

Xiao, Hongxiang ^{[1
,2
]}

机构：

[1] Guilin Univ Technol, Coll Comp Sci & Engn, Guilin 541006, Guangxi, Peoples R China

[2] Guilin Univ Technol, Guangxi Key Lab Embedded Technol & Intelligent Sys, Guilin 541004, Peoples R China

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2025年 / 19卷 / 09期

关键词：

Medical image segmentation; U-Net; Transformer; Efficient Selective Channel Attention; Deep learning;

D O I：

10.1007/s11760-025-04311-4

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep learning has greatly advanced medical image segmentation, especially with the integration of U-Net and Transformer architectures. However, challenges remain in clinical practice, such as low resolution, blurred edges, and difficulty in accurately segmenting small lesions. Existing methods, including various TransUNet variants, struggle with these issues. For instance, while TransUNet leverages the Transformer to capture global context, it suffers from the inability to preserve fine-grained local details. Similarly, Swin-Unet, which uses the Swin Transformer, excels in global feature extraction but often loses precision in complex backgrounds and small structures. The classic U-Net model, despite its strong performance in extracting local features, struggles with segmentation accuracy in low-contrast or complex areas due to its upsampling and downsampling processes.To address these limitations, we propose AESC-TransUNet, a novel network combining Efficient Selective Channel Attention (ESCA) and Convolution-Transformer fusion (ConvFormer). ESCA improves feature representation by using parallel processing and selective channel enhancement, effectively addressing the low resolution and blurred edges problem while preserving local details. ConvFormer optimizes the integration of both global and local information by combining convolution with self-attention mechanisms, reducing positional loss during upsampling and improving edge segmentation. Experimental results show that AESC-TransUNet significantly outperforms existing methods, achieving higher segmentation accuracy, particularly for small lesions and complex structures.

引用

页数：11

共 28 条

[1] TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism for a Deep Segmentation Model [J].

Azad, Reza ;

Al-Antary, Mohammad T. ;

Heidari, Moein ;

Merhof, Dorit .

IEEE ACCESS, 2022, 10 :108205-108215

[2] Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions [J].

Azad, Reza ;

Asadi-Aghbolaghi, Maryam ;

Fathy, Mahmood ;

Escalera, Sergio .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :406-415

[3] Polyp-Net: A Multimodel Fusion Network for Polyp Segmentation [J].

Banik, Debapriya ;

Roy, Kaushiki ;

Bhattacharjee, Debotosh ;

Nasipuri, Mita ;

Krejcar, Ondrej .

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70

[4]

Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9

[5]

Chen J., 2021, PREPRINT

[6]

Dai W., 2024, arXiv

[7]

Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26

[8] The Importance of Skip Connections in Biomedical Image Segmentation [J].

Drozdzal, Michal ;

Vorontsov, Eugene ;

Chartrand, Gabriel ;

Kadoury, Samuel ;

Pal, Chris .

DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS, 2016, 10008 :179-187

[9] Selective Feature Aggregation Network with Area-Boundary Constraints for Polyp Segmentation [J].

Fang, Yuqi ;

Chen, Cheng ;

Yuan, Yixuan ;

Tong, Kai-yu .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT I, 2019, 11764 :302-310

[10] MultiResUNet : Rethinking the U-Net architecture for multimodal biomedical image segmentation [J].

Ibtehaz, Nabil ;

Rahman, M. Sohel .

NEURAL NETWORKS, 2020, 121 :74-87

← 1 2 3 →