MRCA-UNet: A Multiscale Recombined Channel Attention U-Net Model for Medical Image Segmentation

Times cited: 0
Authors
Liu, Lei [1, 2]
Li, Xiang [1]
Wang, Shuai [1, 2]
Wang, Jun [3 ]
Melo, Silas N. [4 ]
Affiliations
[1] Huaibei Normal Univ, Sch Comp Sci & Technol, Huaibei 235000, Peoples R China
[2] Huaibei Key Lab Digital Multimedia Intelligent Inf, Sch Comp Sci & Technol, Huaibei 235000, Peoples R China
[3] Hebei Univ, Coll Elect & Informat Engn, Baoding 071000, Peoples R China
[4] Univ Estadual Maranhao, Dept Geog, BR-65055000 Sao Luis, Brazil
Source
SYMMETRY-BASEL | 2025 / Vol. 17 / Issue 06
Keywords
multiscale information; channel attention; medical image segmentation
DOI
10.3390/sym17060892
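Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract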
Deep learning techniques play a crucial role in medical image segmentation for diagnostic purposes, and both traditional convolutional neural networks (CNNs) and emerging transformers have achieved satisfactory results. CNN-based methods focus on extracting local image features, which is beneficial for handling image details and textural features. However, the receptive fields of CNNs are relatively small, resulting in poor performance on images with long-range dependencies. Conversely, transformer-based methods handle global information effectively but incur significant computational complexity when building long-range dependencies. They also lack the ability to perceive image details and to exploit channel features. These problems can result in unclear segmentation and blurred boundaries. Accordingly, this study proposes a multiscale recombined channel attention (MRCA) module that simultaneously extracts global and local features and explores channel features during feature fusion. Specifically, the MRCA module first extracts image features through multiple branches and performs operations such as blocking, shifting, and aggregating on the image at different scales, which enables the model to recognize multiscale information both locally and globally. Feature selection is then performed to enhance the predictive capability of the model. Finally, features from different branches are connected and recombined across channels to complete the feature fusion. Building on this full exploration of channel features, an MRCA-based U-Net (MRCA-UNet) framework is proposed for medical image segmentation.
Experiments conducted on the Synapse multi-organ segmentation (Synapse) dataset and the International Skin Imaging Collaboration (ISIC-2018) dataset demonstrate the competitive segmentation performance of the proposed MRCA-UNet, achieving an average Dice Similarity Coefficient (DSC) of 81.61% and a Hausdorff Distance (HD) of 23.36 on Synapse and an Accuracy of 95.94% on ISIC-2018.
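The average DSC reported above is an overlap score between predicted and reference masks. As a minimal sketch, assuming masks are given as nested lists of integer labels, the snippet below computes the standard Dice formula 2|A∩B| / (|A| + |B|) per class and averages it over the foreground classes. The function names `dice_coefficient` and `mean_dice` and the toy label maps are illustrative assumptions, not the authors' evaluation code, and the exact averaging protocol used in the paper may differ.

```python
def dice_coefficient(pred, target):
    """Dice score for two binary masks given as nested lists of 0/1 values."""
    p = [v for row in pred for v in row]
    t = [v for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(1 for a, b in zip(p, t) if a and b)
    total = sum(1 for a in p if a) + sum(1 for b in t if b)
    if total == 0:
        # Both masks are empty: treated here as perfect agreement (a convention).
        return 1.0
    return 2.0 * intersection / total


def mean_dice(pred_labels, target_labels, classes):
    """Average the per-class Dice score over the given class labels."""
    scores = []
    for c in classes:
        pred_mask = [[1 if v == c else 0 for v in row] for row in pred_labels]
        target_mask = [[1 if v == c else 0 for v in row] for row in target_labels]
        scores.append(dice_coefficient(pred_mask, target_mask))
    return sum(scores) / len(scores)


# Toy 2x3 label maps (hypothetical data); 0 is background, 1 and 2 are organs.
pred_labels = [[1, 1, 2],
               [0, 2, 2]]
target_labels = [[1, 2, 2],
                 [0, 2, 0]]
print(round(mean_dice(pred_labels, target_labels, classes=[1, 2]), 4))
```

In this toy case each foreground class overlaps its reference in two thirds of the combined mask area, so the averaged score is 2/3.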
Pages: 17