E-TransConvNet: An enhanced transformer and convolutional network for medical image segmentation from ultrasound and CT images

被引:0
作者
Atabansi, Chukwuemeka Clinton [1 ]
Nie, Jing [1 ]
Huang, Jiachen [1 ]
Feng, Yujie [2 ]
Liu, Haijun [1 ]
Xie, Jin [3 ]
Zhou, Xichuan [1 ]
机构
[1] Chongqing Univ, Sch Microelect & Commun Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ Canc Hosp, Chongqing Key Lab Intelligent Oncol Breast Canc i, Chongqing 400030, Peoples R China
[3] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Medical image segmentation; Transformer; CNN; Ultrasound images;
D O I
10.1016/j.eswa.2025.128022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical image segmentation in ultrasound and CT images plays a crucial role in the prevention, diagnosis, and treatment of various diseases and cancers. Convolutional neural networks (CNNs) are computationally efficient for medical image segmentation but struggle to capture long-range dependencies, which are essential for global contextual understanding. Transformers address this limitation through self-attention mechanisms. However, architectures composed solely of transformer blocks impose prohibitive computational costs and compromise local spatial continuity. To overcome these challenges, we propose E-TransConvNet, a hybrid framework that integrates a CNN backbone with a single enhanced transformer block, effectively balancing computational efficiency, local continuity, and global contextual reasoning. The enhanced transformer block introduces two novel components: a Convolution Attention Block (CAB), which captures global context while simultaneously modeling local spatial relationships by integrating depth-wise convolutional operations into multi-head self-attention mechanisms, and a Progressive Feature Enhancement Feed-Forward Network (PFE-FFN), which further refines boundary delineation by selectively capturing features across various receptive fields to preserve fine-grained structural details. Extensive experiments are conducted on six public medical image segmentation datasets spanning two medical imaging modalities to validate the capacity and effectiveness of E-TransConvNet. Our model achieves superior segmentation accuracy and enhanced boundary delineation across all evaluated datasets. Specifically, E-TransConvNet outperformed MedSAM by an average of 3.55 % in terms of Intersection over Union (IoU) across the six datasets. The source code for E-TransConvNet will be available at https://github.com/S-domain/E-TransConvNet.
引用
收藏
页数:13
相关论文
共 54 条
[1]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[2]  
Ayidzoe MA, 2022, TURK J ELECTR ENG CO, V30, P978, DOI [10.55730/1300-0632.3822, 10.3906/elk-2107-146]
[3]   DAE-Former: Dual Attention-Guided Efficient Transformer for Medical Image Segmentation [J].
Azad, Reza ;
Arimond, Rene ;
Aghdam, Ehsan Khodapanah ;
Kazerouni, Amirhossein ;
Merhof, Dorit .
PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2023, 2023, 14277 :83-95
[4]   Skin lesion segmentation by using object detection networks, DeepLab3+, and active contours [J].
Bagheri, Fatemeh ;
Tarokh, Mohammad Jafar ;
Ziaratban, Majid .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (07) :2489-2507
[5]   BPAT-UNet: Boundary preserving assembled transformer UNet for ultrasound thyroid nodule segmentation [J].
Bi, Hui ;
Cai, Chengjie ;
Sun, Jiawei ;
Jiang, Yibo ;
Lu, Gang ;
Shu, Huazhong ;
Ni, Xinye .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 238
[6]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[7]   Asymmetric U-shaped network with hybrid attention mechanism for kidney ultrasound images segmentation [J].
Chen, Gong -Ping ;
Zhao, Yu ;
Dai, Yu ;
Zhang, Jian-Xun ;
Yin, Xiao-Tao ;
Cui, Liang ;
Qian, Jiang .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
[8]   Rethinking the unpretentious U-net for medical ultrasound image segmentation [J].
Chen, Gongping ;
Li, Lei ;
Zhang, Jianxun ;
Dai, Yu .
PATTERN RECOGNITION, 2023, 142
[9]   DSEU-net: A novel deep supervision SEU-net for medical ultrasound image segmentation [J].
Chen, Gongping ;
Liu, Yuming ;
Qian, Jiang ;
Zhang, Jianxun ;
Yin, Xiaotao ;
Cui, Liang ;
Dai, Yu .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
[10]   RRCNet: Refinement residual convolutional network for breast ultrasound images segmentation [J].
Chen, Gongping ;
Dai, Yu ;
Zhang, Jianxun .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117