Multi-TranResUnet: An Improved Transformer Network for Solving Multi-Scale Issues in Image Segmentation

被引:1
作者
Kang, Yajing [1 ]
Cheng, Shuai [1 ]
Guo, Liang [2 ]
Zheng, Chao [1 ]
Zhao, Jizhuang [1 ]
机构
[1] China Telecom Res Inst, Beijing 102209, Peoples R China
[2] CAICT, Inst Cloud Comp & Big Data, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Feature extraction; Medical diagnostic imaging; Convolutional neural networks; Accuracy; Computational modeling; Low latency communication; Medical image segmentation; deep learning; transformer; low-latency model;
D O I
10.1109/ACCESS.2024.3457823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-driven medical image segmentation marks a significant milestone in the evolution of intelligent healthcare systems. Despite remarkable accuracy achievements, real-world clinical applications still grapple with complex challenges, particularly in handling multi-scale medical targets. This paper introduces a novel and efficient medical image segmentation network that leverages Transformer technology. The proposed network utilizes the Transformer's global feature extraction capabilities, enriched with spatial context, to substantially elevate segmentation accuracy. Additionally, the fusion encoder we build by combining Transformer modules and Convolutional structures through feature fusion strategies can improve feature extraction capabilities. Acknowledging the computational demands of Transformer models in practical scenarios, we have meticulously optimized our Transformer architecture. This optimization focuses on reducing parameter complexity and inference latency, tailoring the model to address the typical sample scarcity in medical applications. We evaluated our model on two different medical datasets: the 2018 Lesion Boundary Segmentation Challenge, the 2018 Data Science Bowl Challenge and the Kvasir-Instrument dataset. Our model demonstrates state-of-the-art performance in both Dice and MIoU metrics, while maintaining robust real-time processing capabilities. Our code will be released at https://github.com/migouKang/Multi-TranResUnet.
引用
收藏
页码:129000 / 129011
页数:12
相关论文
共 50 条
[41]   Transformer-Based Multi-Scale Feature Remote Sensing Image Classification Model [J].
Sun, Ting ;
Li, Jun ;
Zhou, Xiangrui ;
Chen, Zan .
IEEE ACCESS, 2025, 13 :34095-34104
[42]   MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation [J].
Ao, Yu ;
Shi, Weili ;
Ji, Bai ;
Miao, Yu ;
He, Wei ;
Jiang, Zhengang .
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
[43]   A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer [J].
Cui, Jianguo ;
Wang, Liejun ;
Jiang, Shaochen .
APPLIED SCIENCES-BASEL, 2023, 13 (19)
[44]   Image Quality Assessment Based on Multi-Scale Representation and Shifting Transformer [J].
Fu, Geng ;
Wang, Ziyu ;
Zhang, Cuijuan ;
Qi, Zerong ;
Hu, Mingzheng ;
Fu, Shujun ;
Zhang, Yunfeng .
IEEE ACCESS, 2025, 13 :24276-24286
[45]   Multi-scale nested UNet with transformer for colorectal polyp segmentation [J].
Wang, Zenan ;
Liu, Zhen ;
Yu, Jianfeng ;
Gao, Yingxin ;
Liu, Ming .
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (06)
[46]   MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation [J].
Xi, Heran ;
Dong, Haoji ;
Sheng, Yue ;
Cui, Hui ;
Huang, Chengying ;
Li, Jinbao ;
Zhu, Jinghua .
PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01)
[47]   EMSFomer: Efficient Multi-Scale Transformer for Real-Time Semantic Segmentation [J].
Xia, Zhengyu ;
Kim, Joohee .
IEEE ACCESS, 2025, 13 :18239-18252
[48]   Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation [J].
Zhou, Wuyi ;
Zeng, Xianhua ;
Zhou, Mingkun .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
[49]   LTMSegnet: Lightweight multi-scale medical image segmentation combining Transformer and MLP [J].
Huang, Xin ;
Tang, Hongxiang ;
Ding, Yan ;
Li, Yuanyuan ;
Zhu, Zhiqin ;
Yang, Pan .
Computers in Biology and Medicine, 2024, 183
[50]   MSA-MaxNet: Multi-Scale Attention Enhanced Multi-Axis Vision Transformer Network for Medical Image Segmentation [J].
Wu, Wei ;
Huang, Junfeng ;
Zhang, Mingxuan ;
Li, Yichen ;
Yu, Qijia ;
Zhao, Qi .
JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2024, 28 (24)