Multi-TranResUnet: An Improved Transformer Network for Solving Multi-Scale Issues in Image Segmentation

被引:0
作者
Kang, Yajing [1 ]
Cheng, Shuai [1 ]
Guo, Liang [2 ]
Zheng, Chao [1 ]
Zhao, Jizhuang [1 ]
机构
[1] China Telecom Res Inst, Beijing 102209, Peoples R China
[2] CAICT, Inst Cloud Comp & Big Data, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Feature extraction; Medical diagnostic imaging; Convolutional neural networks; Accuracy; Computational modeling; Low latency communication; Medical image segmentation; deep learning; transformer; low-latency model;
D O I
10.1109/ACCESS.2024.3457823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-driven medical image segmentation marks a significant milestone in the evolution of intelligent healthcare systems. Despite remarkable accuracy achievements, real-world clinical applications still grapple with complex challenges, particularly in handling multi-scale medical targets. This paper introduces a novel and efficient medical image segmentation network that leverages Transformer technology. The proposed network utilizes the Transformer's global feature extraction capabilities, enriched with spatial context, to substantially elevate segmentation accuracy. Additionally, the fusion encoder we build by combining Transformer modules and Convolutional structures through feature fusion strategies can improve feature extraction capabilities. Acknowledging the computational demands of Transformer models in practical scenarios, we have meticulously optimized our Transformer architecture. This optimization focuses on reducing parameter complexity and inference latency, tailoring the model to address the typical sample scarcity in medical applications. We evaluated our model on two different medical datasets: the 2018 Lesion Boundary Segmentation Challenge, the 2018 Data Science Bowl Challenge and the Kvasir-Instrument dataset. Our model demonstrates state-of-the-art performance in both Dice and MIoU metrics, while maintaining robust real-time processing capabilities. Our code will be released at https://github.com/migouKang/Multi-TranResUnet.
引用
收藏
页码:129000 / 129011
页数:12
相关论文
共 50 条
  • [31] A Medical Image Segmentation Network with Multi-Scale and Dual-Branch Attention
    Zhu, Cancan
    Cheng, Ke
    Hua, Xuecheng
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [32] Fusion multi-scale Transformer skin lesion segmentation algorithm
    Liang L.-M.
    Zhou L.-S.
    Yin J.
    Sheng X.-Q.
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2024, 54 (04): : 1086 - 1098
  • [33] Automatic lumbar spinal MRI image segmentation with a multi-scale attention network
    Li, Haixing
    Luo, Haibo
    Huan, Wang
    Shi, Zelin
    Yan, Chongnan
    Wang, Lanbo
    Mu, Yueming
    Liu, Yunpeng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (18) : 11589 - 11602
  • [34] Automatic lumbar spinal MRI image segmentation with a multi-scale attention network
    Haixing Li
    Haibo Luo
    Wang Huan
    Zelin Shi
    Chongnan Yan
    Lanbo Wang
    Yueming Mu
    Yunpeng Liu
    [J]. Neural Computing and Applications, 2021, 33 : 11589 - 11602
  • [35] Sub-pixel multi-scale fusion network for medical image segmentation
    Jing Li
    Qiaohong Chen
    Xian Fang
    [J]. Multimedia Tools and Applications, 2024, 83 (41) : 89355 - 89373
  • [36] Transformer-Based Multi-Scale Feature Remote Sensing Image Classification Model
    Sun, Ting
    Li, Jun
    Zhou, Xiangrui
    Chen, Zan
    [J]. IEEE ACCESS, 2025, 13 : 34095 - 34104
  • [37] A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
    Cui, Jianguo
    Wang, Liejun
    Jiang, Shaochen
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [38] Multi-scale nested UNet with transformer for colorectal polyp segmentation
    Wang, Zenan
    Liu, Zhen
    Yu, Jianfeng
    Gao, Yingxin
    Liu, Ming
    [J]. JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (06):
  • [39] MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation
    Ao, Yu
    Shi, Weili
    Ji, Bai
    Miao, Yu
    He, Wei
    Jiang, Zhengang
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [40] Image Quality Assessment Based on Multi-Scale Representation and Shifting Transformer
    Fu, Geng
    Wang, Ziyu
    Zhang, Cuijuan
    Qi, Zerong
    Hu, Mingzheng
    Fu, Shujun
    Zhang, Yunfeng
    [J]. IEEE ACCESS, 2025, 13 : 24276 - 24286