Automatic Medical Image Segmentation with Vision Transformer

被引:0
|
作者
Zhang, Jie [1 ,2 ]
Li, Fan [1 ]
Zhang, Xin [1 ]
Wang, Huaijun [1 ]
Hei, Xinhong [1 ]
机构
[1] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[2] Xian Univ Technol, Prov Key Lab Network Comp & Secur Technol, Xian 710048, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 07期
关键词
automatic segmentation; transformer; medical images; NETWORKS;
D O I
10.3390/app14072741
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application This work investigates the application of transformers for general medical image segmentation tasks. The key future directions involve developing unified AI-based system for integrated diagnosis, interactive segmentation to enable human.Abstract Automatic image segmentation is vital for the computer-aided determination of treatment directions, particularly in terms of labelling lesions or infected areas. However, the manual labelling of disease regions is inconsistent and a time-consuming assignment. Meanwhile, radiologists' comments are exceedingly subjective, regularly impacted by personal clinical encounters. To address these issues, we proposed a transformer learning strategy to automatically recognize infected areas in medical images. We firstly utilize a parallel partial decoder to aggregate high-level features and then generate a global feature map. Explicit edge attention and implicit reverse attention are applied to demonstrate boundaries and enhance their expression. Additionally, to alleviate the need for extensive labeled data, we propose a segmentation network combining propagation and transformer architectures that requires only a small amount of labeled data while leveraging fundamentally unlabeled images. The attention mechanisms are integrated within convolutional networks, keeping their global structures intact. Standalone transformers connected straightforwardly and receiving image patches can also achieve impressive segmentation performance. Our network enhanced the learning ability and attained a higher quality execution. We conducted a variety of ablation studies to demonstrate the adequacy of each modelling component. Experiments conducted across various medical imaging modalities illustrate that our model beats the most popular segmentation models. The comprehensive results also show that our transformer architecture surpasses established frameworks in accuracy while better preserving the natural variations in anatomy. Both quantitatively and qualitatively, our model achieves a higher overlap with ground truth segmentations and improved boundary adhesion.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] LViT: Language Meets Vision Transformer in Medical Image Segmentation
    Li, Zihan
    Li, Yunxiang
    Li, Qingde
    Wang, Puyang
    Guo, Dazhou
    Lu, Le
    Jin, Dakai
    Zhang, You
    Hong, Qingqi
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (01) : 96 - 107
  • [2] Grouped multi-scale vision transformer for medical image segmentation
    Zexuan Ji
    Zheng Chen
    Xiao Ma
    Scientific Reports, 15 (1)
  • [3] MetaSwin: a unified meta vision transformer model for medical image segmentation
    Lee, Soyeon
    Lee, Minhyeok
    PEERJ COMPUTER SCIENCE, 2024, 10 : 1 - 17
  • [4] MetaSwin: a unified meta vision transformer model for medical image segmentation
    Lee, Soyeon
    Lee, Minhyeok
    PeerJ Computer Science, 2024, 10 : 1 - 17
  • [5] High-Resolution Swin Transformer for Automatic Medical Image Segmentation
    Wei, Chen
    Ren, Shenghan
    Guo, Kaitai
    Hu, Haihong
    Liang, Jimin
    SENSORS, 2023, 23 (07)
  • [6] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhang, Zhixin
    Jiang, Shuhao
    Pan, Xuhua
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2265 - 2275
  • [7] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhixin Zhang
    Shuhao Jiang
    Xuhua Pan
    Signal, Image and Video Processing, 2024, 18 : 2265 - 2275
  • [8] ViTBIS: Vision Transformer for Biomedical Image Segmentation
    Sagar, Abhinav
    CLINICAL IMAGE-BASED PROCEDURES, DISTRIBUTED AND COLLABORATIVE LEARNING, ARTIFICIAL INTELLIGENCE FOR COMBATING COVID-19 AND SECURE AND PRIVACY-PRESERVING MACHINE LEARNING, CLIP 2021, DCL 2021, LL-COVID19 2021, PPML 2021, 2021, 12969 : 34 - 45
  • [9] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
    Du, Siyi
    Bayasi, Nourhan
    Hamarneh, Ghassan
    Garbi, Rafeef
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 448 - 458
  • [10] Laformer: Vision Transformer for Panoramic Image Semantic Segmentation
    Yuan, Zheng
    Wang, Junhua
    Lv, Yuxin
    Wang, Ding
    Fang, Yi
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1792 - 1796