DENSELY CONNECTED SWIN-UNET FOR MULTISCALE INFORMATION AGGREGATION IN MEDICAL IMAGE SEGMENTATION

被引:5
作者
Wang, Ziyang [1 ]
Su, Meiwen [2 ]
Zheng, Jian-Qing [3 ]
Liu, Yang [4 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Univ Oxford, Kennedy Inst Rheumatol, Oxford, England
[4] Univ Plymouth, Dept Comp Sci, Plymouth, Devon, England
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
Semantic Segmentation; UNet; Vision Transformer;
D O I
10.1109/ICIP49359.2023.10222451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image semantic segmentation is a dense prediction task in computer vision that is dominated by deep learning techniques in recent years. UNet, which is a symmetric encoder-decoder end-to-end Convolutional Neural Network (CNN) with skip connections, has shown promising performance. Aiming to process the multiscale feature information efficiently, we propose a new Densely Connected Swin-UNet (DCS-UNet) with multiscale information aggregation for medical image segmentation. Firstly, inspired by Swin-Transformer to model long-range dependencies via shift-window-based self-attention, this work proposes the use of fully ViT-based network blocks with a shift-window approach, resulting in a purely self-attention-based U-shape segmentation network. The relevant layers including feature sampling and image tokenization are re-designed to align with the ViT fashion. Secondly, a full-scale deep supervision scheme is developed to process the aggregated feature map with various resolutions generated by different levels of decoders. Thirdly, dense skip connections are proposed that allow the semantic feature information to be thoroughly transferred from different levels of encoders to lower level decoders. Our proposed method is validated on a public benchmark MRI Cardiac segmentation data set with comprehensive validation metrics showing competitive performance against other variant encoder-decoder networks. The code is available at https://github.com/ziyangwang007/VIT4UNet.
引用
收藏
页码:940 / 944
页数:5
相关论文
共 50 条
  • [41] DSKCA-UNet: Dynamic selective kernel channel attention for medical image segmentation
    Shen, Longfeng
    Wang, Qiong
    Zhang, Yingjie
    Qin, Fenglan
    Jin, Hengjun
    Zhao, Wei
    MEDICINE, 2023, 102 (39) : E35328
  • [42] MH UNet: A Multi-Scale Hierarchical Based Architecture for Medical Image Segmentation
    Ahmad, Parvez
    Jin, Hai
    Alroobaea, Roobaea
    Qamar, Saqib
    Zheng, Ran
    Alnajjar, Fady
    Aboudi, Fathia
    IEEE ACCESS, 2021, 9 : 148384 - 148408
  • [43] Half-UNet: A Simplified U-Net Architecture for Medical Image Segmentation
    Lu, Haoran
    She, Yifei
    Tie, Jun
    Xu, Shengzhou
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [44] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
    Amer, Alyaa
    Lambrou, Tryphon
    Ye, Xujiong
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [45] MD-UNet: a medical image segmentation network based on mixed depthwise convolution
    Yun Liu
    Shuanglong Yao
    Xing Wang
    Ji Chen
    Xiaole Li
    Medical & Biological Engineering & Computing, 2024, 62 : 1201 - 1212
  • [46] DI-Unet: Dimensional interaction self-attention for medical image segmentation
    Wu, Yanlin
    Wang, Guanglei
    Wang, Zhongyang
    Wang, Hongrui
    Li, Yan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [47] RFE-UNet: Remote Feature Exploration with Local Learning for Medical Image Segmentation
    Zhong, Xiuxian
    Xu, Lianghui
    Li, Chaoqun
    An, Lijing
    Wang, Liejun
    SENSORS, 2023, 23 (13)
  • [48] Swin SMT: Global Sequential Modeling for Enhancing 3D Medical Image Segmentation
    Plotka, Szymon
    Chrabaszcz, Maciej
    Biecek, Przemyslaw
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 689 - 698
  • [49] The Domain Shift Problem of Medical Image Segmentation and Vendor-Adaptation by Unet-GAN
    Yan, Wenjun
    Wang, Yuanyuan
    Gu, Shengjia
    Huang, Lu
    Yan, Fuhua
    Xia, Liming
    Tao, Qian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 623 - 631
  • [50] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
    Garbaz, Anass
    Oukdach, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    COGNITIVE COMPUTATION, 2025, 17 (02)