Scale-wise discriminative region learning for medical image segmentation

被引:0
作者
Zhang, Jing [1 ]
Lai, Xiaoting [1 ]
Yang, Hai [1 ]
Ruan, Tong [1 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
基金
上海市自然科学基金;
关键词
Discriminative region; Deformable attention; Medical image segmentation; TRANSFORMER; ATTENTION;
D O I
10.1016/j.bspc.2023.105663
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Vision Transformer (ViT) has shown comparable capabilities to convolutional neural networks for medical image segmentation in recent years. However, most ViT-based models fail to effectively model long-range feature dependencies at multi-scales and ignore the crucial importance of the semantic richness of features at each scale for medical segmentation. To address this problem, we propose a novel Scale-wise Discriminative Region Learning Network (SDRL-Net) in this paper, which guides the model to focus on salient regions by differential modeling the global context relationships at each scale. In SDRL-Net, a scale-wise enhancement module is proposed to achieve more distinguishing feature representations in the encoder by concentrating spatially localized information and differentiated regional interactions simultaneously. Furthermore, we propose a multi-scale upsampling module that focuses on global multi-scale information through pyramid attention and then complements the local upsampling information to achieve better segmentation. Extensive experiments on three widely used public datasets demonstrate that our proposed SDRL-Net can perform excellently and outperform most state-of-the-art medical image segmentation methods. Code is available at https://github.com/MiniCoCo-be/SDRL-Net.
引用
收藏
页数:9
相关论文
共 52 条
  • [41] Wang HN, 2022, AAAI CONF ARTIF INTE, P2441
  • [42] Vision Transformer with Deformable Attention
    Xia, Zhuofan
    Pan, Xuran
    Song, Shiji
    Li, Li Erran
    Huang, Gao
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4784 - 4793
  • [43] Weighted Res-UNet for High-quality Retina Vessel Segmentation
    Xiao, Xiao
    Lian, Sheng
    Luo, Zhiming
    Li, Shaozi
    [J]. 2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 327 - 331
  • [44] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [45] LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation
    Xu, Guoping
    Zhang, Xuan
    He, Xinwei
    Wu, Xinglong
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 42 - 53
  • [46] Yao C, 2021, Arxiv, DOI arXiv:2107.05188
  • [47] CAMS-Net: An attention-guided feature selection network for rib segmentation in chest X-rays
    Zhang, Dandan
    Wang, Hongyu
    Deng, Jiahui
    Wang, Tonghui
    Shen, Cong
    Feng, Jun
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 156
  • [48] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
    Zhang, Jing
    Qin, Qiuge
    Ye, Qi
    Ruan, Tong
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [49] Zhang Y., 2021, LECT NOTES COMPUT SC, P14, DOI [DOI 10.1007/978-3-030-87193-2_2, 10.1007/978-3-030-87193-22]
  • [50] Zhou DQ, 2021, Arxiv, DOI arXiv:2103.11886