Scale-wise discriminative region learning for medical image segmentation

被引：0

作者：

Zhang, Jing ^{[1
]}

Lai, Xiaoting ^{[1
]}

Yang, Hai ^{[1
]}

Ruan, Tong ^{[1
]}

机构：

[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 89卷

基金：

上海市自然科学基金;

关键词：

Discriminative region; Deformable attention; Medical image segmentation; TRANSFORMER; ATTENTION;

D O I：

10.1016/j.bspc.2023.105663

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Vision Transformer (ViT) has shown comparable capabilities to convolutional neural networks for medical image segmentation in recent years. However, most ViT-based models fail to effectively model long-range feature dependencies at multi-scales and ignore the crucial importance of the semantic richness of features at each scale for medical segmentation. To address this problem, we propose a novel Scale-wise Discriminative Region Learning Network (SDRL-Net) in this paper, which guides the model to focus on salient regions by differential modeling the global context relationships at each scale. In SDRL-Net, a scale-wise enhancement module is proposed to achieve more distinguishing feature representations in the encoder by concentrating spatially localized information and differentiated regional interactions simultaneously. Furthermore, we propose a multi-scale upsampling module that focuses on global multi-scale information through pyramid attention and then complements the local upsampling information to achieve better segmentation. Extensive experiments on three widely used public datasets demonstrate that our proposed SDRL-Net can perform excellently and outperform most state-of-the-art medical image segmentation methods. Code is available at https://github.com/MiniCoCo-be/SDRL-Net.

引用

页数：9

共 52 条

[41] Wang HN, 2022, AAAI CONF ARTIF INTE, P2441
[42] Vision Transformer with Deformable Attention
Xia, Zhuofan
Pan, Xuran
Song, Shiji
Li, Li Erran
Huang, Gao
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4784 - 4793
[43] Weighted Res-UNet for High-quality Retina Vessel Segmentation
Xiao, Xiao
Lian, Sheng
Luo, Zhiming
Li, Shaozi
[J]. 2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 327 - 331
[44] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
Xie, Yutong
Zhang, Jianpeng
Shen, Chunhua
Xia, Yong
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
[45] LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation
Xu, Guoping
Zhang, Xuan
He, Xinwei
Wu, Xinglong
[J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 42 - 53
[46] Yao C, 2021, Arxiv, DOI arXiv:2107.05188
[47] CAMS-Net: An attention-guided feature selection network for rib segmentation in chest X-rays
Zhang, Dandan
Wang, Hongyu
Deng, Jiahui
Wang, Tonghui
Shen, Cong
Feng, Jun
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 156
[48] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
Zhang, Jing
Qin, Qiuge
Ye, Qi
Ruan, Tong
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
[49] Zhang Y., 2021, LECT NOTES COMPUT SC, P14, DOI [DOI 10.1007/978-3-030-87193-2_2, 10.1007/978-3-030-87193-22]
[50] Zhou DQ, 2021, Arxiv, DOI arXiv:2103.11886

← 1 2 3 4 5 6 →