MSDANet: A multi-scale dilation attention network for medical image segmentation

Cited by: 17
Authors
Zhang, Jinquan [1]
Luan, Zhuang [1]
Ni, Lina [1, 2]
Qi, Liang [1]
Gong, Xu [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Tongji Univ, Minist Educ Embedded Syst & Serv Comp, Key Lab, Shanghai 201804, Peoples R China
Funding
National Science Foundation (United States);
Keywords
Deep learning; Medical image segmentation; Multi-scale dilation; Attention mechanism; NEURAL-NETWORKS; NET;
DOI
10.1016/j.bspc.2023.105889
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Deep learning shows excellent performance in medical image segmentation. However, the pooling operation in the encoding stage leads to feature loss, and the ability to extract multi-scale contextual information is insufficient. In addition, there is a semantic gap between low-level and high-level features in the encoder-decoder. Inspired by the U-shaped network (UNet), this work proposes a multi-scale dilation attention network (MSDANet) for medical image segmentation. Specifically, it is mainly composed of a parallel dilation pooling module (PD), a multi-scale channel attention mechanism (MSCA), a multilayer perceptron squeeze and excitation module (MSE), a big kernel convolution module (BC), and a skip feature pyramid module (SFP). The proposed PD reduces the loss of subtle features during downsampling. To improve the extraction of effective features, MSCA, MSE and BC are used in the encoding and decoding stages. In the skip connection stage, SFP is used to reduce the difference between low-level and high-level semantic information and thereby accelerate network learning. We validate the proposed model on two publicly available medical image datasets. The results show that MSDANet outperforms state-of-the-art methods on several evaluation metrics.
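The abstract does not specify how the dilation branches are built, so the following is only a minimal sketch of the general idea behind multi-scale dilation, not the authors' PD module. It assumes a 1-D signal, a single shared 3-tap kernel, and dilation rates of 1, 2 and 4; the function names `dilated_conv1d` and `multi_scale_features` are hypothetical and chosen for illustration.

```python
def dilated_conv1d(signal, kernel, dilation):
    """'Valid' 1-D correlation whose kernel taps are `dilation` samples apart."""
    # Number of input samples covered by one output value (the receptive field).
    span = (len(kernel) - 1) * dilation + 1
    out = []
    for start in range(len(signal) - span + 1):
        taps = signal[start:start + span:dilation]
        out.append(sum(t * w for t, w in zip(taps, kernel)))
    return out


def multi_scale_features(signal, kernel, dilations=(1, 2, 4)):
    """Run parallel dilated branches and crop them to a common length."""
    branches = [dilated_conv1d(signal, kernel, d) for d in dilations]
    min_len = min(len(b) for b in branches)
    # Left-aligned crop so the branches can be stacked; assumed for simplicity.
    return [b[:min_len] for b in branches]


signal = [float(i) for i in range(16)]   # a simple ramp
kernel = [1.0, 0.0, -1.0]                # difference between first and last tap
features = multi_scale_features(signal, kernel)
for rate, branch in zip((1, 2, 4), features):
    print(rate, branch[0], len(branch))
```

With the same kernel, each branch compares samples that are further apart (2, 4 and 8 positions on this ramp), which is how larger dilation rates widen the context seen by a layer without adding weights.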
Pages: 14