MSDANet: A multi-scale dilation attention network for medical image segmentation

被引：17

作者：

Zhang, Jinquan ^{[1
]}

Luan, Zhuang ^{[1
]}

Ni, Lina ^{[1
,2
]}

Qi, Liang ^{[1
]}

Gong, Xu ^{[1
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[2] Tongji Univ, Minist Educ Embedded Syst & Serv Comp, Key Lab, Shanghai 201804, Peoples R China

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 90卷

基金：

美国国家科学基金会;

关键词：

Deep learning; Medical image segmentation; Multi-scale dilation; Attention mechanism; NEURAL-NETWORKS; NET;

D O I：

10.1016/j.bspc.2023.105889

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Deep learning shows excellent performance in medical image segmentation. However, a pooling operation in its encoding stage leads to feature loss and the ability of multi-scale contextual information extraction is insufficient. In addition, there is a semantic gap between low-level features and high-level features in the encoder-decoder. Inspired by an U-shaped network (UNet), this work proposes a multi-scale dilation attention network (MSDANet) for medical image segmentation. Specifically, it is mainly composed of a parallel dilation pooling module (PD), a multi-scale channel attention mechanism (MSCA), a multilayer perceptron squeeze and excitation module (MSE), a big kernel convolution module (BC), and a skip feature pyramid module (SFP). The proposed PD reduces the loss of subtle features during downsampling. In order to improve the extraction of effective features, MSCA, MSE and BC are used in encoding and decoding stages. In the skip connection stage, SPF is used to reduce the difference between the low-level and high-level semantic information to accelerate the network learning. We validate the proposed model on two publicly available medical image datasets. The results show that MSDANet has superior performance on several evaluation metrics compared to state-of-the-art methods.

引用

页数：14

共 45 条

[1] Dataset of breast ultrasound images
Al-Dhabyani, Walid
Gomaa, Mohammed
Khaled, Hussien
Fahmy, Aly
[J]. DATA IN BRIEF, 2020, 28
[2] Ba J.L., 2016, arXiv
[3] WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians
Bernal, Jorge
Javier Sanchez, F.
Fernandez-Esparrach, Gloria
Gil, Debora
Rodriguez, Cristina
Vilarino, Fernando
[J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 : 99 - 111
[4] Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
[5] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[6] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[8] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9] Multiscale fused network with additive channel-spatial attention for image segmentation
Gao, Chengling
Ye, Hailiang
Cao, Feilong
Wen, Chenglin
Zhang, Qinghua
Zhang, Feng
[J]. KNOWLEDGE-BASED SYSTEMS, 2021, 214
[10] Glorot X., 2011, PMLR, P315

← 1 2 3 4 5 →