MSA-Net: Multiscale spatial attention network for medical image segmentation

被引:17
作者
Fu, Zhaojin [1 ,2 ]
Li, Jinjiang [1 ,2 ,3 ]
Hua, Zhen [1 ,2 ]
机构
[1] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
[2] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Peoples R China
[3] Coinnovat Ctr Shandong Coll & Univ Future Intellig, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical Image Segmenta-tion; Multiscale Feature Extrac-tion; Deep learning; Attention Mechanism;
D O I
10.1016/j.aej.2023.02.039
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Background: Edge accuracy and positional accuracy are the two goals pursued by med-ical image segmentation. In clinical medicine diagnosis and research, these two goals enable medical image segmentation techniques to help in the effective determination of lesions and lesion analysis. At present, U-Net has become the most important network in the field of image segmentation, and the technologies used in various achievements are derived from its architecture, which also proves from practice that the network structure proposed by U-Net is effective. Objective: We have found in a large number of experiments that classical networks indeed show good performance in the field of medical segmentation, but there are still some deficiencies in edge determination and network robustness, especially in the face of blurred edges, the processing results often fail to achieve the expected results. In order to be able to locate segmentation targets and achieve effective determination of blurred edges, a Multiscale Spatial Attention Network (MSA-Net) is proposed as in Fig. 1. Method: In MSA-Net, the Multiscale Pyramid Attention Block (MPAB) is created to enhance the capture of high-level semantic information. In addition, the network uses ASPP, which not only expands the network's field of view, but also captures richer feature information. In the decoding phase, the Feature Fusion Block (FFB) is created to enable better focus on different dimensional information features and to enhance the feature fusion process. Result: To demonstrate the effectiveness of the network, we validate the performance of MSA-Net on four datasets (ISIC2016, DSB2018, JSRT, GlaS) in three different categories. Compared with mainstream networks, MSA-Net shows better results in detail features, target localization, and edge processing. Finally, we also demonstrate the effectiveness of the MSA-Net architecture through ablation experiments. (c) 2023 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:453 / 473
页数:21
相关论文
共 66 条
[1]   Efficient 3D Deep Learning Model for Medical Image Semantic Segmentation [J].
Alalwan, Nasser ;
Abozeid, Amr ;
ElHabshy, AbdAllah A. ;
Alzahrani, Ahmed .
ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) :1231-1239
[2]   Versatile Convolutional Networks Applied to Computed Tomography and Magnetic Resonance Image Segmentation [J].
Almeida, Goncalo ;
Tavares, Joao Manuel R. S. .
JOURNAL OF MEDICAL SYSTEMS, 2021, 45 (08)
[3]   Recurrent residual U-Net for medical image segmentation [J].
Alom, Md Zahangir ;
Yakopcic, Chris ;
Hasan, Mahmudul ;
Taha, Tarek M. ;
Asari, Vijayan K. .
JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Dual-Path Adversarial Learning for Fully Convolutional Network (FCN)-Based Medical Image Segmentation [J].
Bi, Lei ;
Feng, Dagan ;
Kim, Jinman .
VISUAL COMPUTER, 2018, 34 (6-8) :1043-1052
[6]   Dermoscopic Image Segmentation via Multistage Fully Convolutional Networks [J].
Bi, Lei ;
Kim, Jinman ;
Ahn, Euijoon ;
Kumar, Ashnil ;
Fulham, Michael ;
Feng, Dagan .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2017, 64 (09) :2065-2074
[7]   HistoSegNet: Semantic Segmentation of Histological Tissue Type in Whole Slide Images [J].
Chan, Lyndon ;
Hosseini, Mahdi S. ;
Rowsell, Corwyn ;
Plataniotis, Konstantinos N. ;
Damaskinos, Savvas .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :10661-10670
[8]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[9]   Reverse Attention for Salient Object Detection [J].
Chen, Shuhan ;
Tan, Xiuli ;
Wang, Ben ;
Hu, Xuelong .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :236-252
[10]   DDU-Net: A dual dense U-structure network for medical image segmentation [J].
Cheng, Junlong ;
Tian, Shengwei ;
Yu, Long ;
Liu, Shijia ;
Wang, Chaoqing ;
Ren, Yuan ;
Lu, Hongchun ;
Zhu, Min .
APPLIED SOFT COMPUTING, 2022, 126