MSAByNet: A multiscale subtraction attention network framework based on Bayesian loss for medical image segmentation

被引:1
作者
Zhao, Longxuan [1 ,2 ]
Wang, Tao [1 ,2 ]
Chen, Yuanbin [1 ,2 ]
Zhang, Xinlin [1 ,2 ,3 ]
Tang, Hui [1 ,2 ]
Zong, Ruige [1 ,2 ]
Tan, Tao [4 ]
Chen, Shun [5 ,6 ]
Tong, Tong [1 ,2 ,3 ]
机构
[1] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
[2] Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
[3] Imperial Vis Technol, Fuzhou, Peoples R China
[4] Macao Polytech Univ, Macau, Peoples R China
[5] Fujian Med Univ, Union Hosp, Dept Ultrasound, Fuzhou, Peoples R China
[6] Fujian Med Ultrasound Res Inst, Fuzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Medical image segmentation; Multi-scale processing; Deep convolutional neural networks; Bayesian loss; NET;
D O I
10.1016/j.bspc.2024.107393
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Medical image segmentation is a critical and complex process in medical image processing and analysis. With the development of artificial intelligence, the application of deep learning in medical image segmentation is becoming increasingly widespread. Existing techniques are mostly based on the U-shaped convolutional neural network and its variants, such as the U-Net framework, which uses skip connections or element-wise addition to fuse features from different levels in the decoder. However, these operations often weaken the compatibility between features at different levels, leading to a significant amount of redundant information and imprecise lesion segmentation. The construction of the loss function is a key factor in neural network design, but traditional loss functions lack high domain generalization and the interpretability of domain-invariant features needs improvement. To address these issues, we propose a Bayesian loss-based Multi-Scale Subtraction Attention Network (MSAByNet). Specifically, we propose an inter-layer and intra-layer multi-scale subtraction attention module, and different sizes of receptive fields were set for different levels of modules to avoid loss of feature map resolution and edge detail features. Additionally, we design a multi-scale deep spatial attention mechanism to learn spatial dimension information and enrich multi-scale differential information. Furthermore, we introduce Bayesian loss, re-modeling the image in spatial terms, enabling our MSAByNet to capture stable shapes, improving domain generalization performance. We have evaluated our proposed network on two publicly available datasets: the BUSI dataset and the Kvasir-SEG dataset. Experimental results demonstrate that the proposed MSAByNet outperforms several state-of-the-art segmentation methods. The codes are available at https://github.com/zlxokok/MSAByNet.
引用
收藏
页数:12
相关论文
共 55 条
[1]  
Abraham N, 2019, I S BIOMED IMAGING, P683, DOI 10.1109/ISBI.2019.8759329
[2]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[3]   Recurrent residual U-Net for medical image segmentation [J].
Alom, Md Zahangir ;
Yakopcic, Chris ;
Hasan, Mahmudul ;
Taha, Tarek M. ;
Asari, Vijayan K. .
JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[6]  
Chen J., 2021, arXiv
[7]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]   CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation [J].
Chen, Yuanbin ;
Wang, Tao ;
Tang, Hui ;
Zhao, Longxuan ;
Zhang, Xinlin ;
Tan, Tao ;
Gao, Qinquan ;
Du, Min ;
Tong, Tong .
PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17)