Deep Encoder-decoder Network with Saliency Guidance and Uncertainty Supervision

Authors
Wang X. [1, 2]
Li Z.-S. [1, 2]
Chen H.-P. [1, 2]
Affiliations
[1] College of Computer Science and Technology, Jilin University, Changchun
[2] Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University, Changchun
Source
Ruan Jian Xue Bao/Journal of Software | 2022, Vol. 33, No. 9
Keywords
encoder-decoder network; multimodal; saliency map; semantic segmentation of medical image; uncertainty probability map
DOI
10.13328/j.cnki.jos.006624
Abstract
Encoder-decoder networks based on U-Net and its variants have achieved excellent performance in semantic segmentation of medical images. However, some spatial details are lost during feature extraction, which reduces segmentation accuracy, and the generalization ability and robustness of these models remain unsatisfactory. To address these problems, this study proposes a deep convolutional encoder-decoder network with saliency guidance and uncertainty supervision for semantic segmentation of multimodal medical images. In this method, the initially generated saliency map and the uncertainty probability map serve as supervisory information for optimizing the parameters of the semantic segmentation network. Specifically, a saliency detection network generates the saliency map to preliminarily locate the target region in an image; on this basis, the set of pixels with uncertain classification is computed to generate the uncertainty probability map. The two maps are then fed, together with the original image, into the multi-scale feature fusion network, guiding it to focus on learning features in the target region and enhancing its representational capacity for regions with uncertain classification and complex boundaries, which improves segmentation performance. Experimental results show that the proposed method captures more semantic information and outperforms existing semantic segmentation methods on multimodal medical images, with strong generalization capability and robustness. © 2022 Chinese Academy of Sciences. All rights reserved.
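The data flow described in the abstract (saliency map, then uncertainty probability map, then both combined with the original image) can be illustrated with a minimal sketch. The abstract does not say how the uncertainty map is computed or how the maps are combined with the image, so everything here is an assumption: the per-pixel binary entropy of the saliency probability stands in for the uncertainty measure, channel-wise stacking stands in for the fusion input, and the names `uncertainty_map` and `build_guided_input` are hypothetical rather than taken from the paper.

```python
import numpy as np


def uncertainty_map(saliency, eps=1e-7):
    """Per-pixel binary entropy of a saliency probability map.

    ASSUMPTION: entropy is used as a stand-in uncertainty measure; the
    paper's actual definition is not given in the abstract.
    Values lie in [0, 1] and peak at 1 where saliency == 0.5.
    """
    # Clip away exact 0 and 1 so the logarithms stay finite.
    p = np.clip(saliency, eps, 1.0 - eps)
    return -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))


def build_guided_input(image, saliency):
    """Stack image channels with the saliency and uncertainty maps.

    image:    array of shape (C, H, W)
    saliency: array of shape (H, W) with values in [0, 1]
    returns:  array of shape (C + 2, H, W)

    ASSUMPTION: channel concatenation is one plausible way to pass the
    two maps into a network together with the original image.
    """
    unc = uncertainty_map(saliency)
    return np.concatenate([image, saliency[None], unc[None]], axis=0)


# Toy single-channel 4x4 "image" with a synthetic saliency map.
rng = np.random.default_rng(0)
image = rng.random((1, 4, 4))
saliency = np.full((4, 4), 0.05)   # confident background
saliency[1:3, 1:3] = 0.95          # confident foreground
saliency[1, 3] = 0.5               # ambiguous boundary pixel

guided = build_guided_input(image, saliency)
```

In this toy input the boundary pixel at `[1, 3]` receives the highest uncertainty value, which is the kind of region the abstract says the network should be guided to represent better.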