Self-supervised few-shot medical image segmentation with spatial transformations

Cited by: 0
Authors
Titoriya, Ankit Kumar [1]
Singh, Maheshwari Prasad [1]
Singh, Amit Kumar [1]
Affiliations
[1] Engineering, National Institute of Technology Patna, Ashok Rajpath, Bihar, Patna
Keywords
Few-shot learning; Few-shot segmentation; Image segmentation; Machine learning; Medical image; Self-supervised learning
DOI
10.1007/s00521-024-10184-4
Abstract
Deep learning-based segmentation models often perform poorly on new, unseen semantic classes, and training them requires large amounts of annotated data and substantial computational resources. Few-shot segmentation (FSS) networks are a promising way to mitigate these challenges because they can be trained with far less annotated data. Despite this potential, the inherent complexity of medical images limits the applicability of FSS in medical imaging. Recent self-supervised, label-efficient FSS models have nevertheless proven effective in medical image segmentation tasks. This paper presents a novel FSS architecture that improves segmentation accuracy while using fewer features than existing methods. It also proposes a novel self-supervised learning approach that uses supervoxel and augmented superpixel images to further improve segmentation accuracy. The proposed model is evaluated on two datasets: abdominal magnetic resonance imaging (MRI) and cardiac MRI. It achieves a mean Dice score and mean intersection over union (IoU) of 81.62% and 70.38% for abdominal images, and 79.38% and 65.23% for cardiac images. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
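The two metrics reported in the abstract, the Dice score and intersection over union, can be illustrated on a pair of binary masks. The following is a minimal sketch of the standard definitions, not the authors' evaluation code: the function name `dice_and_iou`, the flat 0/1 mask representation, and the convention of returning 1.0 when both masks are empty are all assumptions made for illustration.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two equally sized binary masks.

    Masks are flat sequences of 0/1 values (an assumed representation;
    real pipelines usually hold 2-D or 3-D arrays).
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection

    if union == 0:
        # Both masks are empty: treated here as perfect agreement (assumed convention).
        return 1.0, 1.0

    dice = 2.0 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy example: 6-pixel masks that overlap on 2 foreground pixels.
pred = [0, 1, 1, 1, 0, 0]
target = [0, 1, 1, 0, 1, 0]
dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.4f}, IoU = {iou:.4f}")  # Dice = 0.6667, IoU = 0.5000
```

For a single mask pair the two values are related by Dice = 2·IoU / (1 + IoU); averages taken over many images, such as those reported in the abstract, need not satisfy this identity exactly.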
Pages: 18675-18691
Number of pages: 16