GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling

被引:1
|
作者
Qi L. [1 ]
Jiang Z. [1 ,2 ]
Shi W. [1 ,2 ]
Qu F. [1 ]
Feng G. [1 ]
机构
[1] Department of Computer Science and Technology, Changchun University of Science and Technology, Jilin, Changchun
[2] Zhongshan Institute of Changchun University of Science and Technology, Guangzhou, Zhongshan
关键词
Brain tumor segmentation; Masked image modeling; Self-supervised learning;
D O I
10.1016/j.compbiomed.2024.108547
中图分类号
学科分类号
摘要
Self-supervised pre-training and fully supervised fine-tuning paradigms have received much attention to solve the data annotation problem in deep learning fields. Compared with traditional pre-training on large natural image datasets, medical self-supervised learning methods learn rich representations derived from unlabeled data itself thus avoiding the distribution shift between different image domains. However, nowadays state-of-the-art medical pre-training methods were specifically designed for downstream tasks making them less flexible and difficult to apply to new tasks. In this paper, we propose grid mask image modeling, a flexible and general self-supervised method to pre-train medical vision transformers for 3D medical image segmentation. Our goal is to guide networks to learn the correlations between organs and tissues by reconstructing original images based on partial observations. The relationships are consistent within the human body and invariant to disease type or imaging modality. To achieve this, we design a Siamese framework consisting of an online branch and a target branch. An adaptive and hierarchical masking strategy is employed in the online branch to (1) learn the boundaries or small contextual mutation regions within images; (2) to learn high-level semantic representations from deeper layers of the multiscale encoder. In addition, the target branch provides representations for contrastive learning to further reduce representation redundancy. We evaluate our method through segmentation performance on two public datasets. The experimental results demonstrate our method outperforms other self-supervised methods. Codes are available at https://github.com/mobiletomb/Gmim. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [31] Abdominal Organs and Pan-Cancer Segmentation Based on Self-supervised Pre-training and Self-training
    Li, He
    Han, Meng
    Wang, Guotai
    FAST, LOW-RESOURCE, AND ACCURATE ORGAN AND PAN-CANCER SEGMENTATION IN ABDOMEN CT, FLARE 2023, 2024, 14544 : 130 - 142
  • [32] A Spatial Guided Self-supervised Clustering Network for Medical Image Segmentation
    Ahn, Euijoon
    Feng, Dagan
    Kim, Jinman
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 379 - 388
  • [33] Ssman: self-supervised masked adaptive network for 3D human pose estimation
    Shi, Yu
    Yue, Tianyi
    Zhao, Hu
    He, Guoping
    Ren, Keyan
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [34] Self-Supervised Learning for Few-Shot Medical Image Segmentation
    Ouyang, Cheng
    Biffi, Carlo
    Chen, Chen
    Kart, Turkay
    Qiu, Huaqi
    Rueckert, Daniel
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (07) : 1837 - 1848
  • [35] Medical image segmentation based on self-supervised hybrid fusion network
    Zhao, Liang
    Jia, Chaoran
    Ma, Jiajun
    Shao, Yu
    Liu, Zhuo
    Yuan, Hong
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [36] Hierarchical Self-supervised Learning for Medical Image Segmentation Based on Multi-domain Data Aggregation
    Zheng, Hao
    Han, Jun
    Wang, Hongxiao
    Yang, Lin
    Zhao, Zhuo
    Wang, Chaoli
    Chen, Danny Z.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 622 - 632
  • [37] Self-supervised Image-based 3D Model Retrieval
    Song, Dan
    Zhang, Chu-Meng
    Zhao, Xiao-Qian
    Wang, Teng
    Nie, Wei-Zhi
    Li, Xuan-Ya
    Liu, An-An
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [38] W2V-BERT: COMBINING CONTRASTIVE LEARNING AND MASKED LANGUAGE MODELING FOR SELF-SUPERVISED SPEECH PRE-TRAINING
    Chung, Yu-An
    Zhang, Yu
    Han, Wei
    Chiu, Chung-Cheng
    Qin, James
    Pang, Ruoming
    Wu, Yonghui
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 244 - 250
  • [39] Masked Modeling-Based Ultrasound Image Classification via Self-Supervised Learning
    Xu, Kele
    You, Kang
    Zhu, Boqing
    Feng, Ming
    Feng, Dawei
    Yang, Cheng
    IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 226 - 237
  • [40] Self-supervised few-shot medical image segmentation with spatial transformations
    Titoriya, Ankit Kumar
    Singh, Maheshwari Prasad
    Singh, Amit Kumar
    Neural Computing and Applications, 2024, 36 (30) : 18675 - 18691