GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling

被引:1
|
作者
Qi L. [1 ]
Jiang Z. [1 ,2 ]
Shi W. [1 ,2 ]
Qu F. [1 ]
Feng G. [1 ]
机构
[1] Department of Computer Science and Technology, Changchun University of Science and Technology, Jilin, Changchun
[2] Zhongshan Institute of Changchun University of Science and Technology, Guangzhou, Zhongshan
关键词
Brain tumor segmentation; Masked image modeling; Self-supervised learning;
D O I
10.1016/j.compbiomed.2024.108547
中图分类号
学科分类号
摘要
Self-supervised pre-training and fully supervised fine-tuning paradigms have received much attention to solve the data annotation problem in deep learning fields. Compared with traditional pre-training on large natural image datasets, medical self-supervised learning methods learn rich representations derived from unlabeled data itself thus avoiding the distribution shift between different image domains. However, nowadays state-of-the-art medical pre-training methods were specifically designed for downstream tasks making them less flexible and difficult to apply to new tasks. In this paper, we propose grid mask image modeling, a flexible and general self-supervised method to pre-train medical vision transformers for 3D medical image segmentation. Our goal is to guide networks to learn the correlations between organs and tissues by reconstructing original images based on partial observations. The relationships are consistent within the human body and invariant to disease type or imaging modality. To achieve this, we design a Siamese framework consisting of an online branch and a target branch. An adaptive and hierarchical masking strategy is employed in the online branch to (1) learn the boundaries or small contextual mutation regions within images; (2) to learn high-level semantic representations from deeper layers of the multiscale encoder. In addition, the target branch provides representations for contrastive learning to further reduce representation redundancy. We evaluate our method through segmentation performance on two public datasets. The experimental results demonstrate our method outperforms other self-supervised methods. Codes are available at https://github.com/mobiletomb/Gmim. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [41] A self-supervised pre-training scheme for multi-source heterogeneous remote sensing image land cover classification
    Xue Z.
    Yu X.
    Liu J.
    Yang G.
    Liu B.
    Yu A.
    Zhou J.
    Jin S.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (03): : 512 - 525
  • [42] Localized Region Contrast for Enhancing Self-supervised Learning in Medical Image Segmentation
    Yan, Xiangyi
    Naushad, Junayed
    You, Chenyu
    Tang, Hao
    Sun, Shanlin
    Han, Kun
    Ma, Haoyu
    Duncan, James S.
    Xie, Xiaohui
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 468 - 478
  • [43] Boundary-aware information maximization for self-supervised medical image segmentation
    Peng, Jizong
    Wang, Ping
    Pedersoli, Marco
    Desrosiers, Christian
    MEDICAL IMAGE ANALYSIS, 2024, 94
  • [44] ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED SELF-SUPERVISED LEARNING
    Zhan, Yucheng
    Zhao, Yucheng
    Luo, Chong
    Zhang, Yueyi
    Sun, Xiaoyan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2490 - 2494
  • [45] SDCluster: A clustering based self-supervised pre-training method for semantic segmentation of remote sensing images
    Xu, Hanwen
    Zhang, Chenxiao
    Yue, Peng
    Wang, Kaixuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 223 : 1 - 14
  • [46] Rubik's Cube plus : A self-supervised feature learning framework for 3D medical image analysis
    Zhu, Jiuwen
    Li, Yuexiang
    Hu, Yifan
    Ma, Kai
    Zhou, S. Kevin
    Zheng, Yefeng
    MEDICAL IMAGE ANALYSIS, 2020, 64
  • [47] BTSwin-Unet: 3D U-shaped Symmetrical Swin Transformer-based Network for Brain Tumor Segmentation with Self-supervised Pre-training
    Junjie Liang
    Cihui Yang
    Jingting Zhong
    Xiaoli Ye
    Neural Processing Letters, 2023, 55 : 3695 - 3713
  • [48] Self-supervised Learning Based on Max-tree Representation for Medical Image Segmentation
    Tang, Qian
    Du, Bo
    Xu, Yongchao
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [49] BTSwin-Unet: 3D U-shaped Symmetrical Swin Transformer-based Network for Brain Tumor Segmentation with Self-supervised Pre-training
    Liang, Junjie
    Yang, Cihui
    Zhong, Jingting
    Ye, Xiaoli
    NEURAL PROCESSING LETTERS, 2023, 55 (04) : 3695 - 3713
  • [50] VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation
    Yang, Zijie
    Xie, Lingxi
    Zhou, Wei
    Huo, Xinyue
    Wei, Longhui
    Lu, Jian
    Tian, Qi
    Tang, Sheng
    MULTIMEDIA SYSTEMS, 2023, 29 (01) : 33 - 48