Masked Deformation Modeling for Volumetric Brain MRI Self-Supervised Pre-Training

被引:0
作者
Lyu, Junyan [1 ,2 ]
Bartlett, Perry F. [2 ]
Nasrallah, Fatima A. [2 ]
Tang, Xiaoying [1 ,3 ]
机构
[1] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen 518055, Peoples R China
[2] Univ Queensland, Queensland Brain Inst, St Lucia, Qld 4072, Australia
[3] Southern Univ Sci & Technol, Jiaxing Res Inst, Jiaxing 314031, Peoples R China
基金
中国国家自然科学基金;
关键词
Brain; Magnetic resonance imaging; Deformation; Brain modeling; Image segmentation; Image restoration; Biomedical imaging; Annotations; Feature extraction; Lesions; Self-supervised learning; masked deformation modeling; brain segmentation; DIFFEOMORPHIC IMAGE REGISTRATION; SEGMENTATION; HIPPOCAMPUS; MORPHOMETRY; PATTERNS; RESOURCE; ATLAS;
D O I
10.1109/TMI.2024.3510922
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Self-supervised learning (SSL) has been proposed to alleviate neural networks' reliance on annotated data and to improve downstream tasks' performance, which has obtained substantial success in several volumetric medical image segmentation tasks. However, most existing approaches are designed and pre-trained on CT or MRI datasets of non-brain organs. The lack of brain prior limits those methods' performance on brain segmentation, especially on fine-grained brain parcellation. To overcome this limitation, we here propose a novel SSL strategy for MRI of the human brain, named Masked Deformation Modeling (MDM). MDM first conducts atlas-guided patch sampling on individual brain MRI scans (moving volumes) and an MNI152 template (a fixed volume). The sampled moving volumes are randomly masked in a feature-aligned manner, and then sent into a U-Net-based network to extract latent features. An intensity head and a deformation field head are used to decode the latent features, respectively restoring the masked volume and predicting the deformation field from the moving volume to the fixed volume. The proposed MDM is fine-tuned and evaluated on three brain parcellation datasets with different granularities (JHU, Mindboggle-101, CANDI), a brain lesion segmentation dataset (ATLAS2), and a brain tumor segmentation dataset (BraTS21). Results demonstrate that MDM outperforms various state-of-the-art medical SSL methods by considerable margins, and can effectively reduce the annotation effort by at least 40%. Codes and pre-trained weights will be released at https://github.com/CRazorback/MDM.
引用
收藏
页码:1596 / 1607
页数:12
相关论文
共 71 条
  • [21] Gao P, 2022, arXiv, DOI DOI 10.48550/ARXIV.2
  • [22] Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
    Gao, Peng
    Lin, Ziyi
    Zhang, Renrui
    Fang, Rongyao
    Li, Hongyang
    Li, Hongsheng
    Qiao, Yu
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (05) : 1546 - 1556
  • [23] Learning Representations by Predicting Bags of Visual Words
    Gidaris, Spyros
    Bursuc, Andrei
    Komodakis, Nikos
    Perez, Patrick
    Cord, Matthieu
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6926 - 6936
  • [24] Graham B, 2017, Arxiv, DOI [arXiv:1706.01307, DOI 10.48550/ARXIV.1706.01307]
  • [25] Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-Supervised Learning
    Haghighi, Fatemeh
    Taher, Mohammad Reza Hosseinzadeh
    Zhou, Zongwei
    Gotway, Michael B.
    Liang, Jianming
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2857 - 2868
  • [26] Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images
    Hatamizadeh, Ali
    Nath, Vishwesh
    Tang, Yucheng
    Yang, Dong
    Roth, Holger R.
    Xu, Daguang
    [J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT I, 2022, 12962 : 272 - 284
  • [27] Masked Autoencoders Are Scalable Vision Learners
    He, Kaiming
    Chen, Xinlei
    Xie, Saining
    Li, Yanghao
    Dollar, Piotr
    Girshick, Ross
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15979 - 15988
  • [28] Momentum Contrast for Unsupervised Visual Representation Learning
    He, Kaiming
    Fan, Haoqi
    Wu, Yuxin
    Xie, Saining
    Girshick, Ross
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9726 - 9735
  • [29] SwinUNETR-V2: Stronger Swin Transformers with Stagewise Convolutions for 3D Medical Image Segmentation
    He, Yufan
    Nath, Vishwesh
    Yang, Dong
    Tang, Yucheng
    Myronenko, Andriy
    Xu, Daguang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 416 - 426
  • [30] Geometric Visual Similarity Learning in 3D Medical Image Self-supervised Pre-training
    He, Yuting
    Yang, Guanyu
    Ge, Rongjun
    Chen, Yang
    Coatrieux, Jean-Louis
    Wang, Boyu
    Li, Shuo
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9538 - 9547