Masked Deformation Modeling for Volumetric Brain MRI Self-Supervised Pre-Training

被引:0
作者
Lyu, Junyan [1 ,2 ]
Bartlett, Perry F. [2 ]
Nasrallah, Fatima A. [2 ]
Tang, Xiaoying [1 ,3 ]
机构
[1] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen 518055, Peoples R China
[2] Univ Queensland, Queensland Brain Inst, St Lucia, Qld 4072, Australia
[3] Southern Univ Sci & Technol, Jiaxing Res Inst, Jiaxing 314031, Peoples R China
基金
中国国家自然科学基金;
关键词
Brain; Magnetic resonance imaging; Deformation; Brain modeling; Image segmentation; Image restoration; Biomedical imaging; Annotations; Feature extraction; Lesions; Self-supervised learning; masked deformation modeling; brain segmentation; DIFFEOMORPHIC IMAGE REGISTRATION; SEGMENTATION; HIPPOCAMPUS; MORPHOMETRY; PATTERNS; RESOURCE; ATLAS;
D O I
10.1109/TMI.2024.3510922
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Self-supervised learning (SSL) has been proposed to alleviate neural networks' reliance on annotated data and to improve downstream tasks' performance, which has obtained substantial success in several volumetric medical image segmentation tasks. However, most existing approaches are designed and pre-trained on CT or MRI datasets of non-brain organs. The lack of brain prior limits those methods' performance on brain segmentation, especially on fine-grained brain parcellation. To overcome this limitation, we here propose a novel SSL strategy for MRI of the human brain, named Masked Deformation Modeling (MDM). MDM first conducts atlas-guided patch sampling on individual brain MRI scans (moving volumes) and an MNI152 template (a fixed volume). The sampled moving volumes are randomly masked in a feature-aligned manner, and then sent into a U-Net-based network to extract latent features. An intensity head and a deformation field head are used to decode the latent features, respectively restoring the masked volume and predicting the deformation field from the moving volume to the fixed volume. The proposed MDM is fine-tuned and evaluated on three brain parcellation datasets with different granularities (JHU, Mindboggle-101, CANDI), a brain lesion segmentation dataset (ATLAS2), and a brain tumor segmentation dataset (BraTS21). Results demonstrate that MDM outperforms various state-of-the-art medical SSL methods by considerable margins, and can effectively reduce the annotation effort by at least 40%. Codes and pre-trained weights will be released at https://github.com/CRazorback/MDM.
引用
收藏
页码:1596 / 1607
页数:12
相关论文
共 71 条
[21]  
Gao P., 2022, arXiv
[22]   Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking [J].
Gao, Peng ;
Lin, Ziyi ;
Zhang, Renrui ;
Fang, Rongyao ;
Li, Hongyang ;
Li, Hongsheng ;
Qiao, Yu .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (05) :1546-1556
[23]   Learning Representations by Predicting Bags of Visual Words [J].
Gidaris, Spyros ;
Bursuc, Andrei ;
Komodakis, Nikos ;
Perez, Patrick ;
Cord, Matthieu .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6926-6936
[24]  
Graham B, 2017, Arxiv, DOI [arXiv:1706.01307, 10.48550/arXiv.1706.01307]
[25]   Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-Supervised Learning [J].
Haghighi, Fatemeh ;
Taher, Mohammad Reza Hosseinzadeh ;
Zhou, Zongwei ;
Gotway, Michael B. ;
Liang, Jianming .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) :2857-2868
[26]   Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images [J].
Hatamizadeh, Ali ;
Nath, Vishwesh ;
Tang, Yucheng ;
Yang, Dong ;
Roth, Holger R. ;
Xu, Daguang .
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT I, 2022, 12962 :272-284
[27]   Masked Autoencoders Are Scalable Vision Learners [J].
He, Kaiming ;
Chen, Xinlei ;
Xie, Saining ;
Li, Yanghao ;
Dollar, Piotr ;
Girshick, Ross .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :15979-15988
[28]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735
[29]   SwinUNETR-V2: Stronger Swin Transformers with Stagewise Convolutions for 3D Medical Image Segmentation [J].
He, Yufan ;
Nath, Vishwesh ;
Yang, Dong ;
Tang, Yucheng ;
Myronenko, Andriy ;
Xu, Daguang .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 :416-426
[30]   Geometric Visual Similarity Learning in 3D Medical Image Self-supervised Pre-training [J].
He, Yuting ;
Yang, Guanyu ;
Ge, Rongjun ;
Chen, Yang ;
Coatrieux, Jean-Louis ;
Wang, Boyu ;
Li, Shuo .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :9538-9547