Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis

被引:14
作者
Zhu, Lingting [1 ]
Xue, Zeyue [1 ]
Jin, Zhenchao [1 ]
Liu, Xian [2 ]
He, Jingzhen [3 ]
Liu, Ziwei [4 ]
Yu, Lequan [1 ]
机构
[1] Univ Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[3] Shandong Univ, Qilu Hosp, Jinan, Peoples R China
[4] Nanyang Technol Univ, S Lab, Singapore, Singapore
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X | 2023年 / 14229卷
关键词
Cross-modality medical image synthesis; Volumetric data; Latent diffusion model; Brain MRI;
D O I
10.1007/978-3-031-43999-5_56
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modality medical image synthesis is a critical topic and has the potential to facilitate numerous applications in the medical imaging field. Despite recent successes in deep-learning-based generative models, most current medical image synthesis methods rely on generative adversarial networks and suffer from notorious mode collapse and unstable training. Moreover, the 2D backbone-driven approaches would easily result in volumetric inconsistency, while 3D backbones are challenging and impractical due to the tremendous memory cost and training difficulty. In this paper, we introduce a new paradigm for volumetric medical data synthesis by leveraging 2D backbones and present a diffusion-based framework, Make-A-Volume, for cross-modality 3D medical image synthesis. To learn the cross-modality slice-wise mapping, we employ a latent diffusion model and learn a low-dimensional latent space, resulting in high computational efficiency. To enable the 3D image synthesis and mitigate volumetric inconsistency, we further insert a series of volumetric layers in the 2D slice-mapping model and fine-tune them with paired 3D data. This paradigm extends the 2D image diffusion model to a volumetric version with a slightly increasing number of parameters and computation, offering a principled solution for generic cross-modality 3D medical image synthesis. We showcase the effectiveness of our Make-A-Volume framework on an in-house SWI-MRA brain MRI dataset and a public T1-T2 brain MRI dataset. Experimental results demonstrate that our framework achieves superior synthesis results with volumetric consistency.
引用
收藏
页码:592 / 601
页数:10
相关论文
共 33 条
[1]   Seeing What a GAN Cannot Generate [J].
Bau, David ;
Zhu, Jun-Yan ;
Wulff, Jonas ;
Peebles, William ;
Strobelt, Hendrik ;
Zhou, Bolei ;
Torralba, Antonio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4501-4510
[2]   Cross-modality synthesis from CT to PET using FCN and GAN networks for improved automated lesion detection [J].
Ben-Cohen, Avi ;
Klang, Eyal ;
Raskin, Stephen P. ;
Soffer, Shelly ;
Ben-Haim, Simona ;
Konen, Eli ;
Amitai, Michal Marianne ;
Greenspan, Hayit .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 78 :186-194
[3]  
Chung H, 2022, arXiv
[4]   ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis [J].
Dalmaz, Onat ;
Yurt, Mahmut ;
Cukur, Tolga .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (10) :2598-2614
[5]   Image Synthesis in Multi-Contrast MRI With Conditional Generative Adversarial Networks [J].
Dar, Salman U. H. ;
Yurt, Mahmut ;
Karacan, Levent ;
Erdem, Aykut ;
Erdem, Erkut ;
Cukur, Tolga .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (10) :2375-2388
[6]  
Dhariwal P, 2021, ADV NEUR IN, V34
[7]   Recent advances on the development of phantoms using 3D printing for imaging with CT, MRI, PET, SPECT, and ultrasound [J].
Filippou, Valeria ;
Tsoumpas, Charalampos .
MEDICAL PHYSICS, 2018, 45 (09) :E740-E760
[8]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[9]  
Ho Jonathan., 2020, P 34 INT C NEURAL IN, P6840
[10]   AutoGAN-Synthesizer: Neural Architecture Search for Cross-Modality MRI Synthesis [J].
Hu, Xiaobin ;
Shen, Ruolin ;
Luo, Donghao ;
Tai, Ying ;
Wang, Chengjie ;
Menze, Bjoern H. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI, 2022, 13436 :397-409