SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation

被引：12

作者：

Yoon, Jee Seok ^{[1
,3
]}

Zhang, Chenghao ^{[2
]}

Suk, Heung-Il ^{[1
]}

Guo, Jia ^{[2
]}

Li, Xiaoxiao ^{[3
]}

机构：

[1] Korea Univ, Seoul 02841, South Korea

[2] Columbia Univ, New York, NY 10027 USA

[3] Univ British Columbia, Vancouver, BC V6T 1Z4, Canada

来源：

INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023 | 2023年 / 13939卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Diffusion model; Sequential image generation; Autoregressive conditioning;

D O I：

10.1007/978-3-031-34048-2_30

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human organs constantly undergo anatomical changes due to a complex mix of short-term (e.g., heartbeat) and long-term (e.g., aging) factors. Evidently, prior knowledge of these factors will be beneficial when modeling their future state, i.e., via image generation. However, most of the medical image generation tasks only rely on the input from a single image, thus ignoring the sequential dependency even when longitudinal data is available. Sequence-aware deep generative models, where model input is a sequence of ordered and timestamped images, are still underexplored in the medical imaging domain that is featured by several unique challenges: 1) Sequences with various lengths; 2) Missing data or frame, and 3) High dimensionality. To this end, we propose a sequence-aware diffusion model (SADM) for the generation of longitudinal medical images. Recently, diffusion models have shown promising results in high-fidelity image generation. Our method extends this new technique by introducing a sequence-aware transformer as the conditional module in a diffusion model. The novel design enables learning longitudinal dependency even with missing data during training and allows autoregressive generation of a sequence of images during inference. Our extensive experiments on 3D longitudinal medical images demonstrate the effectiveness of SADM compared with baselines and alternative methods. The code is available at https://github.com/ubc-tea/SADM-Longitudinal-Medical-Image-Generation.

引用

页码：388 / 400

页数：13

共 22 条

[1] ViViT: A Video Vision Transformer
Arnab, Anurag
Dehghani, Mostafa
Heigold, Georg
Sun, Chen
Lucic, Mario
Schmid, Cordelia
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6816 - 6826
[2] VoxelMorph: A Learning Framework for Deformable Medical Image Registration
Balakrishnan, Guha
Zhao, Amy
Sabuncu, Mert R.
Guttag, John
Dalca, Adrian, V
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (08) : 1788 - 1800
[3] Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved?
Bernard, Olivier
Lalande, Alain
Zotti, Clement
Cervenansky, Frederick
Yang, Xin
Heng, Pheng-Ann
Cetin, Irem
Lekadir, Karim
Camara, Oscar
Gonzalez Ballester, Miguel Angel
Sanroma, Gerard
Napel, Sandy
Petersen, Steffen
Tziritas, Georgios
Grinias, Elias
Khened, Mahendra
Kollerathu, Varghese Alex
Krishnamurthi, Ganapathy
Rohe, Marc-Michel
Pennec, Xavier
Sermesant, Maxime
Isensee, Fabian
Jaeger, Paul
Maier-Hein, Klaus H.
Full, Peter M.
Wolf, Ivo
Engelhardt, Sandy
Baumgartner, Christian F.
Koch, Lisa M.
Wolterink, Jelmer M.
Isgum, Ivana
Jang, Yeonggul
Hong, Yoonmi
Patravali, Jay
Jain, Shubham
Humbert, Olivier
Jodoin, Pierre-Marc
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) : 2514 - 2525
[4] Cardiac aging synthesis from cross-sectional data with conditional generative adversarial networks
Campello, Victor M.
Xia, Tian
Liu, Xiao
Sanchez, Pedro
Martin-Isla, Carlos
Petersen, Steffen E.
Segui, Santi
Tsaftaris, Sotirios A.
Lekadir, Karim
[J]. FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
[5] Dhariwal P, 2021, ADV NEUR IN, V34
[6] Estimating brain age based on a uniform healthy population with deep learning and structural magnetic resonance imaging
Feng, Xinyang
Lipton, Zachary C.
Yang, Jie
Small, Scott A.
Provenzano, Frank A.
[J]. NEUROBIOLOGY OF AGING, 2020, 91 : 15 - 25
[7] Harvey W, 2022, ADV NEUR IN
[8] Ho J., 2022, Imagen Video: High Definition Video Generation with Diffusion Models
[9] Ho J., 2022, Classifier-Free Diffusion Guidance
[10] Ho J., 2020, P NIPS, V33, P6840

← 1 2 3 →