SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation

被引:12
作者
Yoon, Jee Seok [1 ,3 ]
Zhang, Chenghao [2 ]
Suk, Heung-Il [1 ]
Guo, Jia [2 ]
Li, Xiaoxiao [3 ]
机构
[1] Korea Univ, Seoul 02841, South Korea
[2] Columbia Univ, New York, NY 10027 USA
[3] Univ British Columbia, Vancouver, BC V6T 1Z4, Canada
来源
INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023 | 2023年 / 13939卷
基金
加拿大自然科学与工程研究理事会;
关键词
Diffusion model; Sequential image generation; Autoregressive conditioning;
D O I
10.1007/978-3-031-34048-2_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human organs constantly undergo anatomical changes due to a complex mix of short-term (e.g., heartbeat) and long-term (e.g., aging) factors. Evidently, prior knowledge of these factors will be beneficial when modeling their future state, i.e., via image generation. However, most of the medical image generation tasks only rely on the input from a single image, thus ignoring the sequential dependency even when longitudinal data is available. Sequence-aware deep generative models, where model input is a sequence of ordered and timestamped images, are still underexplored in the medical imaging domain that is featured by several unique challenges: 1) Sequences with various lengths; 2) Missing data or frame, and 3) High dimensionality. To this end, we propose a sequence-aware diffusion model (SADM) for the generation of longitudinal medical images. Recently, diffusion models have shown promising results in high-fidelity image generation. Our method extends this new technique by introducing a sequence-aware transformer as the conditional module in a diffusion model. The novel design enables learning longitudinal dependency even with missing data during training and allows autoregressive generation of a sequence of images during inference. Our extensive experiments on 3D longitudinal medical images demonstrate the effectiveness of SADM compared with baselines and alternative methods. The code is available at https://github.com/ubc-tea/SADM-Longitudinal-Medical-Image-Generation.
引用
收藏
页码:388 / 400
页数:13
相关论文
共 22 条
  • [1] ViViT: A Video Vision Transformer
    Arnab, Anurag
    Dehghani, Mostafa
    Heigold, Georg
    Sun, Chen
    Lucic, Mario
    Schmid, Cordelia
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6816 - 6826
  • [2] VoxelMorph: A Learning Framework for Deformable Medical Image Registration
    Balakrishnan, Guha
    Zhao, Amy
    Sabuncu, Mert R.
    Guttag, John
    Dalca, Adrian, V
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (08) : 1788 - 1800
  • [3] Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved?
    Bernard, Olivier
    Lalande, Alain
    Zotti, Clement
    Cervenansky, Frederick
    Yang, Xin
    Heng, Pheng-Ann
    Cetin, Irem
    Lekadir, Karim
    Camara, Oscar
    Gonzalez Ballester, Miguel Angel
    Sanroma, Gerard
    Napel, Sandy
    Petersen, Steffen
    Tziritas, Georgios
    Grinias, Elias
    Khened, Mahendra
    Kollerathu, Varghese Alex
    Krishnamurthi, Ganapathy
    Rohe, Marc-Michel
    Pennec, Xavier
    Sermesant, Maxime
    Isensee, Fabian
    Jaeger, Paul
    Maier-Hein, Klaus H.
    Full, Peter M.
    Wolf, Ivo
    Engelhardt, Sandy
    Baumgartner, Christian F.
    Koch, Lisa M.
    Wolterink, Jelmer M.
    Isgum, Ivana
    Jang, Yeonggul
    Hong, Yoonmi
    Patravali, Jay
    Jain, Shubham
    Humbert, Olivier
    Jodoin, Pierre-Marc
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) : 2514 - 2525
  • [4] Cardiac aging synthesis from cross-sectional data with conditional generative adversarial networks
    Campello, Victor M.
    Xia, Tian
    Liu, Xiao
    Sanchez, Pedro
    Martin-Isla, Carlos
    Petersen, Steffen E.
    Segui, Santi
    Tsaftaris, Sotirios A.
    Lekadir, Karim
    [J]. FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [5] Dhariwal P, 2021, ADV NEUR IN, V34
  • [6] Estimating brain age based on a uniform healthy population with deep learning and structural magnetic resonance imaging
    Feng, Xinyang
    Lipton, Zachary C.
    Yang, Jie
    Small, Scott A.
    Provenzano, Frank A.
    [J]. NEUROBIOLOGY OF AGING, 2020, 91 : 15 - 25
  • [7] Harvey W, 2022, ADV NEUR IN
  • [8] Ho J., 2022, Imagen Video: High Definition Video Generation with Diffusion Models
  • [9] Ho J., 2022, Classifier-Free Diffusion Guidance
  • [10] Ho J., 2020, P NIPS, V33, P6840