Controllable Cardiac Synthesis via Disentangled Anatomy Arithmetic

被引:12
|
作者
Thermos, Spyridon [1 ]
Liu, Xiao [1 ]
O'Neil, Alison [1 ,3 ]
Tsaftaris, Sotirios A. [1 ,2 ]
机构
[1] Univ Edinburgh, Sch Engn, Edinburgh EH9 3FB, Midlothian, Scotland
[2] Alan Turing Inst, London NW1 2DB, England
[3] Canon Med Res Europe, Edinburgh EH6 5NP, Midlothian, Scotland
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III | 2021年 / 12903卷
基金
英国工程与自然科学研究理事会;
关键词
Disentangled anatomy arithmetic; Semantic image synthesis; Cardiac data augmentation;
D O I
10.1007/978-3-030-87199-4_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acquiring annotated data at scale with rare diseases or conditions remains a challenge. It would be extremely useful to have a method that controllably synthesizes images that can correct such under-representation. Assuming a proper latent representation, the idea of a "latent vector arithmetic" could offer the means of achieving such synthesis. A proper representation must encode the fidelity of the input data, preserve invariance and equivariance, and permit arithmetic operations. Motivated by the ability to disentangle images into spatial anatomy (tensor) factors and accompanying imaging (vector) representations, we propose a framework termed "disentangled anatomy arithmetic" , in which a generative model learns to combine anatomical factors of different input images such that when they are re-entangled with the desired imaging modality (e.g. MRI), plausible new cardiac images are created with the target characteristics. To encourage a realistic combination of anatomy factors after the arithmetic step, we propose a localized noise injection network that precedes the generator. Our model is used to generate realistic images, pathology labels, and segmentation masks that are used to augment the existing datasets and subsequently improve post-hoc classification and segmentation tasks. Code is publicly available at https://github.com/vios-s/DAA-GAN.
引用
收藏
页码:160 / 170
页数:11
相关论文
共 50 条
  • [1] StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
    Wang, Zhizhong
    Zhao, Lei
    Xing, Wei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7643 - 7655
  • [2] Disentangled Representation Learning for Controllable Image Synthesis: an Information-Theoretic Perspective
    Tang, Shichang
    Zhou, Xu
    He, Xuming
    Ma, Yi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10042 - 10049
  • [3] Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis
    Wang, Duomin
    Deng, Yu
    Yin, Zixin
    Shum, Heung-Yeung
    Wang, Baoyuan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17979 - 17989
  • [4] Unsupervised Retina Image Synthesis via Disentangled Representation Learning
    Li, Kang
    Yu, Lequan
    Wang, Shujun
    Heng, Pheng-Ann
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2019, 2019, 11827 : 32 - 41
  • [5] Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations
    Oh, Yoori
    Lee, Juheon
    Han, Yoseob
    Lee, Kyogu
    INTERSPEECH 2023, 2023, : 4818 - 4822
  • [6] Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization
    Ge, Yunhao
    Xu, Zhi
    Xiao, Yao
    Xin, Gan
    Pang, Yunkui
    Itti, Laurent
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4750 - 4758
  • [7] Learning disentangled representations for controllable human motion prediction
    Gu, Chunzhi
    Yu, Jun
    Zhang, Chao
    PATTERN RECOGNITION, 2024, 146
  • [8] Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning
    Deng, Yu
    Yang, Jiaolong
    Chen, Dong
    Wen, Fang
    Tong, Xin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5153 - 5162
  • [9] Disentangled Representation Learning for Controllable Person Image Generation
    Xu, Wenju
    Long, Chengjiang
    Nie, Yongwei
    Wang, Guanghui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6065 - 6077
  • [10] Disentangled Lifespan Synthesis via Transformer-Based Nonlinear Regression
    Li, Mingyuan
    Guo, Yingchun
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)