Controllable Cardiac Synthesis via Disentangled Anatomy Arithmetic

被引：12

作者：

Thermos, Spyridon ^{[1
]}

Liu, Xiao ^{[1
]}

O'Neil, Alison ^{[1
,3
]}

Tsaftaris, Sotirios A. ^{[1
,2
]}

机构：

[1] Univ Edinburgh, Sch Engn, Edinburgh EH9 3FB, Midlothian, Scotland

[2] Alan Turing Inst, London NW1 2DB, England

[3] Canon Med Res Europe, Edinburgh EH6 5NP, Midlothian, Scotland

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III | 2021年 / 12903卷

基金：

英国工程与自然科学研究理事会;

关键词：

Disentangled anatomy arithmetic; Semantic image synthesis; Cardiac data augmentation;

D O I：

10.1007/978-3-030-87199-4_15

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Acquiring annotated data at scale with rare diseases or conditions remains a challenge. It would be extremely useful to have a method that controllably synthesizes images that can correct such under-representation. Assuming a proper latent representation, the idea of a "latent vector arithmetic" could offer the means of achieving such synthesis. A proper representation must encode the fidelity of the input data, preserve invariance and equivariance, and permit arithmetic operations. Motivated by the ability to disentangle images into spatial anatomy (tensor) factors and accompanying imaging (vector) representations, we propose a framework termed "disentangled anatomy arithmetic" , in which a generative model learns to combine anatomical factors of different input images such that when they are re-entangled with the desired imaging modality (e.g. MRI), plausible new cardiac images are created with the target characteristics. To encourage a realistic combination of anatomy factors after the arithmetic step, we propose a localized noise injection network that precedes the generator. Our model is used to generate realistic images, pathology labels, and segmentation masks that are used to augment the existing datasets and subsequently improve post-hoc classification and segmentation tasks. Code is publicly available at https://github.com/vios-s/DAA-GAN.

引用

页码：160 / 170

页数：11

共 50 条

[1] StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Wang, Zhizhong
Zhao, Lei
Xing, Wei
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7643 - 7655
[2] Disentangled Representation Learning for Controllable Image Synthesis: an Information-Theoretic Perspective
Tang, Shichang
Zhou, Xu
He, Xuming
Ma, Yi
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10042 - 10049
[3] Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis
Wang, Duomin
Deng, Yu
Yin, Zixin
Shum, Heung-Yeung
Wang, Baoyuan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17979 - 17989
[4] Unsupervised Retina Image Synthesis via Disentangled Representation Learning
Li, Kang
Yu, Lequan
Wang, Shujun
Heng, Pheng-Ann
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2019, 2019, 11827 : 32 - 41
[5] Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations
Oh, Yoori
Lee, Juheon
Han, Yoseob
Lee, Kyogu
INTERSPEECH 2023, 2023, : 4818 - 4822
[6] Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization
Ge, Yunhao
Xu, Zhi
Xiao, Yao
Xin, Gan
Pang, Yunkui
Itti, Laurent
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4750 - 4758
[7] Learning disentangled representations for controllable human motion prediction
Gu, Chunzhi
Yu, Jun
Zhang, Chao
PATTERN RECOGNITION, 2024, 146
[8] Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning
Deng, Yu
Yang, Jiaolong
Chen, Dong
Wen, Fang
Tong, Xin
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5153 - 5162
[9] Disentangled Representation Learning for Controllable Person Image Generation
Xu, Wenju
Long, Chengjiang
Nie, Yongwei
Wang, Guanghui
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6065 - 6077
[10] Disentangled Lifespan Synthesis via Transformer-Based Nonlinear Regression
Li, Mingyuan
Guo, Yingchun
COMPUTER GRAPHICS FORUM, 2024, 43 (07)

← 1 2 3 4 5 →