Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance

被引：0

作者：

Zhang, Weiyi ^{[1
]}

Huang, Siyu ^{[2
]}

Yang, Jiancheng ^{[3
]}

Chen, Ruoyu ^{[1
]}

Ge, Zongyuan ^{[4
]}

Zheng, Yingfeng ^{[5
]}

Shi, Danli ^{[1
]}

He, Mingguang ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Kowloon, Hong Kong, Peoples R China

[2] Clemson Univ, Clemson, SC USA

[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[4] Monash Univ, Melbourne, Vic, Australia

[5] Sun Yat Sen Univ, Guangzhou, Peoples R China

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT I | 2024年 / 15001卷

关键词：

Video Generation; Generative Adversarial Network; Autoregressive Generation; Retinal Fundus Photography; Fluorescence Angiography; FLUORESCEIN ANGIOGRAPHY; DIABETIC-RETINOPATHY; AUTOMATED DETECTION;

D O I：

10.1007/978-3-031-72378-0_64

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fundus Fluorescein Angiography (FFA) is a critical tool for assessing retinal vascular dynamics and aiding in the diagnosis of eye diseases. However, its invasive nature and less accessibility compared to Color Fundus (CF) images pose significant challenges. Current CF to FFA translation methods are limited to static generation. In this work, we pioneer dynamic FFA video generation from static CF images. We introduce an autoregressive GAN for smooth, memory-saving frame-by-frame FFA synthesis. To enhance the focus on dynamic lesion changes in FFA regions, we design a knowledge mask based on clinical experience. Leveraging this mask, our approach integrates innovative knowledge mask-guided techniques, including knowledge-boosted attention, knowledge-aware discriminators, and mask-enhanced patchNCE loss, aimed at refining generation in critical areas and addressing the pixel misalignment challenge. Our method achieves the best FVD of 1503.21 and PSNR of 11.81 compared to other common video generation approaches. Human assessment by an ophthalmologist confirms its high generation quality. Notably, our knowledge mask surpasses supervised lesion segmentation masks, offering a promising non-invasive alternative to traditional FFA for research and clinical applications. The code is available at https://github.com/Michi-3000/Fundus2Video.

引用

页码：689 / 699

页数：11

共 29 条

[1] Translating color fundus photography to indocyanine green angiography using deep-learning for age-related macular degeneration screening
Chen, Ruoyu
Zhang, Weiyi
Song, Fan
Yu, Honghua
Cao, Dan
Zheng, Yingfeng
He, Mingguang
Shi, Danli
[J]. NPJ DIGITAL MEDICINE, 2024, 7 (01)
[2] Series-Parallel Generative Adversarial Network Architecture for Translating from Fundus Structure Image to Fluorescence Angiography
Chen, Yiwei
He, Yi
Li, Wanyue
Wang, Jing
Li, Ping
Xing, Lina
Zhang, Xin
Shi, Guohua
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
[3] Quantification of retinal blood leakage in fundus fluorescein angiography in a retinal angiogenesis model
Comin, Cesar H.
Tsirukis, Demetrios, I
Sun, Ye
Xu, Xiaoyin
[J]. SCIENTIFIC REPORTS, 2021, 11 (01)
[4] de Carlo Talisa E, 2015, Int J Retina Vitreous, V1, P5
[5] Dorjsembe Z, 2024, Arxiv, DOI [arXiv:2305.18453, 10.48550/arXiv.2305.18453]
[6] Algorithms for the Automated Detection of Diabetic Retinopathy Using Digital Fundus Images: A Review
Faust, Oliver
Acharya U, Rajendra
Ng, E. Y. K.
Ng, Kwan-Hoong
Suri, Jasjit S.
[J]. JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (01) : 145 - 157
[7] Simultaneous indocyanine green and fluorescein angiography using a confocal scanning laser ophthalmoscope
Freeman, WR
Bartsch, DU
Mueller, AJ
Banker, AS
Weinreb, RN
[J]. ARCHIVES OF OPHTHALMOLOGY, 1998, 116 (04) : 455 - 463
[8] Scope of validity of PSNR in image/video quality assessment
Huynh-Thu, Q.
Ghanbari, M.
[J]. ELECTRONICS LETTERS, 2008, 44 (13) : 800 - U35
[9] Globally and Locally Consistent Image Completion
Iizuka, Satoshi
Simo-Serra, Edgar
Ishikawa, Hiroshi
[J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
[10] Image-to-Image Translation with Conditional Adversarial Networks
Isola, Phillip
Zhu, Jun-Yan
Zhou, Tinghui
Efros, Alexei A.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976

← 1 2 3 →