Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance

被引:0
作者
Zhang, Weiyi [1 ]
Huang, Siyu [2 ]
Yang, Jiancheng [3 ]
Chen, Ruoyu [1 ]
Ge, Zongyuan [4 ]
Zheng, Yingfeng [5 ]
Shi, Danli [1 ]
He, Mingguang [1 ]
机构
[1] Hong Kong Polytech Univ, Kowloon, Hong Kong, Peoples R China
[2] Clemson Univ, Clemson, SC USA
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[4] Monash Univ, Melbourne, Vic, Australia
[5] Sun Yat Sen Univ, Guangzhou, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT I | 2024年 / 15001卷
关键词
Video Generation; Generative Adversarial Network; Autoregressive Generation; Retinal Fundus Photography; Fluorescence Angiography; FLUORESCEIN ANGIOGRAPHY; DIABETIC-RETINOPATHY; AUTOMATED DETECTION;
D O I
10.1007/978-3-031-72378-0_64
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fundus Fluorescein Angiography (FFA) is a critical tool for assessing retinal vascular dynamics and aiding in the diagnosis of eye diseases. However, its invasive nature and less accessibility compared to Color Fundus (CF) images pose significant challenges. Current CF to FFA translation methods are limited to static generation. In this work, we pioneer dynamic FFA video generation from static CF images. We introduce an autoregressive GAN for smooth, memory-saving frame-by-frame FFA synthesis. To enhance the focus on dynamic lesion changes in FFA regions, we design a knowledge mask based on clinical experience. Leveraging this mask, our approach integrates innovative knowledge mask-guided techniques, including knowledge-boosted attention, knowledge-aware discriminators, and mask-enhanced patchNCE loss, aimed at refining generation in critical areas and addressing the pixel misalignment challenge. Our method achieves the best FVD of 1503.21 and PSNR of 11.81 compared to other common video generation approaches. Human assessment by an ophthalmologist confirms its high generation quality. Notably, our knowledge mask surpasses supervised lesion segmentation masks, offering a promising non-invasive alternative to traditional FFA for research and clinical applications. The code is available at https://github.com/Michi-3000/Fundus2Video.
引用
收藏
页码:689 / 699
页数:11
相关论文
共 29 条
  • [1] Translating color fundus photography to indocyanine green angiography using deep-learning for age-related macular degeneration screening
    Chen, Ruoyu
    Zhang, Weiyi
    Song, Fan
    Yu, Honghua
    Cao, Dan
    Zheng, Yingfeng
    He, Mingguang
    Shi, Danli
    [J]. NPJ DIGITAL MEDICINE, 2024, 7 (01)
  • [2] Series-Parallel Generative Adversarial Network Architecture for Translating from Fundus Structure Image to Fluorescence Angiography
    Chen, Yiwei
    He, Yi
    Li, Wanyue
    Wang, Jing
    Li, Ping
    Xing, Lina
    Zhang, Xin
    Shi, Guohua
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] Quantification of retinal blood leakage in fundus fluorescein angiography in a retinal angiogenesis model
    Comin, Cesar H.
    Tsirukis, Demetrios, I
    Sun, Ye
    Xu, Xiaoyin
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [4] de Carlo Talisa E, 2015, Int J Retina Vitreous, V1, P5
  • [5] Dorjsembe Z, 2024, Arxiv, DOI [arXiv:2305.18453, 10.48550/arXiv.2305.18453]
  • [6] Algorithms for the Automated Detection of Diabetic Retinopathy Using Digital Fundus Images: A Review
    Faust, Oliver
    Acharya U, Rajendra
    Ng, E. Y. K.
    Ng, Kwan-Hoong
    Suri, Jasjit S.
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (01) : 145 - 157
  • [7] Simultaneous indocyanine green and fluorescein angiography using a confocal scanning laser ophthalmoscope
    Freeman, WR
    Bartsch, DU
    Mueller, AJ
    Banker, AS
    Weinreb, RN
    [J]. ARCHIVES OF OPHTHALMOLOGY, 1998, 116 (04) : 455 - 463
  • [8] Scope of validity of PSNR in image/video quality assessment
    Huynh-Thu, Q.
    Ghanbari, M.
    [J]. ELECTRONICS LETTERS, 2008, 44 (13) : 800 - U35
  • [9] Globally and Locally Consistent Image Completion
    Iizuka, Satoshi
    Simo-Serra, Edgar
    Ishikawa, Hiroshi
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [10] Image-to-Image Translation with Conditional Adversarial Networks
    Isola, Phillip
    Zhu, Jun-Yan
    Zhou, Tinghui
    Efros, Alexei A.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5967 - 5976