Generative artificial intelligence for ophthalmic images: developments, applications and challenges

被引:0
作者
Li, Tingyao [1 ]
Wang, Zheyuan [1 ]
Jiang, Zehua [2 ]
Zhong, Huaiqin [3 ]
Qin, Yiming [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Tsinghua Univ, Beijing Tsinghua Changgung Hosp, Beijing Visual Sci & Translat Eye Res Inst BERI, Eye Ctr,Sch Clin Med,Tsinghua Med, Beijing, Peoples R China
[3] Shanghai Hlth & Med Ctr, Dept Geriatr, Wuxi, Peoples R China
关键词
Generative artificial intelligence; Generative models; Ophthalmic imaging; ADVERSARIAL NETWORK; SUPERRESOLUTION; PREDICTION;
D O I
10.1007/s00371-025-03988-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Generative Artificial Intelligence (GenAI) is revolutionizing ophthalmology imaging by addressing critical limitations in data availability, annotation costs, and clinical workflow automation. This review provides a comprehensive analysis of GenAI's technical innovations, clinical applications, and persistent challenges within the ophthalmic imaging domain. We first survey the evolution of generative architectures, from Generative Adversarial Networks to Diffusion Models and vision-language frameworks. These innovations enable novel applications including counterfactual pathology synthesis, longitudinal disease progression modeling, and post-treatment outcome visualization, which enhance diagnostic precision and patient engagement. We then systematically review methodological advancements in GenAI, with a focused analysis on key clinical application categories: image generation, cross-modal domain transfer, image enhancement, post-treatment prediction, image segmentation, and vision-language tasks. Finally, we critically evaluate generative models, evaluation methods, and persistent challenges, such as the need for standardized evaluation frameworks, anatomical fidelity validation, and equitable integration into global healthcare systems. By addressing these barriers, GenAI holds transformative potential to improve diagnostic accuracy, streamline personalized treatment workflows, and democratize access to high-quality ophthalmic care.
引用
收藏
页数:27
相关论文
共 162 条
[1]   Bridging the resources gap: deep learning for fluorescein angiography and optical coherence tomography macular thickness map image translation [J].
Abdelmotaal, Hazem ;
Sharaf, Mohamed ;
Soliman, Wael ;
Wasfi, Ehab ;
Kedwany, Salma M. .
BMC OPHTHALMOLOGY, 2022, 22 (01)
[2]  
Agrawal P, 2024, Arxiv, DOI arXiv:2410.07073
[3]  
Alaa AM, 2022, PR MACH LEARN RES, P290
[4]  
Alayrac JB, 2022, ADV NEUR IN
[5]   Advancing Retinal Image Segmentation: A Denoising Diffusion Probabilistic Model Perspective [J].
Alimanov, Alnur ;
Islam, Md Baharul .
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024, 2024, :572-578
[6]   Denoising Diffusion Probabilistic Model for Retinal Image Generation and Segmentation [J].
Alimanov, Alnur ;
Islam, Md Baharul .
2023 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY, ICCP, 2023,
[7]   A Two-Stage GAN for High-Resolution Retinal Image Generation and Segmentation [J].
Andreini, Paolo ;
Ciano, Giorgio ;
Bonechi, Simone ;
Graziani, Caterina ;
Lachi, Veronica ;
Mecocci, Alessandro ;
Sodi, Andrea ;
Scarselli, Franco ;
Bianchini, Monica .
ELECTRONICS, 2022, 11 (01)
[8]  
Anil R., 2023, arXiv
[9]   VQA: Visual Question Answering [J].
Antol, Stanislaw ;
Agrawal, Aishwarya ;
Lu, Jiasen ;
Mitchell, Margaret ;
Batra, Dhruv ;
Zitnick, C. Lawrence ;
Parikh, Devi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2425-2433
[10]  
Arjovsky M, 2017, PR MACH LEARN RES, V70