VSG-GAN: A high-fidelity image synthesis method with semantic manipulation in retinal fundus image

被引:5
作者
Liu, Junjie [2 ,3 ,4 ,5 ]
Xu, Shixin [6 ]
He, Ping [2 ,3 ]
Wu, Sirong [2 ,3 ,4 ]
Luo, Xi [2 ,3 ,4 ]
Deng, Yuhui [2 ,3 ]
Huang, Huaxiong [1 ,2 ,7 ]
机构
[1] Beijing Normal Univ, Res Ctr Math, Zhuhai, Peoples R China
[2] Guangdong Prov Key Lab Interdisciplinary Res & App, Zhuhai, Peoples R China
[3] BNU HKBU United Int Coll, Zhuhai, Peoples R China
[4] Hong Kong Baptist Univ, Fac Sci, Hong Kong, Peoples R China
[5] Trinity Coll Dublin, Dublin, Ireland
[6] Duke Kunshan Univ, Data Sci Res Ctr, Kunshan, Jiangsu, Peoples R China
[7] York Univ, Dept Math & Stat, Toronto, ON, Canada
基金
欧洲研究理事会; 中国国家自然科学基金;
关键词
SEGMENTATION;
D O I
10.1016/j.bpj.2024.02.019
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
In recent years, advancements in retinal image analysis, driven by machine learning and deep learning techniques, have enhanced disease detection and diagnosis through automated feature extraction. However, challenges persist, including limited data set diversity due to privacy concerns and imbalanced sample pairs, hindering effective model training. To address these issues, we introduce the vessel and style guided generative adversarial network (VSG-GAN), an innovative algorithm building upon the foundational concept of GAN. In VSG-GAN, a generator and discriminator engage in an adversarial process to produce realistic retinal images. Our approach decouples retinal image generation into distinct modules: the vascular skeleton and background style. Leveraging style transformation and GAN inversion, our proposed hierarchical variational autoencoder module generates retinal images with diverse morphological traits. In addition, the spatially adaptive denormalization module ensures consistency between input and generated images. We evaluate our model on MESSIDOR and RITE data sets using various metrics, including structural similarity index measure, inception score, Frechet inception distance, and kernel inception distance. Our results demonstrate the superiority of VSG-GAN, outperforming existing methods across all evaluation assessments. This underscores its effectiveness in addressing data set limitations and imbalances. Our algorithm provides a novel solution to challenges in retinal image analysis by offering diverse and realistic retinal image generation. Implementing the VSG-GAN augmentation approach on downstream diabetic retinopathy classification tasks has shown enhanced disease diagnosis accuracy, further advancing the utility of machine learning in this domain.
引用
收藏
页码:2815 / 2829
页数:15
相关论文
共 55 条
[21]   Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization [J].
Huang, Xun ;
Belongie, Serge .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1510-1519
[22]   Deep Reinforcement Learning-Based Retinal Imaging in Alzheimer's Disease: Potential and Perspectives [J].
Hui, Herbert Y. H. ;
Ran, An Ran ;
Dai, Jia Jia ;
Cheung, Carol Y. .
JOURNAL OF ALZHEIMERS DISEASE, 2023, 94 (01) :39-50
[23]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[24]  
Jahanian A., 2019, On the" steerability" of generative adversarial networks", DOI 10.48550/arXiv:1907.07171
[25]   Perceptual Losses for Real-Time Style Transfer and Super-Resolution [J].
Johnson, Justin ;
Alahi, Alexandre ;
Li Fei-Fei .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :694-711
[26]   Contrastive Learning for Generating Optical Coherence Tomography Images of the Retina [J].
Kaplan, Sinan ;
Lensu, Lasse .
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2022, 2022, 13570 :112-121
[27]  
Karras T, 2018, Arxiv, DOI arXiv:1710.10196
[28]   A Style-Based Generator Architecture for Generative Adversarial Networks [J].
Karras, Tero ;
Laine, Samuli ;
Aila, Timo .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4396-4405
[29]   Synthesizing realistic high-resolution retina image by style-based generative adversarial network and its utilization [J].
Kim, Mingyu ;
Kim, You Na ;
Jang, Miso ;
Hwang, Jeongeun ;
Kim, Hong-Kyu ;
Yoon, Sang Chul ;
Kim, Yoon Jeon ;
Kim, Namkug .
SCIENTIFIC REPORTS, 2022, 12 (01)
[30]   Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis [J].
Li, Chuan ;
Wand, Michael .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2479-2486