Improved VGG Algorithm for Visual Prosthesis Image Recognition

被引:4
作者
Li, Zhiguang [1 ]
Li, Baitao [2 ]
Jahng, Surng Gahb [1 ]
Jung, Changyong [1 ]
机构
[1] Chung Ang Univ, Sch Adv Imaging Sci Multimedia & Film, Seoul 06974, South Korea
[2] Henan Univ, Sch Journalism & Commun, Kaifeng 475000, Peoples R China
关键词
Visual prostheses; convolutional neural network; VGG; image recognition; Drosophila optimization algorithm;
D O I
10.1109/ACCESS.2024.3380839
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of bio-informatics, visual prostheses have become an effective method for low vision people to restore visual function. To meet the visual needs of visual implant recipients, this study explored the problem of in vitro image processor image recognition and classification. It selected the convolutional neural network framework VGG as the technical core, introduced the fruit fly optimization algorithm for optimizing the VGG recognition model, and constructed a visual prosthesis image recognition model on the ground of improved VGG. The experiment demonstrated that the improved fruit fly search algorithm had an average absolute error and root mean square error values lower than 0.4, which was superior to other intelligent optimization algorithms. The performance of the improved image recognition model has been significantly improved, with a maximum AUC value of 0.942, a recognition accuracy range of 68.29%-97.23%, and a stable fitness curve of around 97.00. The maximum F1 value of the image recognition model designed in the study reached 91.47%, and the loss function curve converged to the minimum value. In the application of visual prostheses, the recognition accuracy and R-squared performance of this model were both the best. Compared with natural human vision, the contrast and functional visual effects matched well, the processing speed was faster, and the delay time did not affect the actual application of visual prostheses, achieving high user satisfaction. This study can enrich and improve the theoretical foundation of visual image analysis and visual prosthetics, and help visually impaired groups improve their lives and quality of life.
引用
收藏
页码:45727 / 45739
页数:13
相关论文
共 32 条
[1]   An on-chip photonic deep neural network for image classification [J].
Ashtiani, Farshid ;
Geers, Alexander J. ;
Aflatouni, Firooz .
NATURE, 2022, 606 (7914) :501-+
[2]   A 1984-Pixels, 1.26 nW/Pixel Retinal Prosthesis Chip With Time-Domain In-Pixel Image Processing and Bipolar Stimulating Electrode Sharing [J].
Choi, Dong-Hwi ;
Jee, Dong-Woo .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2023, 58 (10) :2757-2766
[3]   Convolutional neural network classifies visual stimuli from cortical response recorded with wide-field imaging in mice [J].
De Luca, Daniela ;
Moccia, Sara ;
Lupori, Leonardo ;
Mazziotti, Raffaele ;
Pizzorusso, Tommaso ;
Micera, Silvestro .
JOURNAL OF NEURAL ENGINEERING, 2023, 20 (02)
[4]   Multi-feature fusion: Graph neural network and CNN combining for hyperspectral image classification [J].
Ding, Yao ;
Zhang, Zhili ;
Zhao, Xiaofeng ;
Hong, Danfeng ;
Cai, Wei ;
Yu, Chengguo ;
Yang, Nengjun ;
Cai, Weiwei .
NEUROCOMPUTING, 2022, 501 :246-257
[5]   A novel elitist fruit fly optimization algorithm [J].
He, Jieguang ;
Peng, Zhiping ;
Qiu, Jinbo ;
Cui, Delong ;
Li, Qirui .
SOFT COMPUTING, 2023, 27 (08) :4823-4851
[6]   BioAdhere: tailor-made bioadhesives for epiretinal visual prostheses [J].
Hintzen, Kai-Wolfgang ;
Simons, Christian ;
Schaffrath, Kim ;
Roessler, Gernot ;
Johnen, Sandra ;
Jakob, Felix ;
Walter, Peter ;
Schwaneberg, Ulrich ;
Lohmann, Tibor .
BIOMATERIALS SCIENCE, 2022, 10 (12) :3282-3295
[7]   Flexible ultrasound-induced retinal stimulating piezo-arrays for biomimetic visual prostheses [J].
Jiang, Laiming ;
Lu, Gengxi ;
Zeng, Yushun ;
Sun, Yizhe ;
Kang, Haochen ;
Burford, James ;
Gong, Chen ;
Humayun, Mark S. ;
Chen, Yong ;
Zhou, Qifa .
NATURE COMMUNICATIONS, 2022, 13 (01)
[8]   Attitudes of potential recipients toward emerging visual prosthesis technologies [J].
Karadima, Vicky ;
Pezaris, Elizabeth A. ;
Pezaris, John S. .
SCIENTIFIC REPORTS, 2023, 13 (01)
[9]   A new framework for grayscale ear images recognition using generative adversarial networks under unconstrained conditions [J].
Khaldi, Yacine ;
Benzaoui, Amir .
EVOLVING SYSTEMS, 2021, 12 (04) :923-934
[10]  
Lachmann M, 2022, EUR HEART J-DIGIT HL, V3, P153, DOI 10.1093/ehjdh/ztac004