Stitched vision transformer for age-related macular degeneration detection using retinal optical coherence tomography images

被引：2

作者：

Azizi, Mohammad Mahdi ^{[1
]}

Abhari, Setareh ^{[1
]}

Sajedi, Hedieh ^{[1
]}

机构：

[1] Univ Tehran, Coll Sci, Dept Math Stat & Comp Sci, Tehran, Iran

来源：

PLOS ONE | 2024年 / 19卷 / 06期

基金：

美国国家科学基金会;

关键词：

CLASSIFICATION; EXTRACTION; DISEASES; EDEMA; CNN;

D O I：

10.1371/journal.pone.0304943

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Age-related macular degeneration (AMD) is an eye disease that leads to the deterioration of the central vision area of the eye and can gradually result in vision loss in elderly individuals. Early identification of this disease can significantly impact patient treatment outcomes. Furthermore, given the increasing elderly population globally, the importance of automated methods for rapidly monitoring at-risk individuals and accurately diagnosing AMD is growing daily. One standard method for diagnosing AMD is using optical coherence tomography (OCT) images as a non-invasive imaging technology. In recent years, numerous deep neural networks have been proposed for the classification of OCT images. Utilizing pre-trained neural networks can speed up model deployment in related tasks without compromising accuracy. However, most previous methods overlook the feasibility of leveraging pre-existing trained networks to search for an optimal architecture for AMD staging on a new target dataset. In this study, our objective was to achieve an optimal architecture in the efficiency-accuracy trade-off for classifying retinal OCT images. To this end, we employed pre-trained medical vision transformer (MedViT) models. MedViT combines convolutional and transformer neural networks, explicitly designed for medical image classification. Our approach involved pre-training two distinct MedViT models on a source dataset with labels identical to those in the target dataset. This pre-training was conducted in a supervised manner. Subsequently, we evaluated the performance of the pre-trained MedViT models for classifying retinal OCT images from the target Noor Eye Hospital (NEH) dataset into the normal, drusen, and choroidal neovascularization (CNV) classes in zero-shot settings and through five-fold cross-validation. Then, we proposed a stitching approach to search for an optimal model from two MedViT family models. The proposed stitching method is an efficient architecture search algorithm known as stitchable neural networks. Stitchable neural networks create a candidate model in search space for each pair of stitchable layers by inserting a linear layer between them. A pair of stitchable layers consists of layers, each selected from one input model. While stitchable neural networks had previously been tested on more extensive and general datasets, this study demonstrated that stitching networks could also be helpful in smaller medical datasets. The results of this approach indicate that when pre-trained models were available for OCT images from another dataset, it was possible to achieve a model in 100 epochs with an accuracy of over 94.9% in classifying images from the NEH dataset. The results of this study demonstrate the efficacy of stitchable neural networks as a fine-tuning method for OCT image classification. This approach not only leads to higher accuracy but also considers architecture optimization at a reasonable computational cost.

引用

页数：24

共 65 条

[1] Multi-Stage Classification of Retinal OCT Using Multi-Scale Ensemble Deep Architecture
Akinniyi, Oluwatunmise
Rahman, Md Mahmudur
Sandhu, Harpal Singh
El-Baz, Ayman
Khalifa, Fahmi
[J]. BIOENGINEERING-BASEL, 2023, 10 (07):
[2] Albarrak A., 2013, P 2013 INT C MED IM, P59
[3] Pyramidal deep neural network for classification of retinal OCT images
Almasganj, Mohammad
Fatemizadeh, Emad
[J]. 2023 30TH NATIONAL AND 8TH INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING, ICBME, 2023, : 381 - 385
[4] Aykat Sukru, 2023, International Journal of Computational and Experimental Science and Engineering, V9, P62
[5] Wavelet scattering transform application in classification of retinal abnormalities using OCT images
Baharlouei, Zahra
Rabbani, Hossein
Plonka, Gerlind
[J]. SCIENTIFIC REPORTS, 2023, 13 (01)
[6] Artificial intelligence based detection of age-related macular degeneration using optical coherence tomography with unique image preprocessing
Celebi, Ali Riza Cenk
Bulut, Erkan
Sezer, Aysun
[J]. EUROPEAN JOURNAL OF OPHTHALMOLOGY, 2023, 33 (01) : 65 - 73
[7] A Deep Learning-Based Framework for Retinal Disease Classification
Choudhary, Amit
Ahlawat, Savita
Urooj, Shabana
Pathak, Nitish
Lay-Ekuakille, Aime
Sharma, Neelam
[J]. HEALTHCARE, 2023, 11 (02)
[8] B-Scan Attentive CNN for the Classification of Retinal Optical Coherence Tomography Volumes
Das, Vineeta
Prabhakararao, Eedara
Dandapat, Samarendra
Bora, Prabin Kumar
[J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (1025-1029) : 1025 - 1029
[9] Multi-scale deep feature fusion for automated classification of macular pathologies from OCT images
Das, Vineeta
Dandapat, Samarendra
Bora, Prabin Kumar
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 54
[10] Classification and segmentation of OCT images for age-related macular degeneration based on dual guidance networks
Diao, Shengyong
Su, Jinzhu
Yang, Changqing
Zhu, Weifang
Xiang, Dehui
Chen, Xinjian
Peng, Qing
Shi, Fei
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84

← 1 2 3 4 5 6 7 →