Stitched vision transformer for age-related macular degeneration detection using retinal optical coherence tomography images

被引:2
作者
Azizi, Mohammad Mahdi [1 ]
Abhari, Setareh [1 ]
Sajedi, Hedieh [1 ]
机构
[1] Univ Tehran, Coll Sci, Dept Math Stat & Comp Sci, Tehran, Iran
来源
PLOS ONE | 2024年 / 19卷 / 06期
基金
美国国家科学基金会;
关键词
CLASSIFICATION; EXTRACTION; DISEASES; EDEMA; CNN;
D O I
10.1371/journal.pone.0304943
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Age-related macular degeneration (AMD) is an eye disease that leads to the deterioration of the central vision area of the eye and can gradually result in vision loss in elderly individuals. Early identification of this disease can significantly impact patient treatment outcomes. Furthermore, given the increasing elderly population globally, the importance of automated methods for rapidly monitoring at-risk individuals and accurately diagnosing AMD is growing daily. One standard method for diagnosing AMD is using optical coherence tomography (OCT) images as a non-invasive imaging technology. In recent years, numerous deep neural networks have been proposed for the classification of OCT images. Utilizing pre-trained neural networks can speed up model deployment in related tasks without compromising accuracy. However, most previous methods overlook the feasibility of leveraging pre-existing trained networks to search for an optimal architecture for AMD staging on a new target dataset. In this study, our objective was to achieve an optimal architecture in the efficiency-accuracy trade-off for classifying retinal OCT images. To this end, we employed pre-trained medical vision transformer (MedViT) models. MedViT combines convolutional and transformer neural networks, explicitly designed for medical image classification. Our approach involved pre-training two distinct MedViT models on a source dataset with labels identical to those in the target dataset. This pre-training was conducted in a supervised manner. Subsequently, we evaluated the performance of the pre-trained MedViT models for classifying retinal OCT images from the target Noor Eye Hospital (NEH) dataset into the normal, drusen, and choroidal neovascularization (CNV) classes in zero-shot settings and through five-fold cross-validation. Then, we proposed a stitching approach to search for an optimal model from two MedViT family models. The proposed stitching method is an efficient architecture search algorithm known as stitchable neural networks. Stitchable neural networks create a candidate model in search space for each pair of stitchable layers by inserting a linear layer between them. A pair of stitchable layers consists of layers, each selected from one input model. While stitchable neural networks had previously been tested on more extensive and general datasets, this study demonstrated that stitching networks could also be helpful in smaller medical datasets. The results of this approach indicate that when pre-trained models were available for OCT images from another dataset, it was possible to achieve a model in 100 epochs with an accuracy of over 94.9% in classifying images from the NEH dataset. The results of this study demonstrate the efficacy of stitchable neural networks as a fine-tuning method for OCT image classification. This approach not only leads to higher accuracy but also considers architecture optimization at a reasonable computational cost.
引用
收藏
页数:24
相关论文
共 65 条
  • [1] Multi-Stage Classification of Retinal OCT Using Multi-Scale Ensemble Deep Architecture
    Akinniyi, Oluwatunmise
    Rahman, Md Mahmudur
    Sandhu, Harpal Singh
    El-Baz, Ayman
    Khalifa, Fahmi
    [J]. BIOENGINEERING-BASEL, 2023, 10 (07):
  • [2] Albarrak A., 2013, P 2013 INT C MED IM, P59
  • [3] Pyramidal deep neural network for classification of retinal OCT images
    Almasganj, Mohammad
    Fatemizadeh, Emad
    [J]. 2023 30TH NATIONAL AND 8TH INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING, ICBME, 2023, : 381 - 385
  • [4] Aykat Sukru, 2023, International Journal of Computational and Experimental Science and Engineering, V9, P62
  • [5] Wavelet scattering transform application in classification of retinal abnormalities using OCT images
    Baharlouei, Zahra
    Rabbani, Hossein
    Plonka, Gerlind
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [6] Artificial intelligence based detection of age-related macular degeneration using optical coherence tomography with unique image preprocessing
    Celebi, Ali Riza Cenk
    Bulut, Erkan
    Sezer, Aysun
    [J]. EUROPEAN JOURNAL OF OPHTHALMOLOGY, 2023, 33 (01) : 65 - 73
  • [7] A Deep Learning-Based Framework for Retinal Disease Classification
    Choudhary, Amit
    Ahlawat, Savita
    Urooj, Shabana
    Pathak, Nitish
    Lay-Ekuakille, Aime
    Sharma, Neelam
    [J]. HEALTHCARE, 2023, 11 (02)
  • [8] B-Scan Attentive CNN for the Classification of Retinal Optical Coherence Tomography Volumes
    Das, Vineeta
    Prabhakararao, Eedara
    Dandapat, Samarendra
    Bora, Prabin Kumar
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (1025-1029) : 1025 - 1029
  • [9] Multi-scale deep feature fusion for automated classification of macular pathologies from OCT images
    Das, Vineeta
    Dandapat, Samarendra
    Bora, Prabin Kumar
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 54
  • [10] Classification and segmentation of OCT images for age-related macular degeneration based on dual guidance networks
    Diao, Shengyong
    Su, Jinzhu
    Yang, Changqing
    Zhu, Weifang
    Xiang, Dehui
    Chen, Xinjian
    Peng, Qing
    Shi, Fei
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84