HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images

Cited by: 2
Authors
Laouarem A. [1]
Kara-Mohamed C. [1]
Bourennane E.-B. [2]
Hamdi-Cherif A. [1]
Affiliations
[1] Department of Computer Science, University of Ferhat Abbas 1, Setif
[2] ImViA Laboratory, University of Burgundy, Dijon
Keywords
Convolutional Neural Network; Deep learning; Hybridization; Optical coherence tomography; Retinal disease; Vision transformer
DOI
10.1016/j.compbiomed.2024.108726
Abstract
Retinal diseases are among today's major public health issues and call for advanced computer-aided diagnosis. We propose a hybrid model for multi-label classification that automatically classifies seven retinal diseases from Optical Coherence Tomography (OCT) images. We show that combining the strengths of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) yields a more powerful model for medical image classification, especially when local lesion information matters, as it does for retinal diseases. CNNs use parameters efficiently and extract local features and multi-scale feature maps through convolutional operations. The ViT's self-attention mechanism, on the other hand, captures long-range and global dependencies within an image. The paper shows that hybridizing these complementary capabilities (CNN-ViT) gives an image processing approach that is more robust and efficient. Instead of tokenizing the raw input OCT image directly in the transformer, the proposed model adopts a hierarchical CNN module called Convolutional Patch and Token Embedding (CPTE). The CPTE module's role is to incorporate an inductive bias, to reduce the reliance on large-scale datasets, and to address the low-level feature extraction challenges of the ViT. In addition, given the importance of local lesion information in OCT images, the model relies on a parallel module called Residual Depthwise-Pointwise ConvNet (RDP-ConvNet) for extracting high-level features. RDP-ConvNet uses depthwise and pointwise convolution layers within a residual network architecture.
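The parameter-efficiency argument behind depthwise and pointwise convolutions can be illustrated with simple arithmetic. The following is a minimal sketch, not the authors' RDP-ConvNet: the function names and the channel and kernel sizes are illustrative assumptions, and biases are ignored.

```python
def standard_conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_pointwise_params(c_in, c_out, k):
    """Weight count of a depthwise k x k convolution followed by a
    pointwise 1 x 1 convolution (biases omitted)."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Illustrative sizes only; the abstract does not give layer dimensions.
c_in, c_out, k = 64, 128, 3
standard = standard_conv_params(c_in, c_out, k)
separable = depthwise_pointwise_params(c_in, c_out, k)
print(standard, separable, round(standard / separable, 2))
```

With these assumed sizes, the depthwise-pointwise pair needs roughly an eighth of the weights of a standard convolution with the same input and output channels.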
The HTC-Retina model was evaluated on three datasets (OCT-2017, OCT-C8, and OCT-2014), where it outperformed previously established models, achieving accuracies of 99.40%, 97.00%, and 99.77% and sensitivities of 99.41%, 97.00%, and 99.77%, respectively. Notably, the model achieved this performance while maintaining computational efficiency. © 2024 Elsevier Ltd
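Accuracy and sensitivity, the two metrics reported above, can both be read off a confusion matrix. The sketch below assumes sensitivity is macro-averaged over classes, which the abstract does not state; the 3-class matrix is made-up illustration data, not results from the paper, and the function names are hypothetical.

```python
def accuracy(cm):
    """Fraction of all samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def macro_sensitivity(cm):
    """Mean per-class sensitivity (recall); rows are true classes."""
    recalls = [cm[i][i] / sum(cm[i]) for i in range(len(cm))]
    return sum(recalls) / len(recalls)


# Made-up confusion matrix with unequal class sizes (rows: true class,
# columns: predicted class). Not data from the paper.
cm = [
    [90, 5, 5],
    [2, 46, 2],
    [1, 1, 18],
]
print(f"accuracy={accuracy(cm):.4f}  sensitivity={macro_sensitivity(cm):.4f}")
```

When class sizes differ, as in this example, the two numbers need not coincide, which is one reason to report both.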