Attention-based multi-scale feature fusion network for myopia grading using optical coherence tomography images

被引:5
作者
Huang, Gengyou [1 ]
Wen, Yang [2 ]
Qian, Bo [1 ]
Bi, Lei [3 ]
Chen, Tingli [4 ]
Sheng, Bin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comupter Sci & Engn, Shanghai, Peoples R China
[2] Shenzhen Univ, Sch Elect & Informat Engn, Shenzhen, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Translat Med, Shanghai, Peoples R China
[4] Huadong Sanat, Wuxi, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Optical coherence tomography (OCT); Myopia grading; Deep Learning; Vision Transformer; Attention fusion; OCT;
D O I
10.1007/s00371-023-03189-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Myopia is a serious threat to eye health and can even cause blindness. It is important to grade myopia and carry out targeted intervention. Nowadays, various studies using deep learning models based on optical coherence tomography (OCT) images to screen for high myopia. However, since regions of interest (ROIs) of pre-myopia and low myopia on OCT images are relatively small, it is rather difficult to use OCT images to conduct detailed myopia grading. There are few studies using OCT images for more detailed myopia grading. To address these problems, we propose a novel attention-based multi-scale feature fusion network named AMFF for myopia grading using OCT images. The proposed AMFF mainly consists of five modules: a pre-trained vision transformer (ViT) module, a multi-scale convolutional module, an attention feature fusion module, an Avg-TopK pooling module and a fully connected (FC) classifier. Firstly, unsupervised pre-training of ViT on the training set can better extract feature maps. Secondly, multi-scale convolutional layers further extract multi-scale feature maps to obtain more receptive fields and extract scale-invariant features. Thirdly, feature maps of different scales are fused through channel attention and spatial attention to further obtain more meaningful features. Lastly, the most prominent features are obtained by the weighted average of the highest activation values of each channel, and then they are used to classify myopia through a fully connected layer. Extensive experiments show that our proposed model has the superior performance compared with other state-of-the-art myopia grading models.
引用
收藏
页码:6627 / 6638
页数:12
相关论文
共 50 条
  • [41] Multi-scale Vertical Cross-layer Feature Aggregation and Attention Fusion Network for Object Detection
    Gao, Wenting
    Li, Xiaojuan
    Han, Yu
    Liu, Yue
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 139 - 150
  • [42] Tool Wear Prediction Based on a Multi-Scale Convolutional Neural Network with Attention Fusion
    Huang, Qingqing
    Wu, Di
    Huang, Hao
    Zhang, Yan
    Han, Yan
    INFORMATION, 2022, 13 (10)
  • [43] Object Detection of Remote Sensing Image Based on Multi-Scale Feature Fusion and Attention Mechanism
    Du, Zuoqiang
    Liang, Yuan
    IEEE ACCESS, 2024, 12 : 8619 - 8632
  • [44] A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion
    Li, Nianfeng
    Wang, Zhenyan
    Huang, Yongyuan
    Tian, Jia
    Li, Xinyuan
    Xiao, Zhiguo
    SENSORS, 2024, 24 (12)
  • [45] Neural Network-Driven Image Hiding Using Multi-Scale Feature Fusion
    Zhang, Xiaomei
    Zhou, Wei
    Jiang, Yunxiao
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025,
  • [46] CovTiNet: Covid text identification network using attention-based positional embedding feature fusion
    Md. Rajib Hossain
    Mohammed Moshiul Hoque
    Nazmul Siddique
    Iqbal H. Sarker
    Neural Computing and Applications, 2023, 35 : 13503 - 13527
  • [47] CovTiNet: Covid text identification network using attention-based positional embedding feature fusion
    Hossain, Md. Rajib
    Hoque, Mohammed Moshiul
    Siddique, Nazmul
    Sarker, Iqbal H. H.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (18) : 13503 - 13527
  • [48] MADC: Multi-scale Attention-based Deep Clustering for Workload Prediction
    Huang, Jiaming
    Xiao, Chuming
    Wu, Weigang
    Yin, Ye
    Chang, Hongli
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 316 - 323
  • [49] Remaining useful life prediction based on parallel multi-scale feature fusion network
    Yin, Yuyan
    Tian, Jie
    Liu, Xinfeng
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 36 (5) : 3111 - 3127
  • [50] FMA-Net: Fusion of Multi-Scale Attention for Grading Cervical Precancerous Lesions
    Duan, Zhuoran
    Xu, Chao
    Li, Zhengping
    Feng, Bo
    Nie, Chao
    MATHEMATICS, 2024, 12 (07)