ODFormer: Semantic fundus image segmentation using Transformer for optic nerve head detection

被引:0
作者
Wang, Jiayi [1 ,2 ,3 ,4 ]
Mao, Yi-An [5 ]
Ma, Xiaoyu [6 ]
Guo, Sicen [1 ,2 ,3 ,4 ]
Shao, Yuting [6 ]
Lv, Xiao [6 ]
Han, Wenting [6 ]
Christopher, Mark [7 ,8 ]
Zangwill, Linda M. [7 ,8 ]
Bi, Yanlong [6 ,9 ]
Fan, Rui [1 ,2 ,3 ,4 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Dept Control Sci & Engn, MIAS Grp, Shanghai 201804, Peoples R China
[2] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[3] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai 201210, Peoples R China
[4] Tongji Univ, Frontiers Sci Ctr Intelligent Autonomous Syst, Shanghai 201210, Peoples R China
[5] Shanghai Univ, Sch Life Sci, 99 Shangda Ave, Shanghai 200444, Peoples R China
[6] Tongji Univ, Tongji Hosp, Sch Med, Dept Ophthalmol, Shanghai 200065, Peoples R China
[7] Univ Calif San Diego, Hamilton Glaucoma Ctr, La Jolla, CA 92037 USA
[8] Univ Calif San Diego, Shiley Eye Inst, Viterbi Family Dept Ophthalmol, Div Ophthalmol Informat & Data Sci, La Jolla, CA 92037 USA
[9] Tongji Univ, Tongji Eye Inst, Sch Med, Shanghai 200065, Peoples R China
基金
中国国家自然科学基金;
关键词
Optic nerve head; Fundus image; Semantic segmentation; Convolutional neural network; Transformer; FOVEA;
D O I
10.1016/j.inffus.2024.102533
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optic nerve head (ONH) detection has been a crucial area of study in ophthalmology for years. However, the significant discrepancy between fundus image datasets, each generated using a single type of fundus camera, poses challenges to the generalizability of ONH detection approaches developed based on semantic segmentation networks. Despite the numerous recent advancements in general-purpose semantic segmentation methods using convolutional neural networks (CNNs) and Transformers, there is currently a lack of benchmarks for these state -of -the -art (SoTA) networks specifically trained for ONH detection. Therefore, in this article, we make contributions from three key aspects: network design, the publication of a dataset, and the establishment of a comprehensive benchmark. Our newly developed ONH detection network, referred to as ODFormer, is based upon the Swin Transformer architecture and incorporates two novel components: a multi-scale context aggregator and a lightweight bidirectional feature recalibrator. Our published large-scale dataset, known as TongjiU-DROD, provides multi -resolution fundus images for each participant, captured using two distinct types of cameras. Our established benchmark involves three datasets: DRIONS-DB, DRISHTI-GS1, and TongjiU-DROD, created by researchers from different countries and containing fundus images captured from participants of diverse races and ages. Extensive experimental results demonstrate that our proposed ODFormer outperforms other state -of -the -art (SoTA) networks in terms of performance and generalizability. Our dataset and source code are publicly available at https://mias.group/ODFormer.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Cost-effective instrumentation for quantitative depth measurement of optic nerve head using stereo fundus image pair and image cross correlation techniques
    Vieira de Carvalho, Luis Alberto
    Carvalho, Valeria
    OPHTHALMIC TECHNOLOGIES XXIV, 2014, 8930
  • [22] Automated Segmentation of Optic Nerve Head for the Topological Assessment
    Ko, M. W. L.
    Leung, C. K. S.
    Yuen, T. Y. P.
    2016 GLOBAL MEDICAL ENGINEERING PHYSICS EXCHANGES/PAN AMERICAN HEALTH CARE EXCHANGES (GMEPE/PAHCE), 2016,
  • [23] Automated segmentation of the optic nerve head for diagnosis of glaucoma
    Chrástek, R
    Wolf, M
    Donath, K
    Niemann, H
    Paulus, D
    Hothorn, T
    Lausen, B
    Lämmer, R
    Mardin, CY
    Michelson, G
    MEDICAL IMAGE ANALYSIS, 2005, 9 (04) : 297 - 314
  • [24] Automated Detection of Optic Disc for the Analysis of Retina Using Color Fundus Image
    Zubair, Muhammad
    Yamin, Abubakar
    Khan, Shoab A.
    2013 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2013), 2013, : 239 - 242
  • [25] MMT: Mixed-Mask Transformer for Remote Sensing Image Semantic Segmentation
    Xu, Zhe
    Geng, Jie
    Jiang, Wen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [26] Hybrid Attention Fusion Embedded in Transformer for Remote Sensing Image Semantic Segmentation
    Chen, Yan
    Dong, Quan
    Wang, Xiaofeng
    Zhang, Qianchuan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4421 - 4435
  • [27] Cross-scale sampling transformer for semantic image segmentation
    Ma, Yizhe
    Yu, Long
    Lin, Fangjian
    Tian, Shengwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2895 - 2907
  • [28] CNN and Transformer Fusion for Remote Sensing Image Semantic Segmentation
    Chen, Xin
    Li, Dongfen
    Liu, Mingzhe
    Jia, Jiaru
    REMOTE SENSING, 2023, 15 (18)
  • [29] Detection of glaucomatous optic nerve head by using Heidelberg topographic maps
    Iester, M.
    Zanini, M.
    Vittone, P.
    Calabria, G.
    EYE, 2007, 21 (05) : 609 - 613
  • [30] Detection of glaucomatous optic nerve head by using Heidelberg topographic maps
    M Iester
    M Zanini
    P Vittone
    G Calabria
    Eye, 2007, 21 : 609 - 613