A Transformer-Based Knowledge Distillation Network for Cortical Cataract Grading

被引:1
|
作者
Wang, Jinhong [1 ,2 ]
Xu, Zhe [3 ]
Zheng, Wenhao [1 ,2 ]
Ying, Haochao [4 ]
Chen, Tingting [1 ,2 ]
Liu, Zuozhu [5 ]
Chen, Danny Z. [6 ]
Yao, Ke [3 ]
Wu, Jian [7 ,8 ]
机构
[1] Zhejiang Univ, Affiliated Hosp 2, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Hangzhou 310027, Peoples R China
[3] Zhejiang Univ, Affiliated Hosp 2, Eye Ctr, Sch Med, Hangzhou 310009, Zhejiang, Peoples R China
[4] Zhejiang Univ, Sch Publ Hlth, Hangzhou 310058, Peoples R China
[5] Zhejiang Univ, ZJU UIUC Inst, Res & Dev Ctr Intelligent Healthcare, ZJU Angelalign Inc, Haining 310058, Peoples R China
[6] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[7] Zhejiang Univ, Affiliated Hosp 2, Sch Med, Sch Publ Hlth, Hangzhou 310058, Peoples R China
[8] Zhejiang Univ, Inst Wenzhou, Hangzhou 310058, Peoples R China
关键词
Cataracts; Transformers; Annotations; Feature extraction; Image edge detection; Fuses; Knowledge engineering; Cataract grading; knowledge distillation; transformer; medical imaging classification; CLASSIFICATION; IMAGES;
D O I
10.1109/TMI.2023.3327274
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cortical cataract, a common type of cataract, is particularly difficult to be diagnosed automatically due to the complex features of the lesions. Recently, many methods based on edge detection or deep learning were proposed for automatic cataract grading. However, these methods suffer a large performance drop in cortical cataract grading due to the more complex cortical opacities and uncertain data. In this paper, we propose a novel Transformer-based Knowledge Distillation Network, called TKD-Net, for cortical cataract grading. To tackle the complex opacity problem, we first devise a zone decomposition strategy to extract more refined features and introduce special sub-scores to consider critical factors of clinical cortical opacity assessment (location, area, density) for comprehensive quantification. Next, we develop a multi-modal mix-attention Transformer to efficiently fuse sub-scores and image modality for complex feature learning. However, obtaining the sub-score modality is a challenge in the clinic, which could cause the modality missing problem instead. To simultaneously alleviate the issues of modality missing and uncertain data, we further design a Transformer-based knowledge distillation method, which uses a teacher model with perfect data to guide a student model with modality-missing and uncertain data. We conduct extensive experiments on a dataset of commonly-used slit-lamp images annotated by the LOCS III grading system to demonstrate that our TKD-Net outperforms state-of-the-art methods, as well as the effectiveness of its key components.
引用
收藏
页码:1089 / 1101
页数:13
相关论文
共 50 条
  • [21] Fusformer: A Transformer-Based Fusion Network for Hyperspectral Image Super-Resolution
    Hu, Jin-Fan
    Huang, Ting-Zhu
    Deng, Liang-Jian
    Dou, Hong-Xia
    Hong, Danfeng
    Vivone, Gemine
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [22] PARFormer: Transformer-Based Multi-Task Network for Pedestrian Attribute Recognition
    Fan, Xinwen
    Zhang, Yukang
    Lu, Yang
    Wang, Hanzi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 411 - 423
  • [23] VulExplainer: A Transformer-Based Hierarchical Distillation for Explaining Vulnerability Types
    Fu, Michael
    Nguyen, Van
    Tantithamthavorn, Chakkrit
    Le, Trung
    Phung, Dinh
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (10) : 4550 - 4565
  • [24] ClST: A Convolutional Transformer Framework for Automatic Modulation Recognition by Knowledge Distillation
    Hou, Dongbin
    Li, Lixin
    Lin, Wensheng
    Liang, Junli
    Han, Zhu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (07) : 8013 - 8028
  • [25] Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher-student models
    Kolla, Bhavannarayanna
    Venugopal, P.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):
  • [26] Transformer-Based Feature Aggregation and Stitching Network for Crowd Counting
    Wang, Kehao
    Wang, Yuhui
    Ren, Ruiqi
    Zou, Han
    Shao, Zhichao
    IEEE ACCESS, 2023, 11 : 124833 - 124844
  • [27] Transformer-Based Regression Network for Pansharpening Remote Sensing Images
    Su, Xunyang
    Li, Jinjiang
    Hua, Zhen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [28] A transformer-based adversarial network framework for steganography
    Xiao, Chaoen
    Peng, Sirui
    Zhang, Lei
    Wang, Jianxin
    Ding, Ding
    Zhang, Jianyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [29] Knowledge Distillation SegFormer-Based Network for RGB-T Semantic Segmentation
    Zhou, Wujie
    Gong, Tingting
    Yan, Weiqing
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03): : 2170 - 2182
  • [30] Transformer-based Point Cloud Generation Network
    Xu, Rui
    Hui, Le
    Han, Yuehui
    Qian, Jianjun
    Xie, Jin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4169 - 4177