Incomplete Multimodal Learning for Visual Acuity Prediction After Cataract Surgery Using Masked Self-Attention

被引:3
|
作者
Zhou, Qian [1 ]
Zou, Hua [1 ]
Jiang, Haifeng [2 ]
Wang, Yong [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Aier Eye Hosp, Wuhan, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII | 2023年 / 14226卷
关键词
Incomplete Multimodal Learning; Visual Acuity; Prediction; Self-Attention;
D O I
10.1007/978-3-031-43990-2_69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the primary treatment option for cataracts, it is estimated that millions of cataract surgeries are performed each year globally. Predicting the Best Corrected Visual Acuity (BCVA) in cataract patients is crucial before surgeries to avoid medical disputes. However, accurate prediction remains a challenge in clinical practice. Traditional methods based on patient characteristics and surgical parameters have limited accuracy and often underestimate postoperative visual acuity. In this paper, we propose a novel framework for predicting visual acuity after cataract surgery using masked self-attention. Especially different from existing methods, which are based on monomodal data, our proposed method takes preoperative images and patient demographic data as input to leverage multimodal information. Furthermore, we expand our method to a more complex and challenging clinical scenario, i.e., the incomplete multimodal data. Firstly, we apply efficient Transformers to extract modality-specific features. Then, an attentional fusion network is utilized to fuse the multimodal information. To address the modality-missing problem, an attention mask mechanism is proposed to improve the robustness. We evaluate our method on a collected dataset of 1960 patients who underwent cataract surgery and compare its performance with other state-of-the-art approaches. The results show that our proposed method outperforms other methods and achieves a mean absolute error of 0.122 logMAR. The percentages of the prediction errors within +/- 0.10 logMAR are 94.3%. Besides, extensive experiments are conducted to investigate the effectiveness of each component in predicting visual acuity. Codes will be available at https://github.com/liyiersan/MSA.
引用
收藏
页码:735 / 744
页数:10
相关论文
共 50 条
  • [21] A deep learning sequence model based on self-attention and convolution for wind power prediction
    Liu, Chien-Liang
    Chang, Tzu-Yu
    Yang, Jie-Si
    Huang, Kai-Bin
    RENEWABLE ENERGY, 2023, 219
  • [22] SELF-ATTENTION BASED MODEL FOR PUNCTUATION PREDICTION USING WORD AND SPEECH EMBEDDINGS
    Yi, Jiangyan
    Tao, Jianhua
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7270 - 7274
  • [23] Attention-Enhanced Guided Multimodal and Semi-Supervised Networks for Visual Acuity (VA) Prediction after Anti-VEGF Therapy
    Wang, Yizhen
    Wang, Yaqi
    Liu, Xianwen
    Cui, Weiwei
    Jin, Peng
    Cheng, Yuxia
    Jia, Gangyong
    ELECTRONICS, 2024, 13 (18)
  • [24] Feature pyramid self-attention network for respiratory motion prediction in ultrasound image guided surgery
    Chen Yao
    Jishuai He
    Hui Che
    Yibin Huang
    Jian Wu
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2349 - 2356
  • [25] Feature pyramid self-attention network for respiratory motion prediction in ultrasound image guided surgery
    Yao, Chen
    He, Jishuai
    Che, Hui
    Huang, Yibin
    Wu, Jian
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (12) : 2349 - 2356
  • [26] Intra-Modality Feature Interaction Using Self-attention for Visual Question Answering
    Shao, Huan
    Xu, Yunlong
    Ji, Yi
    Yang, Jianyu
    Liu, Chunping
    NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 215 - 222
  • [27] Reconstructing computational spectra using deep learning's self-attention method
    Wu, Hao
    Wu, Hui
    Su, Xinyu
    Wu, Jingjun
    Liu, Shuangli
    OPTICA APPLICATA, 2024, 54 (03) : 383 - 394
  • [28] Prediction of postoperative visual acuity recovery after surgery for epimacular membrane with Lotmar interferometry
    Bovey, EH
    KLINISCHE MONATSBLATTER FUR AUGENHEILKUNDE, 2003, 220 (03) : 131 - 133
  • [29] Prognosis of visual acuity and complications after cataract surgery with primary bag-fixated IOL implantation in children
    Stahl, E
    Zubcov, AA
    Schnaudigel, OE
    Fries, U
    Ohrloff, C
    Stark, N
    OPHTHALMOLOGE, 1998, 95 (02): : 88 - 91
  • [30] Predictive Value of Excellent Uncorrected Visual Acuity Post-Operative Day One After Cataract Surgery
    Young, Jonathan W.
    Law, Nathan W.
    Tu, Daniel C.
    CLINICAL OPHTHALMOLOGY, 2020, 14 : 2777 - 2782